Scripted Monologue
German Speech Data - Scripted Monologue
Generic

$150.00
Version Number
01
Published date
Sep 1, 2021
Audio demo
Audio clips from the dataset that you can listen to.
How can I get the dataset?
After clicking the button and filling out the form, we will contact you to discuss the details.
Not what you're looking for?
We can collect a customized dataset according to your precise needs.
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files
This dataset contains 80 hours of German Scripted Monologue data, recorded from speakers in Germany.
Seller Name
Defined.ai
Dataset details
About
Domain | Generic |
Total recordings | 62115 |
File size | 8.7GB |
Hours | 80 |
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced. | 0.4% |
Total prompts | 62115 |
Unique prompts | 62115 |
Average amount of recordings per speaker | 276.07 |
Demographic
Locale The language(s) and country(s) applicable to the speakers in the dataset. | de-de |
Language | German |
Country | Germany |
Female | Male | Unspecified View on chart | 51% | 49% | 0% |
Age View on chart | 17-60 |
Accent(s) View on chart | Baden-Württemberg, Bavaria, Berlin, Brandenburg, Bremen, Hamburg, Hesse, Lower Saxony, Mecklenburg-Western Pomerania, North Rhine-Westphalia, Other, Rhineland-Palatinate, Saarland, Saxony, Saxony-Anhalt, Schleswig-Holstein |
Audio Details
Words | 439617 |
Recording environment | silent |
Audio format | WAV |
Bits per sample | 16 |
Device type | mobile |
Communication band | broadband |
Sample rate | 16kHz |
Details on charts
Phonetic Distribution

Age Distribution

Gender distribution

Accent Distribution
