Scripted Monologue
German Speech Data - Scripted Monologue
Telecommunication

$150.00
Version Number
01
Published date
Sep 1, 2021
Audio demo
Audio clips from the dataset that you can listen to.
How can I get the dataset?
After clicking the button and filling out the form, we will contact you to discuss the details.
Not what you're looking for?
We can collect a customized dataset according to your precise needs.
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files
This dataset contains 34 hours of German Scripted Monologue data, recorded from speakers in Germany.
Seller Name
Defined.ai
Dataset details
About
Domain | Telecommunication |
Total recordings | 21759 |
File size | 3.66GB |
Hours | 34 |
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced. | 0.6% |
Total prompts | 21759 |
Unique prompts | 8473 |
Average amount of recordings per speaker | 32.48 |
Demographic
Locale The language(s) and country(s) applicable to the speakers in the dataset. | de-de |
Language | German |
Country | Germany |
Female | Male | Unspecified View on chart | 50% | 50% | 0% |
Age View on chart | 18-77 |
Accent(s) View on chart | Alemannisch (Stuttgart, Ulm), Bayerisch (München, Nürnberg), Ostmitteldeutsch (Dresden, Leipzig, Berlin), Ostniederdeutsch (Rostock, Schwerin), Westmitteldeutsch (Köln, Bonn, Frankfurt), Westniederdeutsch (Hamburg, Bremen, Hanover) |
Audio Details
Words | 240496 |
Recording environment | noisy, silent |
Audio format | WAV |
Bits per sample | 16 |
Device type | mobile |
Communication band | broadband |
Sample rate | 16kHz |
Details on charts
Phonetic Distribution

Age Distribution

Gender distribution

Accent Distribution
