German - Germany

German Speech Data - Scripted Monologue - 80h

Generic

Dataset Details

About

Domain
Generic
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
62115
Hours
80
Word error rate (%): share of words omitted, inserted, or wrongly substituted when the text transcript is compared with a perfect transcript of the audio
0.4%
Total prompts
62115
Unique prompts
62115
Average number of recordings per speaker
276.07
License Type Link
Published date
Sep 1, 2021
File size
8.7 GB
Packaging description
A zip file containing metadata files in TSV format and a folder with all the audio files
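
The word error rate listed above counts omitted, inserted, and wrongly substituted words. A minimal sketch of that calculation follows, assuming whitespace tokenization and a standard word-level edit distance; the function name word_error_rate is hypothetical, and this is not necessarily the procedure used to obtain the 0.4% figure.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")
    # prev[j] holds the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word omitted
                curr[j - 1] + 1,     # extra word inserted
                prev[j - 1] + cost,  # match or wrong substitution
            ))
        prev = curr
    return prev[-1] / len(ref)
```

For example, comparing the reference "das ist ein test" with the hypothesis "das ist test" gives one omission over four reference words, a rate of 0.25 (25%).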

Demographic

Number of speakers
225
Locale: the language(s) and country or countries applicable to the speakers in the dataset
de-de
Language
German
Country
Germany
Female | Male | Unspecified
51% | 49% | 0%
Age range
17-60
Accent(s)
Baden-Württemberg, Bavaria, Berlin, Brandenburg, Bremen, Hamburg, Hesse, Lower Saxony, Mecklenburg-Western Pomerania, North Rhine-Westphalia, Other, Rhineland-Palatinate, Saarland, Saxony, Saxony-Anhalt, Schleswig-Holstein

Audio Details

Words
439617
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16 kHz
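
The audio values above (WAV, 16 bits per sample, 16 kHz) can be checked against a file's header once the archive is unpacked. The sketch below uses Python's standard wave module; the function name check_wav and the two constants are hypothetical, and the layout of the audio folder is not specified here, so the caller supplies the file path.

```python
import wave

EXPECTED_SAMPLE_RATE = 16_000  # 16 kHz, as listed under "Sample rate"
EXPECTED_SAMPLE_WIDTH = 2      # 16 bits per sample = 2 bytes

def check_wav(source):
    """Return a list of mismatches between a WAV header and the listed values.

    `source` is a file path or a binary file-like object.
    An empty list means the header matches both values.
    """
    problems = []
    with wave.open(source, "rb") as wav:
        if wav.getframerate() != EXPECTED_SAMPLE_RATE:
            problems.append(f"sample rate {wav.getframerate()} Hz")
        if wav.getsampwidth() != EXPECTED_SAMPLE_WIDTH:
            problems.append(f"{8 * wav.getsampwidth()} bits per sample")
    return problems
```

Calling check_wav on each extracted audio file and collecting the non-empty results gives a quick view of any recordings that deviate from the listed format.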
