English - Global
English Speech Data - Scripted Monologue - 269h
Generic
Dataset Details
About
Domain
Generic
Total recordings
87140
File size
29.0GB
Hours
269
Word error rate (%) - a measure of how far the text transcript of the audio deviates from a perfect transcript, counting words omitted, inserted, or wrongly replaced.
2.6%
Total prompts
87140
Unique prompts
700
Average amount of recordings per speaker
99.14
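The word error rate described above is conventionally computed as the word-level edit distance between a reference transcript and the transcript being scored, divided by the number of reference words. The following is a minimal sketch of that conventional calculation, not the vendor's actual scoring procedure; the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Hypothetical helper: tokenization is plain whitespace splitting,
    with no case folding or punctuation normalization.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word omitted
                curr[j - 1] + 1,     # extra word inserted
                prev[j - 1] + cost,  # word replaced (or matched)
            ))
        prev = curr
    return prev[-1] / len(ref)
```

For example, scoring "the cat sat on mat" against the reference "the cat sat on the mat" counts one omitted word out of six reference words.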
Demographic
Locale - the language(s) and country or countries of the speakers in the dataset.
en-xx
Language
English
Country
Global
Female | Male | Unspecified
48% | 52% | 0%
Age
18-63
Accent(s)
Alberta, Andhra Pradesh, Angus, Arizona, Assam, Barnsley, Bavaria, Berlin, Bexley, Bihar, Bradford, British Columbia, California, Delhi, Durham County, City of Edinburgh, Europe, Glasgow City, Goa, Gujarat, Haryana, Hawaii, Hesse, Iowa, Jharkhand, Karnataka, Kerala, City of London, Lower Saxony, Madhya Pradesh, Maharashtra, Middlesbrough, New Jersey, New Mexico, New South Wales, New York, North Carolina, Ontario, Pennsylvania, Portsmouth, Punjab, Rajasthan, Surrey, Tamil Nadu, Telangana, Texas, Ticino, Utah, Uttar Pradesh, Washington, West Bengal, World
Audio Details
Words
1297971
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz
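The figures listed above can be cross-checked with simple arithmetic. The sketch below estimates the raw PCM payload from the sample rate, bit depth, and total hours, and derives a few per-recording averages. It assumes single-channel (mono) audio and ignores WAV header overhead; neither is stated in the listing. The constant names and the `derived_stats` function are illustrative.

```python
SAMPLE_RATE_HZ = 16_000          # "Sample rate: 16kHz"
BYTES_PER_SAMPLE = 16 // 8       # "Bits per sample: 16"
CHANNELS = 1                     # assumption: mono, not stated in the listing
HOURS = 269
RECORDINGS = 87_140
WORDS = 1_297_971
RECORDINGS_PER_SPEAKER = 99.14


def derived_stats() -> dict:
    """Derive size and per-recording averages from the listed figures."""
    seconds = HOURS * 3600
    pcm_bytes = seconds * SAMPLE_RATE_HZ * BYTES_PER_SAMPLE * CHANNELS
    return {
        "pcm_gib": pcm_bytes / 2**30,   # binary gigabytes
        "pcm_gb": pcm_bytes / 10**9,    # decimal gigabytes
        "avg_seconds_per_recording": seconds / RECORDINGS,
        "avg_words_per_recording": WORDS / RECORDINGS,
        "approx_speakers": RECORDINGS / RECORDINGS_PER_SPEAKER,
    }


if __name__ == "__main__":
    for name, value in derived_stats().items():
        print(f"{name}: {value:.2f}")
```

Under these assumptions the raw audio comes to roughly 28.9 GiB (about 31.0 GB in decimal units), which is close to the listed 29.0GB if that figure is in binary units. The same figures imply recordings of about 11 seconds and 15 words on average, and roughly 879 speakers.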
Chart details
Phonetic Distribution
Age Distribution
Gender Distribution
Accent Distribution