English - Arabic
English Speech Data - Scripted Monologue - 50h
Generic
Audio demo
Dataset Details
About
Domain
Generic
Total recordings
35629
File size
5.43GB
Hours
50
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
5.3%
Total prompts
35629
Unique prompts
8620
Average amount of recordings per speaker
101.22
Demographic
Locale The language(s) and country(s) applicable to the speakers in the dataset.
en-ar
Language
English
Country
Arabic
Female | Male | Unspecified View on chart
43% | 57% | 0%
Age View on chart
18-57
Accent(s) View on chart
Agadir-Ida-Ou-Tanane, Al Bāḩah, Al Ḩudūd ash Shamālīyah, Al Jawf, Al Madinah al Munawwarah, Al Qasim, Al ‘Aqabah, Alexandria, Alger, Ar Riyāḑ, Ash Sharqiyah, Asir, Beyrouth, Cairo, Casablanca [Dar el Beïda], Damietta, Fès, Giza, Ḩamāh, Irbid, Ismailia, Jazan, Jerusalem, Larache, Liban-Sud, Makkah al Mukarramah, Marrakech, Midelt, Minya, Mont-Liban, Monufia, M’diq-Fnideq, Najran, Oujda-Angad, Rabat, Salé, Tabuk, Taza
Audio Details
Words
234873
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz
Chart details
Phonetic Distribution

Age Distribution

Gender distribution

Accent Distribution
