English - Arabic

English Speech Data - Scripted Monologue - 50h

Generic

Dataset Details

About

Domain
Generic
Total recordings
35629
File size
5.43 GB
Hours
50
Word error rate (share of transcript words that are omitted, inserted, or wrongly replaced, measured against a perfect transcript of the audio)
5.3%
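For readers who want to reproduce a figure like this on their own transcripts, here is a minimal sketch of the conventional word error rate calculation: word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The page does not say how the vendor computes its 5.3%, so the function name `word_error_rate` and the whitespace tokenization are assumptions made for illustration only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length.

    Assumption: both strings are already normalized (case, punctuation)
    and are tokenized by whitespace; the vendor's procedure may differ.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (before the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1        # reference word missing from hypothesis
            insertion = curr[j - 1] + 1   # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


if __name__ == "__main__":
    # One word dropped out of six reference words.
    print(word_error_rate("the cat sat on the mat", "the cat sat on mat"))
```

Because insertions are counted, the value can exceed 1.0 when the hypothesis is much longer than the reference.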
Total prompts
35629
Unique prompts
8620
Average amount of recordings per speaker
101.22
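The figures above imply a few quantities the page does not list directly. The sketch below derives them with plain arithmetic; the variable names are illustrative, and the results are estimates because the stated hours and per-speaker average are rounded.

```python
# Values copied from the listing above.
TOTAL_RECORDINGS = 35_629
UNIQUE_PROMPTS = 8_620
HOURS = 50                          # nominal duration, likely rounded
AVG_RECORDINGS_PER_SPEAKER = 101.22

# Implied number of speakers (not stated on the page).
estimated_speakers = round(TOTAL_RECORDINGS / AVG_RECORDINGS_PER_SPEAKER)

# Mean clip length in seconds, assuming the 50 h total is accurate.
avg_clip_seconds = HOURS * 3600 / TOTAL_RECORDINGS

# How many times each unique prompt was recorded, on average.
recordings_per_unique_prompt = TOTAL_RECORDINGS / UNIQUE_PROMPTS

print(f"estimated speakers:           {estimated_speakers}")
print(f"average clip length (s):      {avg_clip_seconds:.2f}")
print(f"recordings per unique prompt: {recordings_per_unique_prompt:.2f}")
```

Under these assumptions the dataset works out to roughly 352 speakers, clips of about 5 seconds, and each unique prompt being read a little over four times.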

Demographic

Locale (the languages and countries applicable to the speakers in the dataset)
en-ar
Language
English
Country
Multiple Arabic-speaking countries
Female | Male | Unspecified
43% | 57% | 0%
Age range
18-57
Accent(s)
Agadir-Ida-Ou-Tanane, Al Bāḩah, Al Ḩudūd ash Shamālīyah, Al Jawf, Al Madinah al Munawwarah, Al Qasim, Al ‘Aqabah, Alexandria, Alger, Ar Riyāḑ, Ash Sharqiyah, Asir, Beyrouth, Cairo, Casablanca [Dar el Beïda], Damietta, Fès, Giza, Ḩamāh, Irbid, Ismailia, Jazan, Jerusalem, Larache, Liban-Sud, Makkah al Mukarramah, Marrakech, Midelt, Minya, Mont-Liban, Monufia, M’diq-Fnideq, Najran, Oujda-Angad, Rabat, Salé, Tabuk, Taza

Audio Details

Words
234873
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16 kHz
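After downloading, the stated audio properties can be checked per file with Python's standard `wave` module. This is a minimal sketch: the function names `check_wav` and `pcm_bytes` are hypothetical, and the channel count is left unchecked because the listing does not state it.

```python
import wave

# Values taken from the listing above.
EXPECTED_SAMPLE_RATE_HZ = 16_000
EXPECTED_BITS_PER_SAMPLE = 16


def check_wav(path: str):
    """Return (problems, duration_seconds) for a single WAV file."""
    problems = []
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        bits = wav.getsampwidth() * 8  # getsampwidth() is in bytes
        if rate != EXPECTED_SAMPLE_RATE_HZ:
            problems.append(f"sample rate is {rate} Hz")
        if bits != EXPECTED_BITS_PER_SAMPLE:
            problems.append(f"bit depth is {bits}")
        duration_seconds = wav.getnframes() / rate
    return problems, duration_seconds


def pcm_bytes(hours: float, rate_hz: int, bits: int, channels: int) -> int:
    """Raw PCM payload size in bytes, ignoring WAV headers."""
    return int(hours * 3600 * rate_hz * (bits // 8) * channels)


if __name__ == "__main__":
    # Assumption: mono audio. 50 h at 16 kHz / 16-bit is 5.76e9 bytes.
    print(pcm_bytes(50, EXPECTED_SAMPLE_RATE_HZ, EXPECTED_BITS_PER_SAMPLE, 1))
```

If the recordings are mono, 50 hours at these settings comes to 5.76 billion bytes of raw samples (about 5.36 GiB), which is in the same range as the listed file size of 5.43GB. Summing the durations returned by `check_wav` over all files gives a direct check of the stated 50 hours.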
