Arabic - Yemen

Arabic Speech Data - Scripted Monologue - 47h

Retail
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Retail
Total recordings
33947
File size
8.67GB
Hours
47
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
0%
Total prompts
33947
Unique prompts
20830
Average amount of recordings per speaker
283.61

Demographic

Locale The language(s) and country(s) applicable to the speakers in the dataset.
ar-ye
Language
Arabic
Country
Yemen
Female | Male | Unspecified View on chart
46% | 54% | 0%
19-41
Accent(s) View on chart
Abyan, Ad Dali, Adan, Al Hudaydah, Al Jawf, Al Mahrah, Al Mahwit, Amanat al Asimah, Amran, Dhamar, Ibb, Lahij, Other, Raymah, Sana, Shabwah, Taizz

Audio Details

Words
220402
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz

Chart details