Arabic - Yemen
Arabic Speech Data - Scripted Monologue - 46h
Telecommunication
Audio demo
Dataset Details
About
Domain
Telecommunication
Total recordings
33778
File size
5.08GB
Hours
46
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
0%
Total prompts
33778
Unique prompts
20442
Average amount of recordings per speaker
12471.0
Demographic
Locale The language(s) and country(s) applicable to the speakers in the dataset.
ar-ye
Language
Arabic
Country
Yemen
Female | Male | Unspecified View on chart
44% | 56% | 0%
Age View on chart
19-41
Accent(s) View on chart
Abyan, Ad Dali, Adan, Al Hudaydah, Al Jawf, Al Mahrah, Al Mahwit, Amanat al Asimah, Amran, Dhamar, Ibb, Lahij, Other, Raymah, Sana, Shabwah, Taizz
Audio Details
Words
218318
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz
Chart details
Age Distribution

Gender distribution

Accent Distribution
