Hindi - India

78 hours of Hindi Spontaneous IVR

Insurance

Dataset Details

About

Domain
Insurance
Use case(s)
call centre, conversational AI, IVR
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
7200
Hours
78
Word error rate (%) (a measure of errors in the text transcript relative to a perfect transcript of the audio, counting words omitted, inserted, or wrongly replaced)
0%
Average amount of recordings per speaker
66.36
License Type Link
Published date
Mar 20, 2022
File size
8.43 GB
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files
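
The packaging described above (a zip with TSV metadata files and a folder of audio) could be inspected along these lines. This is a minimal sketch, not the vendor's tooling: the function names are hypothetical, and the listing does not state the TSV file names, column names, or encoding, so the code reads every `.tsv` member generically and assumes UTF-8 with a header row.

```python
import csv
import io
import zipfile


def read_tsv_metadata(archive):
    """Return {member_name: list of row dicts} for every .tsv member of the zip.

    `archive` may be a path or a binary file-like object.
    Assumes UTF-8 text and a header row (not stated in the listing).
    """
    tables = {}
    with zipfile.ZipFile(archive) as zf:
        for name in zf.namelist():
            if not name.lower().endswith(".tsv"):
                continue
            with zf.open(name) as raw:
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                tables[name] = list(csv.DictReader(text, delimiter="\t"))
    return tables


def list_audio_members(archive):
    """Return the names of all .wav members of the zip."""
    with zipfile.ZipFile(archive) as zf:
        return [n for n in zf.namelist() if n.lower().endswith(".wav")]
```

Printing the keys of the first row of each table is a quick way to discover the actual column names before writing any further processing code.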

Demographic

Number of speakers
217
Locale (the language(s) and country or countries applicable to the speakers in the dataset)
hi-in
Language
Hindi
Country
India
Female | Male | Unspecified
54% | 46% | 0%
Age range
18-56
Accent(s)
Andhra Pradesh, Arunachal Pradesh, Bihar, Chandigarh, Chhattisgarh, Delhi, Gujarat, Haryana, Himachal Pradesh, Karnataka, Kerala, Madhya Pradesh, Maharashtra, Odisha, Punjab, Rajasthan, Tamil Nadu, Telangana, Uttar Pradesh, Uttarakhand, West Bengal
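
As a quick consistency check on the headline figures, the sketch below relates the total recordings (7200), hours (78), and number of speakers (217) to the stated average of 66.36 recordings per speaker. It assumes each recording is a two-party call and therefore counts once for each of two speakers; the listing does not say this explicitly, so treat that factor as an assumption.

```python
TOTAL_RECORDINGS = 7200
TOTAL_HOURS = 78
NUM_SPEAKERS = 217
SPEAKERS_PER_RECORDING = 2  # assumption: two parties per call, not stated in the listing


def recordings_per_speaker(recordings, speakers, speakers_per_recording):
    """Average number of recordings each speaker takes part in."""
    return recordings * speakers_per_recording / speakers


def mean_recording_seconds(hours, recordings):
    """Average duration of one recording, in seconds."""
    return hours * 3600 / recordings


avg = recordings_per_speaker(TOTAL_RECORDINGS, NUM_SPEAKERS, SPEAKERS_PER_RECORDING)
print(round(avg, 2))                                          # 66.36
print(mean_recording_seconds(TOTAL_HOURS, TOTAL_RECORDINGS))  # 39.0
```

Under that assumption the computed average matches the listed value, and the mean recording length works out to about 39 seconds.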

Audio Details

Words
811618
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
8
Device type
mobile
Communication band
narrowband
Sample rate
8 kHz
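
The audio properties listed above (WAV, 8 bits per sample, 8 kHz) can be checked per file with Python's standard `wave` module. This is a sketch under one important assumption: the files are uncompressed PCM. The listing does not state the codec, and narrowband telephony audio is sometimes stored as A-law or mu-law, which `wave` cannot read (it raises `wave.Error`). The function name `check_wav_format` is hypothetical.

```python
import wave


def check_wav_format(source, sample_rate=8000, bits_per_sample=8):
    """Read a WAV header and compare it with the expected format.

    `source` is a file name or a binary file-like object.
    Returns (ok, info), where info holds the values found in the header.
    Assumes uncompressed PCM; other encodings raise wave.Error.
    """
    with wave.open(source, "rb") as wav:
        info = {
            "sample_rate": wav.getframerate(),
            "bits_per_sample": wav.getsampwidth() * 8,
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / wav.getframerate(),
        }
    ok = (info["sample_rate"] == sample_rate
          and info["bits_per_sample"] == bits_per_sample)
    return ok, info
```

Summing `duration_s` over all files gives a total that can be compared with the 78 hours stated for the dataset.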
