Chinese Simplified - PRC

Chinese Speech Data - Spontaneous IVR - 12h

Banking
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Banking
Use case(s)
call centre, conversational AI, IVR
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
1333
Hours
12
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
0%
Average amount of recordings per speaker
46.77
License Type Link
Published date
Dec 19, 2021
File size
0.67GB
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files

Demographic

Number of speakers
57.000000
Locale The language(s) and country(s) applicable to the speakers in the dataset.
zh-cn
Language
Chinese Simplified
Country
PRC
Female | Male | Unspecified View on chart
53% | 46% | 2%
18-39
Accent(s) View on chart
Anhui Sheng, Beijing Shi, Fujian Sheng, Gansu Sheng, Guangdong Sheng, Hebei Sheng, Heilongjiang Sheng, Henan Sheng, Hubei Sheng, Hunan Sheng, Jiangsu Sheng, Jiangxi Sheng, Jilin Sheng, Liaoning Sheng, Other, Qinghai Sheng, Shaanxi Sheng, Shanghai Shi, Tianjin Shi, Zhejiang Sheng

Audio Details

Words
16389
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
8
Device type
mobile
Communication band
narrowband
Sample rate
8kHz

Chart details