Mandarin Chinese - PRC

Chinese Simplified Speech Data - Spontaneous Dialogue - 223h

Retail
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Retail
Total recordings
2973
File size
24.02GB
Hours
223
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
0%
Average amount of recordings per speaker
29.0

Demographic

Locale The language(s) and country(s) applicable to the speakers in the dataset.
zh-cn
Language
Mandarin Chinese
Country
PRC
Female | Male | Unspecified View on chart
58% | 42% | 0%
18-61
Accent(s) View on chart
Anhui Sheng, Beijing Shi, Fujian Sheng, Gansu Sheng, Guangdong Sheng, Guangxi Zhuangzu Zizhiqu, Guizhou Sheng, Hainan Sheng, Hebei Sheng, Heilongjiang Sheng, Henan Sheng, Hubei Sheng, Hunan Sheng, Jiangsu Sheng, Jiangxi Sheng, Jilin Sheng, Liaoning Sheng, Nei Mongol Zizhiqu, Qinghai Sheng, Shandong Sheng, Shanghai Shi, Shanxi Sheng, Sichuan Sheng, Tianjin Shi, Xinjiang Uygur Zizhiqu, Xizang Zizhiqu, Zhejiang Sheng

Audio Details

Words
283943
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
narrowband
Sample rate
8kHz

Chart details