English - United Kingdom

English Speech Data - Spontaneous Dialogue - 189h

Retail
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Retail
Use case(s)
call centre, conversational AI
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
3471
Hours
189
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
5.72%
Average amount of recordings per speaker
26.4
License Type Link
Published date
Sep 1, 2021
File size
20.41GB
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files

Demographic

Number of speakers
263.000000
Locale The language(s) and country(s) applicable to the speakers in the dataset.
en-gb
Language
English
Country
United Kingdom
Female | Male | Unspecified View on chart
66% | 33% | 1%
18-70
Accent(s) View on chart
English - East and Central Midlands (Cambridge, Leicester, Nottingham), English - East Anglia (Norfolk, Ipswich), English - Geordie (Newcastle, Sunderland, Northumberland), English - Hampshire/Wiltshire, English - London and Greater London/Surrey, English - Mancunian (Manchester and Greater Manchester), English - Scouse/Northwestern (Liverpool, Lancashire, Blackpool), English - Sussex (East/West), English - West Country (Bristol, Gloucester, Somerset), English - West Midlands (Birmingham, Coventry), English - Yorkshire (Sheffield, Leeds, Middlesbrough), Irish - Belfast/East Ulster, Irish - Derry/West Ulster, Scottish - Aberdeen/Northern Lowlands, Scottish - Edinburgh-Dundee, Scottish - Glasgow-Stirling, Scottish - Inner/Outer Hebrides, Welsh - Southern (Cardiff, Newport), Welsh - Western (Swansea, Pembrokeshire)

Audio Details

Words
1784659
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
8
Device type
mobile
Communication band
narrowband
Sample rate
8kHz

Chart details