English - Spanish

English Speech Data - Scripted Monologue - 20h

Generic
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Generic
Total recordings
11910
File size
2.15GB
Hours
20
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
4.1%
Total prompts
11910
Unique prompts
7702
Average amount of recordings per speaker
57.82

Demographic

Locale The language(s) and country(s) applicable to the speakers in the dataset.
en-es
Language
English
Country
Spanish
Female | Male | Unspecified View on chart
55% | 45% | 0%
19-65
Accent(s) View on chart
Álava, Albacete, Alicante, Almería, Asturias, Badajoz, Balears, Barcelona, Bizkaia, Burgos, Cáceres, Cádiz, California, Cantabria, Castellón, Ciudad de México, Córdoba, Distrito Capital de Bogotá, Falcón, Girona, Granada, Huelva, Huesca, Jaén, La Coruña, Las Palmas, León, Lima, Lleida, Madrid, Málaga, Murcia, Navarra, Ourense, Pontevedra, Quintana Roo, Salamanca, Santa Cruz de Tenerife, Santander, Sevilla, Soria, Tarragona, Toledo, Valencia, Valladolid, Valle del Cauca, Valparaíso, Veracruz de Ignacio de la Llave, Zaragoza

Audio Details

Words
99316
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz

Chart details