Scripted Monologue
Spanish Speech Data - Scripted Monologue
Insurance

$150.00
Version Number
01
Published date
Sep 1, 2021
How can I get the dataset?
After you click the button and fill out the form, we will contact you to discuss the details.
Not what you're looking for?
We can collect a customized dataset according to your precise needs.
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Packaging description
A zip file containing metadata files in TSV format and a folder with all the audio files.
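A minimal sketch of how a package with this layout might be inspected, using only the Python standard library. The folder and file names inside the zip, the TSV column names, and the text encoding are not documented in this listing, so the hypothetical helper summarize_package finds members by extension and assumes UTF-8; adjust both once the real package is in hand.

```python
import csv
import io
import wave
import zipfile


def summarize_package(zip_path):
    """Return TSV header rows and the set of WAV parameters found in a zip.

    Assumptions (not confirmed by the listing): metadata files end in
    ".tsv", audio files end in ".wav", and the TSV files are UTF-8.
    """
    tsv_headers = {}   # member name -> list of column names (first row)
    wav_params = set() # (sample rate in Hz, bits per sample, channels)
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            lower = name.lower()
            if lower.endswith(".tsv"):
                with archive.open(name) as raw:
                    text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                    reader = csv.reader(text, delimiter="\t")
                    tsv_headers[name] = next(reader, [])
            elif lower.endswith(".wav"):
                # Read the member into memory so the wave module gets a
                # plain seekable file object.
                data = io.BytesIO(archive.read(name))
                with wave.open(data, "rb") as wav:
                    wav_params.add((
                        wav.getframerate(),
                        wav.getsampwidth() * 8,
                        wav.getnchannels(),
                    ))
    return tsv_headers, wav_params
```

If the audio matches the Audio Details given in this listing, every tuple in the returned set should report 16000 Hz and 16 bits per sample; any other value would point to a file worth checking before training or benchmarking.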
This dataset contains 61 hours of Spanish Scripted Monologue data, recorded from speakers in Spain.
Seller Name
Defined.ai
Dataset details
About
Domain | Insurance |
Total recordings | 28957 |
File size | 6.67 GB |
Hours | 61 |
Word error rate (%): errors in the text transcription of the audio compared with a perfect transcription, counting words omitted, inserted, or wrongly replaced | 1.4% |
Total prompts | 28957 |
Unique prompts | 11515 |
Average number of recordings per speaker | 64.78 |
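The word error rate row counts omitted, inserted, and wrongly replaced words. The listing does not say how its 1.4% figure was computed, so the following is only a sketch of the conventional definition: word-level edit distance divided by the number of reference words. The function name word_error_rate and the whitespace tokenization are assumptions made for illustration.

```python
def word_error_rate(reference, hypothesis):
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: words are separated by whitespace and compared exactly
    (no case folding or punctuation normalization).
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word omitted
                curr[j - 1] + 1,     # extra word inserted
                prev[j - 1] + cost,  # match or wrong replacement
            ))
        prev = curr
    return prev[-1] / len(ref)
```

For example, comparing the reference "a b c d" with the hypothesis "a x c" gives one replacement and one omission over four reference words, so the function returns 0.5.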
Demographic
Locale (the languages and countries applicable to the speakers in the dataset) | es-es |
Language | Spanish |
Country | Spain |
Female | 59% |
Male | 41% |
Unspecified | 0% |
Age | 18-62 |
Accent(s) | Andaluz (Andalucía), Aragonés (Aragón), Castellano del Norte, Castellano del Sur, Español de Asturias, Español de Galicia, Español de Valencia/Cataluña, Extremaduran (Suroeste de España), Islas Baleares, Islas Canarias, Leonés (León), Murciano (Murcia), Navarro (Navarra) |
Audio Details
Words | 551316 |
Recording environment | noisy, silent |
Audio format | WAV |
Bits per sample | 16 |
Device type | mobile |
Communication band | broadband |
Sample rate | 16 kHz |
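The figures in this listing can be cross-checked with a little arithmetic. The sketch below derives per-recording averages, an approximate speaker count, and the raw PCM size implied by the stated hours, sample rate, and bit depth. The channel count is not stated in the listing, so mono is an assumption, and derived_stats is a hypothetical helper.

```python
# Values copied from the listing.
HOURS = 61
TOTAL_RECORDINGS = 28957
TOTAL_WORDS = 551316
RECORDINGS_PER_SPEAKER = 64.78
SAMPLE_RATE_HZ = 16_000
BITS_PER_SAMPLE = 16

# Assumption: mono audio; the listing does not give a channel count.
CHANNELS = 1


def derived_stats():
    """Derive rough per-recording and size figures from the listed totals."""
    seconds = HOURS * 3600
    pcm_bytes = seconds * SAMPLE_RATE_HZ * (BITS_PER_SAMPLE // 8) * CHANNELS
    return {
        "avg_seconds_per_recording": seconds / TOTAL_RECORDINGS,
        "avg_words_per_recording": TOTAL_WORDS / TOTAL_RECORDINGS,
        "approx_speakers": TOTAL_RECORDINGS / RECORDINGS_PER_SPEAKER,
        "pcm_gigabytes": pcm_bytes / 1e9,  # decimal GB, WAV headers excluded
    }


if __name__ == "__main__":
    for key, value in derived_stats().items():
        print(f"{key}: {value:.2f}")
```

Under these assumptions the results are roughly 7.6 seconds and 19 words per recording, about 447 speakers, and about 7.0 GB of raw PCM (around 6.5 GiB), which is in the same range as the listed file size. The hours figure is rounded and the listing does not say whether its size is in decimal or binary units, so an exact match should not be expected.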
Details on charts
Charts (not reproduced here): Phonetic Distribution, Age Distribution, Gender Distribution, Accent Distribution