Italian - Italy

Italian Speech Data - Scripted Monologue - 321h

Insurance

Dataset Details

About

Domain
Insurance
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
169273
Hours
321
Word error rate (share of words in the text representation of the audio that are omitted, inserted, or wrongly substituted relative to a perfect transcript)
0.38%
Total prompts
169273
Unique prompts
67684
Average amount of recordings per speaker
64.66
License Type Link
Published date
Sep 1, 2021
File size
24.27GB
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files
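
The word error rate listed above counts omitted, inserted, and wrongly substituted words. The page does not say how its 0.38% figure was computed; the following is a minimal sketch of the conventional definition (word-level edit distance divided by the number of reference words). The function name and the example sentences are hypothetical and are not taken from the dataset.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word omitted
            insertion = curr[j - 1] + 1  # extra word in the hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one of five reference words is omitted.
print(word_error_rate("la polizza copre il furto", "la polizza copre furto"))  # 0.2
```

Multiplying the result by 100 gives a percentage comparable in form to the figure listed above.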

Demographic

Number of speakers
2618
Locale (the language and country of the speakers in the dataset)
it-it
Language
Italian
Country
Italy
Female | Male | Unspecified
51% | 49% | 0%
Age range
18-75
Accent(s)
Agrigento, Alessandria, Ancona, Arezzo, Ascoli Piceno, Asti, Avellino, Bari, Barletta-Andria-Trani, Belluno, Benevento, Bergamo, Bologna, Bolzano, Brescia, Brindisi, Cagliari, Caltanissetta, Campobasso, Caserta, Catania, Catanzaro, Chieti, Como, Cosenza, Cremona, Crotone, Cuneo, Enna, Fermo, Ferrara, Firenze, Foggia, Forlì-Cesena, Friuli Venezia Giulia, Frosinone, Genova, Grosseto, Imperia, Isernia, L'Aquila, La Spezia, Latina, Lecce, Lecco, Livorno, Lodi, Lucca, Macerata, Matera, Messina, Milano, Modena, Monza e Brianza, Napoli, Novara, Nuoro, Oristano, Padova, Palermo, Parma, Pavia, Perugia, Pesaro e Urbino
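
Some of the figures on this page can be cross-checked against each other. The sketch below recomputes the average number of recordings per speaker from the listed totals and derives two further ratios that the page does not state. The hour count is rounded on the page, so the per-recording duration is only approximate; all variable names are illustrative.

```python
# Figures as listed on this page.
TOTAL_RECORDINGS = 169273
UNIQUE_PROMPTS = 67684
HOURS = 321        # rounded on the page
SPEAKERS = 2618

recordings_per_speaker = TOTAL_RECORDINGS / SPEAKERS
seconds_per_recording = HOURS * 3600 / TOTAL_RECORDINGS  # approximate
unique_prompt_share = UNIQUE_PROMPTS / TOTAL_RECORDINGS

print(f"{recordings_per_speaker:.2f} recordings per speaker")  # 64.66, matching the page
print(f"{seconds_per_recording:.1f} s per recording (approximate)")
print(f"{unique_prompt_share:.1%} of recordings use a distinct prompt")
```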

Audio Details

Words
2276445
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz
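
After unpacking the archive, it can be useful to confirm that a given file matches the audio format listed above. A minimal sketch using Python's standard `wave` module follows; the function name is hypothetical, and the channel count is not checked because the page does not list it.

```python
import wave

# Values taken from the Audio Details listed above.
LISTED_SAMPLE_RATE_HZ = 16000
LISTED_BITS_PER_SAMPLE = 16


def matches_listed_format(path: str) -> bool:
    """Return True if the WAV file has the listed sample rate and bit depth."""
    with wave.open(path, "rb") as wav:
        sample_rate = wav.getframerate()
        bits_per_sample = wav.getsampwidth() * 8  # sample width is in bytes
    return (sample_rate == LISTED_SAMPLE_RATE_HZ
            and bits_per_sample == LISTED_BITS_PER_SAMPLE)
```

Calling `matches_listed_format` on each extracted audio file would flag any that deviate from the listed 16 kHz, 16-bit format.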
