Italian - Italy

Italian Speech Data - Scripted Monologue - 119h

Customer care
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Customer care
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
65979
Hours
119
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
1.03%
Total prompts
65979
Unique prompts
15998
Average amount of recordings per speaker
34.67
License Type Link
Published date
Sep 1, 2021
File size
12.82GB
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files

Demographic

Number of speakers
1903.000000
Locale The language(s) and country(s) applicable to the speakers in the dataset.
it-it
Language
Italian
Country
Italy
Female | Male | Unspecified View on chart
60% | 40% | 0%
18-87
Accent(s) View on chart
Agrigento, Alessandria, Ancona, Arezzo, Ascoli Piceno, Asti, Avellino, Bari, Barletta-Andria-Trani, Belluno, Benevento, Bergamo, Biella, Bologna, Bolzano, Brescia, Brindisi, Cagliari, Caltanissetta, Campobasso, Caserta, Catania, Catanzaro, Chieti, Como, Cosenza, Cremona, Crotone, Cuneo, Enna, Fermo, Ferrara, Firenze, Foggia, Forlì-Cesena, Friuli Venezia Giulia, Frosinone, Genova, Grosseto, Imperia, Isernia, L'Aquila, La Spezia, Latina, Lecce, Lecco, Livorno, Lodi, Lucca, Macerata, Mantova, Massa-Carrara, Matera, Messina, Milano, Modena, Monza e Brianza, Napoli, Novara, Nuoro, Oristano, Other, Padova, Palermo

Audio Details

Words
856187
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz

Chart details