Italian - Italy

Italian Speech Data - Scripted Monologue - 257h

Telecommunication

Dataset Details

About

Domain
Telecommunication
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
98747
Hours
257
Word error rate (%) (the share of words omitted, inserted, or wrongly replaced when the text representation is compared with a perfect transcription of the audio)
1.03%
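As an illustration of the definition above, here is a minimal sketch of the standard word error rate calculation: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The card does not say how the listed 1.03% was computed, so the whitespace tokenization and the function name `word_error_rate` are assumptions made for this example only.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER in percent: (substitutions + deletions + insertions) / reference words."""
    ref = reference.split()   # assumption: plain whitespace tokenization
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion (word omitted)
                curr[j - 1] + 1,     # insertion (extra word)
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return 100.0 * prev[-1] / len(ref)
```

For example, a five-word prompt transcribed with one word missing gives a rate of 20%.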
Total prompts
98747
Unique prompts
25514
Average number of recordings per speaker
50.64
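The per-speaker average can be cross-checked against the other figures on this card. The short sketch below recomputes it from the total recordings and the speaker count, and derives two rough per-recording figures that the card does not list. Those two are approximations, since the hour count appears to be rounded; the variable names are chosen for this example.

```python
# Figures copied from this card.
TOTAL_RECORDINGS = 98_747
SPEAKERS = 1_950
HOURS = 257
TOTAL_WORDS = 2_094_897

recordings_per_speaker = TOTAL_RECORDINGS / SPEAKERS
seconds_per_recording = HOURS * 3600 / TOTAL_RECORDINGS   # approximate: hours are rounded
words_per_recording = TOTAL_WORDS / TOTAL_RECORDINGS

print(f"{recordings_per_speaker:.2f} recordings per speaker")  # 50.64, matching the listed value
print(f"{seconds_per_recording:.1f} s per recording (approximate)")
print(f"{words_per_recording:.1f} words per recording")
```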
License Type Link
Published date
Sep 1, 2021
File size
27.76 GB
Packaging description
A ZIP file containing metadata files in TSV format and a folder with all the audio files.
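Given that packaging, the metadata can be read without unpacking the archive. The sketch below uses only the Python standard library. It assumes the TSV files are UTF-8 encoded and start with a header row; the card does not state either, nor does it name the files or their columns, so the function `read_tsv_metadata` simply loads every `.tsv` member it finds.

```python
import csv
import io
import zipfile

def read_tsv_metadata(zip_path):
    """Return {member name: list of row dicts} for every .tsv file in the archive."""
    tables = {}
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(".tsv"):
                continue  # skip the audio folder and anything else
            with archive.open(name) as raw:
                # Assumption: UTF-8 text with a header row.
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                reader = csv.DictReader(text, delimiter="\t")
                tables[name] = list(reader)
    return tables
```

Calling `read_tsv_metadata` with the path of the downloaded archive returns one list of rows per metadata file, keyed by the file's path inside the archive.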

Demographic

Number of speakers
1950
Locale (the languages and countries applicable to the speakers in the dataset)
it-it
Language
Italian
Country
Italy
Female | Male | Unspecified
59% | 41% | 0%
Age range
18-87
Accent(s)
Agrigento, Alessandria, Ancona, Arezzo, Ascoli Piceno, Asti, Avellino, Bari, Barletta-Andria-Trani, Belluno, Benevento, Bergamo, Biella, Bologna, Bolzano, Brescia, Brindisi, Cagliari, Caltanissetta, Campobasso, Caserta, Catania, Catanzaro, Chieti, Como, Cosenza, Cremona, Crotone, Cuneo, Enna, Fermo, Ferrara, Firenze, Foggia, Forlì-Cesena, Friuli Venezia Giulia, Frosinone, Genova, Grosseto, Imperia, Isernia, L'Aquila, La Spezia, Latina, Lecce, Lecco, Livorno, Lodi, Lucca, Macerata, Mantova, Massa-Carrara, Matera, Messina, Milano, Modena, Monza e Brianza, Napoli, Novara, Nuoro, Oristano, Other, Padova, Palermo

Audio Details

Words
2094897
Recording environment
silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16 kHz
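The audio parameters listed above can be verified per file with Python's standard `wave` module. This is a minimal sketch that assumes the files are uncompressed PCM WAV, which is the only kind `wave` reads. The function name `check_wav` and the constants are chosen for this example, and the channel count is not checked because the card does not state it.

```python
import wave

EXPECTED_SAMPLE_RATE_HZ = 16_000  # "Sample rate" on this card
EXPECTED_SAMPLE_WIDTH_BYTES = 2   # "Bits per sample: 16" on this card

def check_wav(path):
    """Return (matches_card, duration_seconds) for one PCM WAV file."""
    with wave.open(path, "rb") as wav:
        matches_card = (
            wav.getframerate() == EXPECTED_SAMPLE_RATE_HZ
            and wav.getsampwidth() == EXPECTED_SAMPLE_WIDTH_BYTES
        )
        duration_seconds = wav.getnframes() / wav.getframerate()
    return matches_card, duration_seconds
```

Summing `duration_seconds` over all files gives a total that can be compared with the listed hours.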
