Italian - Italy

Italian Speech Data - Scripted Monologue - 214h

Telecommunication
Audio demo
Sender
Invitee

Dataset Details

About

Domain
Telecommunication
Use case(s)
mobile speech
Model Applications
Acoustic Modelling, ASR Testing, Benchmarking
Total recordings
93025
Hours
214
Word error rate (%) Measurement indicating errors in alignment of text representation (actual vs. perfect) of audio, taking into account words omitted, inserted or wrongly replaced.
0.38%
Total prompts
93025
Unique prompts
35283
Average amount of recordings per speaker
57.39
License Type Link
Published date
Sep 1, 2021
File size
23.09GB
Packaging description
A zip file containing metadata files in tsv format and a folder with all the audio files

Demographic

Number of speakers
1621.000000
Locale The language(s) and country(s) applicable to the speakers in the dataset.
it-it
Language
Italian
Country
Italy
Female | Male | Unspecified View on chart
50% | 50% | 0%
18-74
Accent(s) View on chart
Agrigento, Alessandria, Ancona, Arezzo, Ascoli Piceno, Asti, Avellino, Bari, Barletta-Andria-Trani, Benevento, Bergamo, Biella, Bologna, Bolzano, Brescia, Brindisi, Cagliari, Caltanissetta, Caserta, Catania, Catanzaro, Chieti, Como, Cosenza, Cremona, Cuneo, Enna, Fermo, Ferrara, Firenze, Foggia, Forlì-Cesena, Friuli Venezia Giulia, Frosinone, Genova, Grosseto, Imperia, Isernia, L'Aquila, La Spezia, Latina, Lecce, Lecco, Livorno, Lodi, Lucca, Macerata, Matera, Messina, Milano, Modena, Monza e Brianza, Napoli, Novara, Nuoro, Oristano, Padova, Palermo, Parma, Pavia, Perugia, Pesaro e Urbino, Pescara, Piacenza

Audio Details

Words
1652853
Recording environment
noisy, silent
Audio format
WAV
Bits per sample
16
Device type
mobile
Communication band
broadband
Sample rate
16kHz

Chart details