Changelog
After around a year since the last update a lot of changes have been made to the model and scripts
Mainly there was a work to improve the audio+text dataset importer and bump to DeepSpeech 0.9.3.
It isn't a stable release as we don't have time now to do a proper release and also because there it will be soon the new CV dataset and now italian will have more than 300 hours compared to the version used to generate this.
For instructions how it was generated, parameters and other stuff check https://github.com/MozillaItalia/DeepSpeech-Italian-Model/wiki/Training-Notes-DeepSpeech-0.9.3-(2021.07.22-pre-release)
Trainer
- CommonVoice 6.1 (Cleaned) : 126h
- MITADS-Speech (Cleaned): 349h
Total 475h
Available in 2 version transfer used transfer learning form the official English model release by mozilla and other one is from scratch .
Thanks
This release was not possible without @eziolotta that did... everything!
Me (@Mte90) worked on the project management side about the model and with the help for the server offered by the Turin university we were able to do everything.
License
CC0 as public domain.