Skip to content

Commit 626d875

Browse files
author
KevNetG
authored
Added usage description
1 parent 3deb871 commit 626d875

File tree

1 file changed

+21
-1
lines changed

1 file changed

+21
-1
lines changed

README.md

+21-1
Original file line numberDiff line numberDiff line change
@@ -1 +1,21 @@
1-
# speech-to-text-voxforge
1+
# speech-to-text-voxforge
2+
3+
## Download the speech corpus
4+
In order to download the speech corpus run
5+
6+
```shell
7+
python downloader.py "voxforge-corpus"
8+
```
9+
10+
You can additionally specify the amount of speaker directories to be downloaded using `-n` or the amount of threads to be used for the download using `-w`:
11+
12+
```shell
13+
python downloader.py "voxforge-corpus" -n 20000 -w 15
14+
```
15+
16+
## Generate training data
17+
If you want to generate a training data file for the [speech recognition tool](https://github.com/KevNetG/speech-to-text), run `generator.py` providing the path to the directory where the voxforge corpus was being downloaded and a path to the new file where the training data should be stored. The data will be stored as JSON.
18+
19+
```shell
20+
python generator.py "voxforge-corpus" "training_data.json"
21+
```

0 commit comments

Comments
 (0)