Skip to content
Open
Changes from 1 commit
Commits
Show all changes
145 commits
Select commit Hold shift + click to select a range
28f6545
WIP v0 MLS English recipe
kinanmartin Apr 9, 2025
ac0c0ed
update prepare.sh, fix asr_datamodule.py
kinanmartin Apr 11, 2025
a1fc642
change default path
kinanmartin Apr 11, 2025
defc71b
replace file
kinanmartin Apr 13, 2025
efe015d
cleaned-up version of recipe
kinanmartin Apr 15, 2025
8c1c710
symlink copied files to librispeech recipe dir
kinanmartin Apr 15, 2025
8985259
separate transcript prep stage from bpe train stage
kinanmartin Apr 15, 2025
a34d34a
pre-commit hooks
kinanmartin Apr 15, 2025
ce44150
readme
kinanmartin Apr 15, 2025
68e3cea
instead of on-the-fly features, precompute fbank and manifests in pre…
kinanmartin Apr 23, 2025
d6e3c98
move compute_fbank_mls_english.py, add validate_manifest.py, add shar…
kinanmartin Apr 24, 2025
4ca8ee9
adjusted prepare.sh to only calculate fbank and manifest together; ad…
kinanmartin Apr 30, 2025
59519a4
fix validation manifest name
kinanmartin Apr 30, 2025
f2e0171
fix stage 2 and 3
kinanmartin Apr 30, 2025
fa84782
optimize with num_jobs on save_audios
kinanmartin May 1, 2025
abebb6a
new version of multi_ja_en prepare.sh script which swaps Librispeech …
kinanmartin May 9, 2025
c83b115
add fbank
baileyeet May 1, 2025
61e81bf
Revert "add fbank"
baileyeet May 2, 2025
3751441
deprecate params.bilingual=0, replace ReazonSpeechAsrDataModule for M…
kinanmartin May 13, 2025
6d71d9c
remove bilingual tag from train.py
baileyeet May 13, 2025
5417e09
restore version of mls_english compute_fbank_mls_english.py and prepa…
kinanmartin May 14, 2025
782e1fb
fix stage 5 output pathing
kinanmartin May 15, 2025
f4b2987
switch mls_english clone from https to ssh
kinanmartin May 21, 2025
a8ecb16
use huggingface_hub library to download mls_english
kinanmartin May 22, 2025
3307836
Combined updates. Changed BBPE path structure, changed dataset path s…
kinanmartin Jun 4, 2025
2f1c611
fix decode script data module usage
kinanmartin Jun 6, 2025
eafbd64
add utility file for updating the storage_path of cutsets for use in …
kinanmartin Jun 6, 2025
b167ac7
add utility file for creating subsets of mls english. must be fixed t…
kinanmartin Jun 6, 2025
ad1be22
Parametrize dev and test split sizes.
kinanmartin Jun 10, 2025
78ee595
Add failsafe for MLS English dev set key alternate name as validation
kinanmartin Jun 11, 2025
fd3fbe6
Update README.md to reflect MLS English dataset
kinanmartin Jun 11, 2025
c77a847
add step 4: display manifest stats to mls_eng
baileyeet Jun 11, 2025
cdf246c
update manifest dir path
baileyeet Jun 11, 2025
f3e59df
add stage 6 - update cutset paths to prepare
baileyeet Jun 11, 2025
ddc2daa
remove commented out codels
baileyeet Jun 12, 2025
f6ad423
changes to train script - no need for limiting utterance length here
baileyeet Jun 12, 2025
19b62c0
remove unused local scripts
baileyeet Jun 12, 2025
5f2f684
make prepare.sh symlinks relative
kinanmartin Jul 8, 2025
70a7940
changes to asr_datamodule for musan support
baileyeet Jul 1, 2025
df923f3
typos
baileyeet Jul 1, 2025
5ec9389
commenting
baileyeet Jul 1, 2025
de35cc2
remove comment
baileyeet Jul 4, 2025
f51621b
resolve typos and import issues
baileyeet Jul 9, 2025
4e92879
update musan path
baileyeet Jul 10, 2025
093a035
update musan paths
baileyeet Jul 10, 2025
0f700ed
update musan symlinks
baileyeet Jul 11, 2025
d5cc030
attempt to fix musan paths
baileyeet Jul 14, 2025
aee7b87
working changes for musan mixing
baileyeet Jul 15, 2025
310aaec
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
542620c
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
f7fec4a
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
154ef43
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
6012edb
black and isort formatting
baileyeet Jul 16, 2025
dc4db37
PR review suggestions implemented
baileyeet Jul 16, 2025
9d93d63
Update RESULTS.md
baileyeet Jul 18, 2025
aed139f
Musan implementation for ReazonSpeech (#1988)
baileyeet Jul 18, 2025
dbd8977
Manually fix merge conflict in multi_ja_en/ASR/zipformer/train.py
kinanmartin Jul 28, 2025
1c5d792
Validate generated manifest files. (#338)
csukuangfj May 2, 2022
c92c606
WIP v0 MLS English recipe
kinanmartin Apr 9, 2025
ba6d8e8
update prepare.sh, fix asr_datamodule.py
kinanmartin Apr 11, 2025
0ab0274
change default path
kinanmartin Apr 11, 2025
1b8a306
replace file
kinanmartin Apr 13, 2025
e76b749
cleaned-up version of recipe
kinanmartin Apr 15, 2025
313afea
symlink copied files to librispeech recipe dir
kinanmartin Apr 15, 2025
c532a50
separate transcript prep stage from bpe train stage
kinanmartin Apr 15, 2025
24db8c1
pre-commit hooks
kinanmartin Apr 15, 2025
996334f
readme
kinanmartin Apr 15, 2025
fe88d1d
instead of on-the-fly features, precompute fbank and manifests in pre…
kinanmartin Apr 23, 2025
a8f45bc
move compute_fbank_mls_english.py, add validate_manifest.py, add shar…
kinanmartin Apr 24, 2025
eb2168b
adjusted prepare.sh to only calculate fbank and manifest together; ad…
kinanmartin Apr 30, 2025
2504b23
fix validation manifest name
kinanmartin Apr 30, 2025
73dea24
fix stage 2 and 3
kinanmartin Apr 30, 2025
0e86ef8
optimize with num_jobs on save_audios
kinanmartin May 1, 2025
06e4291
new version of multi_ja_en prepare.sh script which swaps Librispeech …
kinanmartin May 9, 2025
7d462aa
add fbank
baileyeet May 1, 2025
31a37c7
Revert "add fbank"
baileyeet May 2, 2025
99db0e4
deprecate params.bilingual=0, replace ReazonSpeechAsrDataModule for M…
kinanmartin May 13, 2025
8b035a0
remove bilingual tag from train.py
baileyeet May 13, 2025
7bea23e
restore version of mls_english compute_fbank_mls_english.py and prepa…
kinanmartin May 14, 2025
2265e1a
fix stage 5 output pathing
kinanmartin May 15, 2025
5682978
switch mls_english clone from https to ssh
kinanmartin May 21, 2025
1093e78
use huggingface_hub library to download mls_english
kinanmartin May 22, 2025
1b1a317
Combined updates. Changed BBPE path structure, changed dataset path s…
kinanmartin Jun 4, 2025
68bff93
fix decode script data module usage
kinanmartin Jun 6, 2025
b25254f
add utility file for updating the storage_path of cutsets for use in …
kinanmartin Jun 6, 2025
d136086
add utility file for creating subsets of mls english. must be fixed t…
kinanmartin Jun 6, 2025
b6d43a4
Parametrize dev and test split sizes.
kinanmartin Jun 10, 2025
9c318da
Add failsafe for MLS English dev set key alternate name as validation
kinanmartin Jun 11, 2025
065ca31
Update README.md to reflect MLS English dataset
kinanmartin Jun 11, 2025
0a4ed5e
add step 4: display manifest stats to mls_eng
baileyeet Jun 11, 2025
1ddd3cd
update manifest dir path
baileyeet Jun 11, 2025
606789b
add stage 6 - update cutset paths to prepare
baileyeet Jun 11, 2025
76bae70
remove commented out codels
baileyeet Jun 12, 2025
ac94174
changes to train script - no need for limiting utterance length here
baileyeet Jun 12, 2025
9c91775
remove unused local scripts
baileyeet Jun 12, 2025
694ecb9
make prepare.sh symlinks relative
kinanmartin Jul 8, 2025
d7ee48e
Validate generated manifest files. (#338)
csukuangfj May 2, 2022
1996507
changes to asr_datamodule for musan support
baileyeet Jul 1, 2025
ed2c0a4
typos
baileyeet Jul 1, 2025
5fb4bdf
commenting
baileyeet Jul 1, 2025
1cf544b
remove comment
baileyeet Jul 4, 2025
c610c6d
resolve typos and import issues
baileyeet Jul 9, 2025
6272827
update musan path
baileyeet Jul 10, 2025
aeffb15
update musan paths
baileyeet Jul 10, 2025
4475815
update musan symlinks
baileyeet Jul 11, 2025
a310d8f
attempt to fix musan paths
baileyeet Jul 14, 2025
60f326b
working changes for musan mixing
baileyeet Jul 15, 2025
95f58e6
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
865b859
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
b19929c
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
2f1f419
Update egs/multi_ja_en/ASR/local/utils/update_cutset_paths.py
baileyeet Jul 16, 2025
7b4abba
black and isort formatting
baileyeet Jul 16, 2025
8dd2c0f
PR review suggestions implemented
baileyeet Jul 16, 2025
94cf8c3
support left pad for make_pad_mask (#1990)
yfyeung Jul 16, 2025
0ca7595
Update RESULTS.md
baileyeet Jul 18, 2025
11df2a8
Musan implementation for ReazonSpeech (#1988)
baileyeet Jul 18, 2025
2d8e3fd
Fix transformer decoder layer (#1995)
csukuangfj Jul 18, 2025
f15a783
Validate generated manifest files. (#338)
csukuangfj May 2, 2022
c23af2e
musan implementation for mls_english
baileyeet Aug 5, 2025
ed79fa3
revert unrelated transformer.py diffs from rebase
baileyeet Aug 5, 2025
636121c
remove bilingual tag from train.py
baileyeet May 13, 2025
0967f5f
Manually fix merge conflict in multi_ja_en/ASR/zipformer/train.py
kinanmartin Jul 28, 2025
f210002
Validate generated manifest files. (#338)
csukuangfj May 2, 2022
ee2a6d6
remove bilingual tag from train.py
baileyeet May 13, 2025
dee07de
Validate generated manifest files. (#338)
csukuangfj May 2, 2022
f9ceead
Validate generated manifest files. (#338)
csukuangfj May 2, 2022
4e05d70
fix stash commit
baileyeet Aug 6, 2025
130c2a5
Merge branch 'multi_ja_en_mls_english_clean' into musan-mls-clean-final
baileyeet Aug 6, 2025
5400f43
training and decoding compatibility changes
baileyeet Aug 11, 2025
8c08c9c
Create RESULTS.md
baileyeet Aug 14, 2025
8e18616
Update RESULTS.md
baileyeet Aug 14, 2025
556a3f0
Update README.md
baileyeet Aug 14, 2025
36fc1f1
Merge pull request #4 from reazon-research/musan-mls-clean-final
kinanmartin Aug 22, 2025
7231cf4
Remove changes to files outside of relevant recipes
kinanmartin Aug 29, 2025
a4c1db5
reformat
baileyeet Sep 2, 2025
2859c22
Update RESULTS.md
baileyeet Sep 2, 2025
9a940c3
Update RESULTS.md
baileyeet Sep 2, 2025
f64a706
Update egs/multi_ja_en/ASR/RESULTS.md
kinanmartin Sep 2, 2025
ef7664e
Update egs/mls_english/ASR/local/utils/asr_datamodule.py
kinanmartin Sep 2, 2025
bc2560c
Update training commands and decode.py accuracy values, add streaming…
kinanmartin Sep 3, 2025
ecbe985
Update streaming train and export commands
kinanmartin Sep 4, 2025
a30e80c
Remove accidentally added submodule musan-k2-v2-reazonspeech-medium
baileyeet Sep 11, 2025
9d389cd
Update egs/reazonspeech/ASR/local/compute_fbank_musan.py
baileyeet Sep 11, 2025
8c84639
Update egs/mls_english/ASR/zipformer/streaming_decode.py
baileyeet Sep 11, 2025
d74e232
Merge branch 'master' into multi_ja_en_mls_english_clean
baileyeet Sep 11, 2025
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
49 changes: 39 additions & 10 deletions egs/multi_ja_en/ASR/RESULTS.md
Original file line number Diff line number Diff line change
Expand Up @@ -11,6 +11,7 @@ The training command is:
```shell
./zipformer/train.py \
--world-size 8 \
--causal 1 \
--num-epochs 10 \
--start-epoch 1 \
--use-fp16 1 \
Expand Down Expand Up @@ -82,6 +83,7 @@ The training command is:
```shell
./zipformer/train.py \
--world-size 8 \
--causal 1 \
--num-epochs 10 \
--start-epoch 1 \
--use-fp16 1 \
Expand All @@ -93,24 +95,51 @@ The training command is:
The decoding command is:

```shell
./zipformer/decode.py \
--epoch 10 \
--avg 1 \
--exp-dir ./zipformer/exp \
--decoding-method modified_beam_search \
--manifest-dir data/manifests
TODO
```

To export the model with onnx:
To export the model with sherpa onnx:

```shell
./zipformer/export-onnx.py \
./zipformer/export-onnx-streaming.py \
--tokens ./data/lang/bbpe_2000/tokens.txt \
--use-averaged-model 0 \
--epoch 10 \
--avg 1 \
--decode-chunk-len 32 \
--exp-dir ./zipformer/exp
--exp-dir ./zipformer/exp-15k15k-streaming \
--num-encoder-layers "2,2,3,4,3,2" \
--downsampling-factor "1,2,4,8,4,2" \
--feedforward-dim "512,768,1024,1536,1024,768" \
--num-heads "4,4,4,8,4,4" \
--encoder-dim "192,256,384,512,384,256" \
--query-head-dim 32 \
--value-head-dim 12 \
--pos-head-dim 4 \
--pos-dim 48 \
--encoder-unmasked-dim "192,192,256,256,256,192" \
--cnn-module-kernel "31,31,15,15,15,31" \
--decoder-dim 512 \
--joiner-dim 512 \
--causal True \
--chunk-size 16 \
--left-context-frames 128 \
--fp16 True
```

(Adjust the `chunk-size` and `left-context-frames` as necessary)

To export the model as Torchscript (`.jit`):

```shell
./zipformer/export.py \
--exp-dir ./zipformer/exp-15k15k-streaming \
--causal 1 \
--chunk-size 16 \
--left-context-frames 128 \
--tokens data/lang/bbpe_2000/tokens.txt \
--epoch 10 \

Copy link
Copy Markdown
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since your dataset is large, can you also try --iter instead of --epoch?

--avg 1 \
--jit 1
```

You may also use decode chunk sizes `16`, `32`, `64`, `128`.
Expand Down
Loading