Releases: lucidrains/st-moe-pytorch
Releases · lucidrains/st-moe-pytorch
0.1.8
Full Changelog: 0.1.7...0.1.8
0.1.7
Full Changelog: 0.1.6...0.1.7
0.1.6
Full Changelog: 0.1.5...0.1.6
0.1.5
make sure contiguous
0.1.4
router z loss should be calculated on the unnoised gating logits
0.1.2
allow for noising of gates
0.1.1
researcher will want to log the unweighted auxiliary losses
0.1.0
rename loss_coef to balance_loss_coef, sum the balance and router z-l…
0.0.30
handle variable sequence lengths if `allow_var_seq_len = True` on `Ex…
0.0.29
any combinatino of number of experts and world size should not break