Releases: lucidrains/st-moe-pytorch

0.0.6

20 Aug 15:04
remove dropout, as in the paper, they show it is unhelpful (and also …

0.0.5

20 Aug 15:00
when doing eval, turn off balance and router z loss calculations

0.0.3

20 Aug 14:24
init expert weights and biases

0.0.2

19 Aug 18:16
first pass for router z loss

0.0.1

19 Aug 17:29
start cleaning up, add the ff geglu based experts with multiplicative…