5 Commits

Author SHA1 Message Date
HELSON
dbdc9a7783 added Multiply Jitter and capacity factor eval for MOE (#434) 2022-03-16 16:47:44 +08:00
1SAA
82023779bb Added TPExpert for special situation 2022-03-11 15:50:28 +08:00
1SAA
219df6e685 Optimized MoE layer and fixed some bugs;
Decreased moe tests;

Added FFNExperts and ViTMoE model
2022-03-11 15:50:28 +08:00
HELSON
1ff5be36c2 Added moe parallel example (#140) 2022-01-17 15:34:04 +08:00
HELSON
dceae85195 Added MoE parallel (#127) 2022-01-07 15:08:36 +08:00