Hongxin Liu
dc2cdaf3e8
[shardformer] optimize seq parallelism (#6086)
* [shardformer] optimize seq parallelism
* [shardformer] fix gpt2 fused linear col
* [plugin] update gemini plugin
* [plugin] update moe hybrid plugin
* [test] update gpt2 fused linear test
* [shardformer] fix gpt2 fused linear reduce
2024-10-11 13:44:40 +08:00