flybird1111
7a3dfd0c64
[shardformer] update shardformer to use flash attention 2 (#4392)
* cherry-pick flash attention 2
cherry-pick flash attention 2
* [shardformer] update shardformer to use flash attention 2
[shardformer] update shardformer to use flash attention 2, fix
[shardformer] update shardformer to use flash attention 2, fix
[shardformer] update shardformer to use flash attention 2, fix
2023-08-15 23:25:14 +08:00
..
2023-05-11 16:30:58 +08:00
2023-08-15 23:25:14 +08:00
2023-04-06 14:51:35 +08:00
2023-05-15 17:20:56 +08:00
2023-06-25 13:34:15 +08:00
2023-07-04 16:05:01 +08:00
2023-08-15 23:25:14 +08:00
2023-07-31 22:13:29 +08:00
2023-08-15 23:25:14 +08:00
2023-04-06 14:51:35 +08:00
2022-06-10 11:27:38 +08:00
2023-04-06 14:51:35 +08:00
2023-04-06 14:51:35 +08:00
2023-05-11 16:30:58 +08:00
2023-04-06 14:51:35 +08:00
2023-07-04 16:07:47 +08:00
2023-04-06 14:51:35 +08:00
2023-08-15 23:25:14 +08:00
2023-07-18 23:53:38 +08:00
2023-05-11 16:30:58 +08:00
2023-08-15 23:25:14 +08:00
2023-05-11 16:30:58 +08:00
2023-04-06 14:51:35 +08:00
2023-06-05 15:58:31 +08:00
2023-08-15 23:25:14 +08:00
2023-08-15 23:25:14 +08:00
2023-07-04 16:05:01 +08:00
2023-05-11 16:30:58 +08:00
2023-08-15 23:25:14 +08:00
2023-08-11 15:09:24 +08:00