ColossalAI/tests/test_shardformer/test_model
2025-05-23 11:16:36 +08:00
__init__.py
_utils.py [shardformer] support ep for deepseek v3 (#6185) 2025-02-11 16:10:25 +08:00
test_shard_bert.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_blip2.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_bloom.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_chatglm2.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_command.py fix 2025-05-16 11:39:50 +08:00
test_shard_deepseek_v3.py [release] update version (#6195) 2025-02-20 11:36:18 +08:00
test_shard_deepseek.py Update test_shard_deepseek.py 2025-05-23 11:16:36 +08:00
test_shard_falcon.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_gpt2.py [upgrade] upgrade gpt2 (#6291) 2025-05-08 14:10:21 +08:00
test_shard_gptj.py [release] update version (#5752) 2024-05-31 19:40:26 +08:00
test_shard_llama.py [Zerobubble] merge main. (#6142) 2024-11-19 19:00:36 +08:00
test_shard_mistral.py [upgrade]upgrade mistral (#6296) 2025-05-21 16:14:45 +08:00
test_shard_mixtral.py [hotfix] fix hybrid checkpointio for sp+dp (#6184) 2025-02-06 17:21:04 +08:00
test_shard_opt.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_shard_qwen2.py [Feature] Split cross-entropy computation in SP (#5959) 2024-09-10 12:06:50 +08:00
test_shard_sam.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00
test_shard_t5.py [misc] update torch version (#6206) 2025-02-24 14:35:48 +08:00
test_shard_vit.py [CI/tests] simplify some test case to reduce testing time (#5755) 2024-06-04 13:57:54 +08:00
test_shard_whisper.py [misc] refactor launch API and tensor constructor (#5666) 2024-04-29 10:40:11 +08:00