ColossalAI/tests/kit/model_zoo/transformers
2025-07-10 13:57:52 +08:00
__init__.py [feat] support qwen3 in shardformer 2025-07-10 13:57:52 +08:00
albert.py
bert.py [upgrade]Upgrade transformers (#6320) 2025-05-27 14:29:01 +08:00
blip2.py
bloom.py
chatglm2.py [test] fix chatglm test kit (#5793) 2024-06-11 16:54:31 +08:00
command.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
deepseek_v3.py [release] update version (#6195) 2025-02-20 11:36:18 +08:00
deepseek.py [misc] remove debug/print code 2024-08-01 10:06:59 +08:00
falcon.py
gpt.py [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-05-27 08:48:12 +00:00
gptj.py
llama.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
mistral.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
mixtral.py [Feature] MoE Ulysses Support (#5918) 2024-08-01 10:06:59 +08:00
opt.py [upgrade]Upgrade transformers (#6320) 2025-05-27 14:29:01 +08:00
qwen2.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
qwen3.py [feat] support qwen3 in shardformer 2025-07-10 13:57:52 +08:00
sam.py
t5.py [shardformer] Support the T5ForTokenClassification model (#5816) 2024-06-27 16:40:38 +08:00
vit.py
whisper.py