ColossalAI/tests/kit/model_zoo/transformers
2025-07-10 13:57:52 +08:00
__init__.py [feat] support qwen3 in shardformer 2025-07-10 13:57:52 +08:00
albert.py [misc] update pre-commit and run all files (#4752) 2023-09-19 14:20:26 +08:00
bert.py [upgrade]Upgrade transformers (#6320) 2025-05-27 14:29:01 +08:00
blip2.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
bloom.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
chatglm2.py [test] fix chatglm test kit (#5793) 2024-06-11 16:54:31 +08:00
command.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
deepseek_v3.py [release] update version (#6195) 2025-02-20 11:36:18 +08:00
deepseek.py [misc] remove debug/print code 2024-08-01 10:06:59 +08:00
falcon.py [shardformer]: support gpt-j, falcon, Mistral and add interleaved pipeline for bert (#5088) 2023-11-28 16:54:42 +08:00
gpt.py [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-05-27 08:48:12 +00:00
gptj.py [workflow] fixed oom tests (#5275) 2024-01-16 18:55:13 +08:00
llama.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
mistral.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
mixtral.py [Feature] MoE Ulysses Support (#5918) 2024-08-01 10:06:59 +08:00
opt.py [upgrade]Upgrade transformers (#6320) 2025-05-27 14:29:01 +08:00
qwen2.py [Feature] Zigzag Ring attention (#5905) 2024-08-16 13:56:38 +08:00
qwen3.py [feat] support qwen3 in shardformer 2025-07-10 13:57:52 +08:00
sam.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
t5.py [shardformer] Support the T5ForTokenClassification model (#5816) 2024-06-27 16:40:38 +08:00
vit.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00
whisper.py [test] merge old components to test to model zoo (#4945) 2023-10-20 10:35:08 +08:00