mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2025-07-30 06:51:15 +00:00
* [fix] fix qwen VocabParallelLMHead1D and gather output * fix tp bug * fix consumer * [feat] Support Distributed LogProb for GRPO Training * [fix] fix loss func * [fix] fix log prob plugin * [fix] fix qwen modeling param * [fix] rm comments * [fix] rm hard-code;fix non-dist version * [fix] fix test file param name and benchmark tp gather output=True/False * [fix] rm non-dist version in dist log prob * [fix] fix comments * [fix] fix dis log prob plugin * [fix] fix test case * [fix] fix qwen VocabParallelLMHead1D and gather output * [fix] fix DistLogProb comments * [fix] restore tp size * [fix] fix comments * [fix] fix comment; fix LogSoftmax usage --------- Co-authored-by: Tong Li <tong.li35271158@gmail.com> |
||
---|---|---|
.. | ||
test_hybrid_parallel_grad_clip_norm | ||
test_layer | ||
test_model | ||
__init__.py | ||
test_flash_attention.py | ||
test_shard_utils.py | ||
test_with_torch_ddp.py |