ColossalAI/applications/ColossalChat/coati
2025-02-20 10:25:19 +00:00
..
dataset [application] add lora sft example (#6192) 2025-02-18 13:06:38 +08:00
experience_buffer Add GRPO and Support RLVR for PPO (#6186) 2025-02-18 09:43:36 +08:00
experience_maker fix inference rebatching bug 2025-02-20 17:28:49 +08:00
models Add GRPO and Support RLVR for PPO (#6186) 2025-02-18 09:43:36 +08:00
quant [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
ray [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
trainer [pre-commit.ci] auto fixes from pre-commit.com hooks 2025-02-20 10:25:19 +00:00
utils Add GRPO and Support RLVR for PPO (#6186) 2025-02-18 09:43:36 +08:00
__init__.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00