1
0
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2025-05-08 08:28:11 +00:00
ColossalAI/applications/ColossalChat/coati
2025-04-26 14:00:28 +08:00
..
dataset add prompt template () 2025-04-22 10:39:47 +08:00
distributed fix checkpoint naming; add num_epoch parameter () 2025-04-26 14:00:28 +08:00
experience_buffer Add GRPO and Support RLVR for PPO () 2025-02-18 09:43:36 +08:00
experience_maker Add GRPO and Support RLVR for PPO () 2025-02-18 09:43:36 +08:00
models Add GRPO and Support RLVR for PPO () 2025-02-18 09:43:36 +08:00
quant [ColossalChat] Update RLHF V2 () 2024-03-29 14:12:29 +08:00
ray [ColossalChat] Update RLHF V2 () 2024-03-29 14:12:29 +08:00
trainer [feat] Support DAPO () 2025-04-25 17:39:17 +08:00
utils Add GRPO and Support RLVR for PPO () 2025-02-18 09:43:36 +08:00
__init__.py [ColossalChat] Update RLHF V2 () 2024-03-29 14:12:29 +08:00