ColossalAI/applications/ColossalChat/coati/distributed/reward
YeAnbang 14f237ce7e
[feat] Support boxed math reward (#6284)
* fix pp+tp, fix dataloader

* fixed plugin micro-batch size

* support boxed reward

* add boxed reward

* fix pp state dict incomplete issue

* Revert "fix pp state dict incomplete issue"

This reverts commit 6c1b3b694f.
2025-04-29 16:46:47 +08:00
..
reward_fn.py [feat] Support boxed math reward (#6284) 2025-04-29 16:46:47 +08:00
reward_utils.py [feat] Support boxed math reward (#6284) 2025-04-29 16:46:47 +08:00
verifiable_reward.py update reward 2025-03-10 14:19:10 +08:00