ColossalAI/applications/ColossalChat/coati/distributed/reward
2025-06-05 18:05:22 +08:00
..
code_reward support code generation tasks 2025-06-05 18:05:22 +08:00
reward_fn.py support code generation tasks 2025-06-05 18:05:22 +08:00
reward_utils.py [feat] Support boxed math reward (#6284) 2025-04-29 16:46:47 +08:00
verifiable_reward.py support code generation tasks 2025-06-05 18:05:22 +08:00