This website requires JavaScript.
Explore
Help
Register
Sign In
github
/
ColossalAI
Watch
1
Star
0
Fork
0
You've already forked ColossalAI
mirror of
https://github.com/hpcaitech/ColossalAI.git
synced
2025-09-18 07:31:19 +00:00
Code
Issues
Packages
Projects
Releases
Wiki
Activity
Files
grpo-latest-dev-reward-update
ColossalAI
/
applications
/
ColossalChat
/
coati
History
YeAnbang
d06042b434
rewrite reward fn
2025-05-01 11:28:05 +08:00
..
dataset
add prompt template (
#6273
)
2025-04-22 10:39:47 +08:00
distributed
rewrite reward fn
2025-05-01 11:28:05 +08:00
experience_buffer
Add GRPO and Support RLVR for PPO (
#6186
)
2025-02-18 09:43:36 +08:00
experience_maker
Add GRPO and Support RLVR for PPO (
#6186
)
2025-02-18 09:43:36 +08:00
models
Add GRPO and Support RLVR for PPO (
#6186
)
2025-02-18 09:43:36 +08:00
quant
…
ray
…
trainer
[feat] Support DAPO (
#6263
)
2025-04-25 17:39:17 +08:00
utils
Add GRPO and Support RLVR for PPO (
#6186
)
2025-02-18 09:43:36 +08:00
__init__.py
…