ColossalAI/applications/ColossalChat/rl_example.py
sglucas 083766d54c Add new implementations of RL algorithms (#6383)
* add new algorithm

* move common calculations

* delete data

* move common calculations of rewards

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2025-09-03 13:48:06 +08:00

19 KiB