ColossalAI/applications/ColossalChat/coati/trainer
Tong Li f585d4e38e
[ColossalChat] Hotfix for ColossalChat (#5910)
* add ignore and tiny llama

* fix path issue

* run style

* fix issue

* update bash

* add ignore and tiny llama

* fix path issue

* run style

* fix issue

* update bash

* fix ddp issue

* add Qwen 1.5 32B
2024-07-19 13:40:07 +08:00
..
callbacks [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
__init__.py add orpo 2024-06-27 07:20:28 +00:00
base.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
dpo.py fix eval 2024-07-11 03:35:03 +00:00
orpo.py fix orpo cross entropy loss 2024-07-15 02:12:05 +00:00
ppo.py [ColossalChat] Update RLHF V2 (#5286) 2024-03-29 14:12:29 +08:00
rm.py fix eval 2024-07-11 03:35:03 +00:00
sft.py [ColossalChat] Hotfix for ColossalChat (#5910) 2024-07-19 13:40:07 +08:00
utils.py [pre-commit.ci] pre-commit autoupdate (#5572) 2024-07-01 17:16:41 +08:00