ColossalAI/applications/ColossalChat/examples/training_scripts
2024-07-19 15:23:31 +08:00
hostfile add SimPO 2024-06-24 02:12:20 +00:00
train_dpo.py fix orpo cross entropy loss 2024-07-15 02:12:05 +00:00
train_dpo.sh [ColossalChat] Hotfix for ColossalChat (#5910) 2024-07-19 13:40:07 +08:00
train_kto.py add kto 2024-07-18 07:54:11 +00:00
train_kto.sh fix style, add kto data sample 2024-07-18 08:38:56 +00:00
train_orpo.py fix orpo cross entropy loss 2024-07-15 02:12:05 +00:00
train_orpo.sh [ColossalChat] Hotfix for ColossalChat (#5910) 2024-07-19 13:40:07 +08:00
train_ppo.py replace the customized dataloader setup with the build-in one 2024-06-07 09:43:42 +00:00
train_ppo.sh [ColossalChat] Hotfix for ColossalChat (#5910) 2024-07-19 13:40:07 +08:00
train_rm.py fix orpo cross entropy loss 2024-07-15 02:12:05 +00:00
train_rm.sh [ColossalChat] Hotfix for ColossalChat (#5910) 2024-07-19 13:40:07 +08:00
train_sft.py [ColossalChat] Hotfix for ColossalChat (#5910) 2024-07-19 13:40:07 +08:00
train_sft.sh Merge branch 'main' into kto 2024-07-19 15:23:31 +08:00