ColossalAI/applications/ColossalChat/examples/training_scripts
Hongxin Liu 56fe130b15
[hotfix] fix lora load (#6231)
* [hotfix] fix lora load

* [hotfix] fix hp load

* accelerate deepseek loading
2025-03-01 19:04:14 +08:00
Name                 Last commit                                      Date
hostfile             add SimPO                                        2024-06-24 02:12:20 +00:00
lora_config.json     [Chat] Fix lora (#5946)                          2024-07-31 14:10:17 +08:00
lora_finetune.py     [hotfix] fix lora load (#6231)                   2025-03-01 19:04:14 +08:00
lora_sft_data.jsonl  [application] add lora sft example data (#6198)  2025-02-18 20:18:18 +08:00
train_dpo.py         [Coati] Train DPO using PP (#6054)               2024-10-11 19:32:00 +08:00
train_dpo.sh         refactor evaluation                              2024-07-22 05:57:39 +00:00
train_grpo.py        Add GRPO and Support RLVR for PPO (#6186)        2025-02-18 09:43:36 +08:00
train_grpo.sh        Add GRPO and Support RLVR for PPO (#6186)        2025-02-18 09:43:36 +08:00
train_kto.py         [ColossalChat] Add PP support (#6001)            2024-08-21 10:47:39 +08:00
train_kto.sh         refactor evaluation                              2024-07-22 05:57:39 +00:00
train_orpo.py        [ColossalChat] Add PP support (#6001)            2024-08-21 10:47:39 +08:00
train_orpo.sh        refactor evaluation                              2024-07-22 05:57:39 +00:00
train_ppo.py         Add GRPO and Support RLVR for PPO (#6186)        2025-02-18 09:43:36 +08:00
train_ppo.sh         [ColossalChat] Hotfix for ColossalChat (#5910)   2024-07-19 13:40:07 +08:00
train_rm.py          [ColossalChat] Add PP support (#6001)            2024-08-21 10:47:39 +08:00
train_rm.sh          refactor evaluation                              2024-07-22 05:57:39 +00:00
train_sft.py         [Coati] Train DPO using PP (#6054)               2024-10-11 19:32:00 +08:00
train_sft.sh         [Chat] Fix lora (#5946)                          2024-07-31 14:10:17 +08:00