hostfile | add SimPO | 2024-06-24 02:12:20 +00:00 |
lora_config.json | [Chat] Fix lora (#5946) | 2024-07-31 14:10:17 +08:00 |
lora_finetune.py | [hotfix] fix lora load (#6231) | 2025-03-01 19:04:14 +08:00 |
train_dpo.py | [Coati] Train DPO using PP (#6054) | 2024-10-11 19:32:00 +08:00 |
train_dpo.sh | refactor evaluation | 2024-07-22 05:57:39 +00:00 |
train_grpo.py | Add GRPO and Support RLVR for PPO (#6186) | 2025-02-18 09:43:36 +08:00 |
train_grpo.sh | Add GRPO and Support RLVR for PPO (#6186) | 2025-02-18 09:43:36 +08:00 |
train_kto.py | [ColossalChat] Add PP support (#6001) | 2024-08-21 10:47:39 +08:00 |
train_kto.sh | refactor evaluation | 2024-07-22 05:57:39 +00:00 |
train_orpo.py | [ColossalChat] Add PP support (#6001) | 2024-08-21 10:47:39 +08:00 |
train_orpo.sh | refactor evaluation | 2024-07-22 05:57:39 +00:00 |
train_ppo.py | Add GRPO and Support RLVR for PPO (#6186) | 2025-02-18 09:43:36 +08:00 |
train_rm.py | [ColossalChat] Add PP support (#6001) | 2024-08-21 10:47:39 +08:00 |
train_rm.sh | refactor evaluation | 2024-07-22 05:57:39 +00:00 |
train_sft.py | [Coati] Train DPO using PP (#6054) | 2024-10-11 19:32:00 +08:00 |
train_sft.sh | [Chat] Fix lora (#5946) | 2024-07-31 14:10:17 +08:00 |