Wenhao Chen
|
b03d64d010
|
[chat] refactor trainer class (#4080)
* to: add SLTrainer
* refactor: refactor RMTrainer and SFTTrainer
* fix: fix init file
* feat: remove on_learn_epoch fn as not used
* fix: align with modified gemini arguments
* to: add OnPolicyTrainer
* revert: add _on_learn_epoch fn
* refactor: refactor PPOTrainer
* style: rename PPOTrainer argument
* fix: align with modified PPO arguments
* test: align with modified train_prompts arguments
* chore: modify train_prompts
* docs: align with modified arguments
* fix: remove unnecessary output
* fix: move dataloader to fit fn of SLTrainer
* fix: move dataloader to fit fn of OnPolicyTrainer
* fix: modify usage of prompt and pretrain dataloader
|
2023-06-29 10:48:09 +08:00 |
|
digger-yu
|
ad6460cf2c
|
[NFC] fix typo applications/ and colossalai/ (#3735)
|
2023-05-15 11:46:25 +08:00 |
|
digger-yu
|
65bdc3159f
|
fix some spelling error with applications/Chat/examples/ (#3692)
* fix spelling error with examples/comminity/
* fix spelling error with example/
|
2023-05-06 11:27:23 +08:00 |
|
tanitna
|
1a60dc07a8
|
[chat] typo accimulation_steps -> accumulation_steps (#3662)
|
2023-04-28 15:42:57 +08:00 |
|
digger-yu
|
d7bf284706
|
[chat] polish code note typo (#3612)
|
2023-04-20 17:22:15 +08:00 |
|
Fazzie-Maqianli
|
6afeb1202a
|
add community example dictionary (#3465)
|
2023-04-06 15:04:48 +08:00 |
|