add example (#3286)
@@ -18,6 +18,8 @@
 - [Stage2 - Training reward model](#stage2---training-reward-model)
 - [Stage3 - Training model with reinforcement learning by human feedback](#stage3---training-model-with-reinforcement-learning-by-human-feedback)
 - [Coati7B examples](#coati7b-examples)
+- [Generation](#generation)
+- [Open QA](#open-qa)
 - [FAQ](#faq)
 - [How to save/load checkpoint](#how-to-saveload-checkpoint)
 - [The Plan](#the-plan)
@@ -77,6 +79,7 @@ pip install .
 ### Supervised datasets collection
 
 we collected 104K bilingual dataset of Chinese and English, and you can find the datasets in this repo
+[InstructionWild](https://github.com/XueFuzhao/InstructionWild)
 
 Here is how we collected the data
 <p align="center">
@@ -143,6 +146,73 @@ We also support training reward model with true-world data. See `examples/train_
 
 ## Coati7B examples
 
+### Generation
+
+<details><summary><b>E-mail</b></summary>
+
+
+</details>
+
+<details><summary><b>coding</b></summary>
+
+
+
+</details>
+
+<details><summary><b>regex</b></summary>
+
+
+
+</details>
+
+<details><summary><b>Tex</b></summary>
+
+
+
+</details>
+
+<details><summary><b>writing</b></summary>
+
+
+
+</details>
+
+<details><summary><b>Table</b></summary>
+
+
+
+</details>
+
+### Open QA
+<details><summary><b>Game</b></summary>
+
+
+
+</details>
+
+<details><summary><b>Travel</b></summary>
+
+
+
+</details>
+
+<details><summary><b>Physical</b></summary>
+
+
+
+</details>
+
+<details><summary><b>Chemical</b></summary>
+
+
+
+</details>
+
+<details><summary><b>Economy</b></summary>
+
+
+
+</details>
 
 ## FAQ
 
BIN applications/Chat/assets/Phd.png (normal file, 273 KiB)
BIN applications/Chat/assets/chemical.png (normal file, 307 KiB)
BIN applications/Chat/assets/economy.png (normal file, 390 KiB)
BIN applications/Chat/assets/game.png (normal file, 403 KiB)
BIN applications/Chat/assets/physical.png (normal file, 171 KiB)
BIN applications/Chat/assets/quick_sort.png (normal file, 173 KiB)
BIN applications/Chat/assets/regex.png (normal file, 40 KiB)
BIN applications/Chat/assets/table.png (normal file, 116 KiB)
BIN applications/Chat/assets/tex.png (normal file, 284 KiB)
BIN applications/Chat/assets/travel.png (normal file, 230 KiB)
BIN applications/Chat/assets/writing.png (normal file, 229 KiB)