236 Commits

Author SHA1 Message Date
YuliangLiu0306
fd6add575d [examples] polish AutoParallel readme (#3270) 2023-03-28 10:40:07 +08:00
Frank Lee
73d3e4d309 [booster] implemented the torch ddd + resnet example (#3232)
* [booster] implemented the torch ddd + resnet example

* polish code
2023-03-27 10:24:14 +08:00
NatalieC323
280fcdc485 polish code (#3194)
Co-authored-by: YuliangLiu0306 <72588413+YuliangLiu0306@users.noreply.github.com>
2023-03-24 18:44:43 +08:00
Yan Fang
189347963a [auto] fix requirements typo for issue #3125 (#3209) 2023-03-23 10:22:08 +08:00
NatalieC323
e5f668f280 [dreambooth] fixing the incompatibity in requirements.txt (#3190)
* Update requirements.txt

* Update environment.yaml

* Update README.md

* Update environment.yaml

* Update README.md

* Update README.md

* Delete requirements_colossalai.txt

* Update requirements.txt

* Update README.md
2023-03-21 16:01:13 +08:00
Zihao
18dbe76cae [auto-parallel] add auto-offload feature (#3154)
* add auto-offload feature

* polish code

* fix syn offload runtime pass bug

* add offload example

* fix offload testing bug

* fix example testing bug
2023-03-21 14:17:41 +08:00
NatalieC323
4e921cfbd6 [examples] Solving the diffusion issue of incompatibility issue#3169 (#3170)
* Update requirements.txt

* Update environment.yaml

* Update README.md

* Update environment.yaml
2023-03-20 14:19:05 +08:00
binmakeswell
3c01280a56 [doc] add community contribution guide (#3153)
* [doc] update contribution guide

* [doc] update contribution guide

* [doc] add community contribution guide
2023-03-17 11:07:24 +08:00
github-actions[bot]
0aa92c0409 Automated submodule synchronization (#3105)
Co-authored-by: github-actions <github-actions@github.com>
2023-03-13 08:58:06 +08:00
binmakeswell
018936a3f3 [tutorial] update notes for TransformerEngine (#3098) 2023-03-10 16:30:52 +08:00
Kirthi Shankar Sivamani
65a4dbda6c [NVIDIA] Add FP8 example using TE (#3080)
Signed-off-by: Kirthi Shankar Sivamani <ksivamani@nvidia.com>
2023-03-10 16:24:08 +08:00
Fazzie-Maqianli
5d5f475d75 [diffusers] fix ci and docker (#3085) 2023-03-10 10:35:15 +08:00
Camille Zhong
e58a3c804c Fix the version of lightning and colossalai in Stable Diffusion environment requirement (#3073)
1. Modify the README of stable diffusion
2. Fix the version of pytorch lightning&lightning and colossalai version to enable codes running successfully.
2023-03-10 09:55:58 +08:00
binmakeswell
360674283d [example] fix redundant note (#3065) 2023-03-09 10:59:28 +08:00
Tomek
af3888481d [example] fixed opt model downloading from huggingface 2023-03-09 10:47:41 +08:00
ramos
2ef855c798 support shardinit option to avoid OPT OOM initializing problem (#3037)
Co-authored-by: poe <poe@nemoramo>
2023-03-08 13:45:15 +08:00
Ziyue Jiang
400f63012e [pipeline] Add Simplified Alpa DP Partition (#2507)
* add alpa dp split

* add alpa dp split

* use fwd+bwd instead of fwd only

---------

Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2023-03-07 10:34:31 +08:00
binmakeswell
52a5078988 [doc] add ISC tutorial (#2997)
* [doc] add ISC tutorial

* [doc] add ISC tutorial

* [doc] add ISC tutorial

* [doc] add ISC tutorial
2023-03-06 10:36:38 +08:00
github-actions[bot]
827a0af8cc Automated submodule synchronization (#2982)
Co-authored-by: github-actions <github-actions@github.com>
2023-03-03 10:55:45 +08:00
github-actions[bot]
da056285f2 [format] applied code formatting on changed files in pull request 2922 (#2923)
Co-authored-by: github-actions <github-actions@github.com>
2023-02-27 19:29:06 +08:00
binmakeswell
12bafe057f [doc] update installation for GPT (#2922) 2023-02-27 18:28:34 +08:00
binmakeswell
0afb55fc5b [doc] add os scope, update tutorial install and tips (#2914) 2023-02-27 14:59:27 +08:00
Alex_996
a4fc125c34 Fix typos (#2863)
Fix typos, `6.7 -> 6.7b`
2023-02-22 10:59:48 +08:00
dawei-wang
55424a16a5 [doc] fix GPT tutorial (#2860)
Fix hpcaitech/ColossalAI#2851
2023-02-22 10:58:52 +08:00
Zheng Zeng
597914317b [doc] fix typo in opt inference tutorial (#2849) 2023-02-21 17:16:13 +08:00
github-actions[bot]
a5721229d9 Automated submodule synchronization (#2740)
Co-authored-by: github-actions <github-actions@github.com>
2023-02-20 17:35:46 +08:00
Haofan Wang
47ecb22387 [example] add LoRA support (#2821)
* add lora

* format
2023-02-20 16:23:12 +08:00
Jiarui Fang
bf0204604f [exmaple] add bert and albert (#2824) 2023-02-20 10:35:55 +08:00
Fazzie-Maqianli
ba84cd80b2 fix pip install colossal (#2764) 2023-02-17 09:54:21 +08:00
cloudhuang
43dffdaba5 [doc] fixed a typo in GPT readme (#2736) 2023-02-15 22:24:45 +08:00
Fazzie-Maqianli
d03f4429c1 add ci (#2641) 2023-02-15 09:55:53 +08:00
github-actions[bot]
d701ef81b1 Automated submodule synchronization (#2707)
Co-authored-by: github-actions <github-actions@github.com>
2023-02-15 09:39:44 +08:00
github-actions[bot]
88416019e7 Automated submodule synchronization (#2648)
Co-authored-by: github-actions <github-actions@github.com>
2023-02-13 18:10:54 +08:00
binmakeswell
9ab14b20b5 [doc] add CVPR tutorial (#2666) 2023-02-10 20:43:34 +08:00
Jiatong (Julius) Han
a255a38f7f [example] Polish README.md (#2658)
* [tutorial] polish readme.md

* [example] Update README.md
2023-02-09 20:43:55 +08:00
Fazzie-Maqianli
292c81ed7c fix/transformer-verison (#2581) 2023-02-08 13:50:27 +08:00
Frank Lee
4ae02c4b1c [tutorial] added energonai to opt inference requirements (#2625) 2023-02-07 16:58:06 +08:00
binmakeswell
0556f5d468 [tutorial] add video link (#2619) 2023-02-07 15:14:51 +08:00
github-actions[bot]
ae86be1fd2 Automated submodule synchronization (#2607)
Co-authored-by: github-actions <github-actions@github.com>
2023-02-07 09:33:27 +08:00
binmakeswell
039b0c487b [tutorial] polish README (#2568) 2023-02-04 17:49:52 +08:00
oahzxl
4f5ef73a43 [tutorial] update fastfold tutorial (#2565)
* update readme

* update

* update
2023-02-03 16:54:28 +08:00
Fazzie-Maqianli
79079a9d0c Merge pull request #2561 from Fazziekey/v2
bug/fix diffusion ckpt problem
2023-02-03 15:42:49 +08:00
Fazzie
cad1f50512 fix ckpt 2023-02-03 15:39:59 +08:00
YuliangLiu0306
f477a14f4a [hotfix] fix autoparallel demo (#2533) 2023-01-31 17:42:45 +08:00
HELSON
6e0faa70e0 [gemini] add profiler in the demo (#2534) 2023-01-31 14:21:22 +08:00
Fazzie
f35326881c fix README 2023-01-31 10:51:13 +08:00
HELSON
66dfcf5281 [gemini] update the gpt example (#2527) 2023-01-30 17:58:05 +08:00
LuGY
ecbad93b65 [example] Add fastfold tutorial (#2528)
* add fastfold example

* pre-commit polish

* pre-commit polish readme and add empty test ci

* Add test_ci and reduce the default sequence length
2023-01-30 17:08:18 +08:00
Jiarui Fang
fd8d19a6e7 [example] update lightning dependency for stable diffusion (#2522) 2023-01-29 13:52:15 +08:00
HELSON
707b11d4a0 [gemini] update ddp strict mode (#2518)
* [zero] add strict ddp mode for chunk init

* [gemini] update gpt example
2023-01-28 14:35:25 +08:00