197 Commits

Author SHA1 Message Date
binmakeswell
039b0c487b [tutorial] polish README (#2568) 2023-02-04 17:49:52 +08:00
oahzxl
4f5ef73a43 [tutorial] update fastfold tutorial (#2565)
* update readme

* update

* update
2023-02-03 16:54:28 +08:00
Fazzie-Maqianli
79079a9d0c Merge pull request #2561 from Fazziekey/v2
bug/fix diffusion ckpt problem
2023-02-03 15:42:49 +08:00
Fazzie
cad1f50512 fix ckpt 2023-02-03 15:39:59 +08:00
YuliangLiu0306
f477a14f4a [hotfix] fix autoparallel demo (#2533) 2023-01-31 17:42:45 +08:00
HELSON
6e0faa70e0 [gemini] add profiler in the demo (#2534) 2023-01-31 14:21:22 +08:00
Fazzie
f35326881c fix README 2023-01-31 10:51:13 +08:00
HELSON
66dfcf5281 [gemini] update the gpt example (#2527) 2023-01-30 17:58:05 +08:00
LuGY
ecbad93b65 [example] Add fastfold tutorial (#2528)
* add fastfold example

* pre-commit polish

* pre-commit polish readme and add empty test ci

* Add test_ci and reduce the default sequence length
2023-01-30 17:08:18 +08:00
Jiarui Fang
fd8d19a6e7 [example] update lightning dependency for stable diffusion (#2522) 2023-01-29 13:52:15 +08:00
HELSON
707b11d4a0 [gemini] update ddp strict mode (#2518)
* [zero] add strict ddp mode for chunk init

* [gemini] update gpt example
2023-01-28 14:35:25 +08:00
HELSON
2d1a7dfe5f [zero] add strict ddp mode (#2508)
* [zero] add strict ddp mode

* [polish] add comments for strict ddp mode

* [zero] fix test error
2023-01-20 14:04:38 +08:00
jiaruifang
32390cbe8f add test_ci.sh to dreambooth 2023-01-19 09:46:28 +08:00
jiaruifang
025b482dc1 [example] dreambooth example 2023-01-18 18:42:56 +08:00
jiaruifang
e58cc441e2 polish code and fix dataloader bugs 2023-01-18 12:00:08 +08:00
jiaruifang
a4b75b78a0 [hotfix] gpt example titans bug #2493 2023-01-18 11:37:16 +08:00
binmakeswell
fcc6d61d92 [example] fix requirements (#2488) 2023-01-17 13:07:25 +08:00
Jiarui Fang
3a21485ead [example] titans for gpt (#2484) 2023-01-16 15:55:41 +08:00
Jiarui Fang
7c31706227 [CI] add test_ci.sh for palm, opt and gpt (#2475) 2023-01-16 14:44:29 +08:00
Jiarui Fang
e4c38ba367 [example] stable diffusion add roadmap (#2482) 2023-01-16 12:14:49 +08:00
ver217
f525d1f528 [example] update gpt gemini example ci test (#2477) 2023-01-13 22:37:31 +08:00
Ziyue Jiang
fef5c949c3 polish pp middleware (#2476)
Co-authored-by: Ziyue Jiang <ziyue.jiang@gmail.com>
2023-01-13 16:56:01 +08:00
Frank Lee
8b7495dd54 [example] integrate seq-parallel tutorial with CI (#2463) 2023-01-13 14:40:05 +08:00
ver217
8e85d2440a [example] update vit ci script (#2469)
* [example] update vit ci script

* [example] update requirements

* [example] update requirements
2023-01-13 13:31:27 +08:00
Jiarui Fang
867c8c2d3a [zero] low level optim supports ProcessGroup (#2464) 2023-01-13 10:05:58 +08:00
Frank Lee
e6943e2d11 [example] integrate autoparallel demo with CI (#2466)
* [example] integrate autoparallel demo with CI

* polish code

* polish code

* polish code

* polish code
2023-01-12 16:26:42 +08:00
YuliangLiu0306
c20529fe78 [examples] update autoparallel tutorial demo (#2449)
* [examples] update autoparallel tutorial demo

* add test_ci.sh

* polish

* add conda yaml
2023-01-12 14:30:58 +08:00
Haofan Wang
cfd1d5ee49 [example] fixed seed error in train_dreambooth_colossalai.py (#2445) 2023-01-11 16:56:15 +08:00
Frank Lee
ac18a445fa [example] updated large-batch optimizer tutorial (#2448)
* [example] updated large-batch optimizer tutorial

* polish code

* polish code
2023-01-11 16:27:31 +08:00
Frank Lee
39163417a1 [example] updated the hybrid parallel tutorial (#2444)
* [example] updated the hybrid parallel tutorial

* polish code
2023-01-11 15:17:17 +08:00
YuliangLiu0306
2731531bc2 [autoparallel] integrate device mesh initialization into autoparallelize (#2393)
* [autoparallel] integrate device mesh initialization into autoparallelize

* add megatron solution

* update gpt autoparallel examples with latest api

* adapt beta value to fit the current computation cost
2023-01-11 14:03:49 +08:00
Frank Lee
a3e5496156 [example] improved the clarity of the example readme (#2427)
* [example] improved the clarity of the example readme

* polish workflow

* polish workflow

* polish workflow

* polish workflow

* polish workflow

* polish workflow
2023-01-11 10:46:32 +08:00
Frank Lee
63be79d505 [example] removed duplicated stable diffusion example (#2424) 2023-01-11 10:07:18 +08:00
ZijianYY
fe0f7970a2 [examples] adding tflops to PaLM (#2365) 2023-01-10 16:18:56 +08:00
HELSON
d84e747975 [hotfix] add DISTPAN argument for benchmark (#2412)
* change the benchmark config file

* change config

* revert config file

* rename distpan to distplan
2023-01-10 11:39:25 +08:00
Frank Lee
8327932d2c [workflow] refactored the example check workflow (#2411)
* [workflow] refactored the example check workflow

* polish code

* polish code

* polish code

* polish code

* polish code

* polish code

* polish code

* polish code

* polish code

* polish code

* polish code
2023-01-10 11:26:19 +08:00
HELSON
498b5ca993 [hotfix] fix gpt gemini example (#2404)
* [hotfix] fix gpt gemini example

* [example] add new assertions
2023-01-09 15:52:17 +08:00
jiaruifang
b2e0d502b8 [doc] hotfix #2377 2023-01-07 19:44:50 +08:00
Jiarui Fang
8f72b6f8fb [hotfix] fix implement error in diffusers 2023-01-07 07:56:39 +08:00
1SAA
33f3023e19 [hotfix] fix implement error in diffusers 2023-01-06 18:37:18 +08:00
Jiarui Fang
12c8bf38d7 [Pipeline] Refine GPT PP Example 2023-01-06 18:03:45 +08:00
Ziyue Jiang
ad00894f7f polish 2023-01-06 16:03:16 +08:00
Jiarui Fang
1aaeb596c6 [example] gpt, shard init on all processes (#2366) 2023-01-06 15:44:50 +08:00
Ziyue Jiang
3a15b20421 Move GPT PP Example 2023-01-06 14:48:58 +08:00
HELSON
48d33b1b17 [gemini] add get static torch model (#2356) 2023-01-06 13:41:19 +08:00
Fazzie-Maqianli
7a332b1734 Merge pull request #2338 from haofanwang/patch-1
Fix a typo in train_dreambooth_colossalai.py
2023-01-06 11:50:18 +08:00
YuliangLiu0306
8b1e0dfd80 [example] upload auto parallel gpt2 demo (#2354) 2023-01-06 11:38:38 +08:00
Jiarui Fang
00a9c781fd [example] add google doc for benchmark results of GPT (#2355) 2023-01-06 11:38:15 +08:00
Jiarui Fang
509a87f3ff [example] make gpt example directory more clear (#2353) 2023-01-06 11:11:26 +08:00
Ikko Eltociear Ashimine
5e4bced0a3 [NFC] Update roberta/README.md (#2350) 2023-01-06 10:09:14 +08:00