github/ColossalAI

mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-04-26 09:42:27 +00:00

Files

History

ver217 dbe62c67b8 add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

..

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

acc.jpeg

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

hooks.py

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

loss.jpeg

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

mixup.py

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

README.md

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

train_dali.py

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

vit-b16.py

add an example of ViT-B/16 and remove w_norm clipping in LAMB (#29 )

2021-11-18 23:45:09 +08:00

README.md

Overview

Here is an example of training ViT-B/16 on Imagenet-1K. We use 8x A100 in this example. For simplicity and speed, we didn't apply RandAug and we just used Mixup. With LAMB optimizer, we can scale the batch size to 32K with a little accuracy loss.

How to run

Using slurm:

srun python train_dali.py --local_rank=$SLURM_PROCID --world_size=$SLURM_NPROCS --host=$HOST --port=29500 --config=vit-b16.py

Results