ColossalAI/colossalai/zero
HELSON a9b8300d54
[zero] improve adaptability for not-shard parameters (#708)
* adapt post grad hooks for not-shard parameters
* adapt optimizer for not-shard parameters
* offload gradients for not-replicated parameters
2022-04-11 13:38:51 +08:00
..
init_ctx [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
shard_utils [zero] refactor memstats collector (#706) 2022-04-11 10:46:08 +08:00
sharded_model [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
sharded_optim [zero] improve adaptability for not-shard parameters (#708) 2022-04-11 13:38:51 +08:00
sharded_param [zero] adapt zero hooks for unsharded module (#699) 2022-04-08 20:23:26 +08:00
__init__.py [refactor] remove old zero code (#517) 2022-03-25 14:54:39 +08:00