Files
ColossalAI/colossalai/nn/optimizer
HELSON a9b8300d54 [zero] improve adaptability for not-shard parameters (#708)
* adapt post grad hooks for not-shard parameters
* adapt optimizer for not-shard parameters
* offload gradients for not-replicated parameters
2022-04-11 13:38:51 +08:00
..
2022-04-02 17:04:05 +08:00
2022-04-01 16:27:03 +08:00
2022-04-02 17:04:05 +08:00
2022-01-21 10:44:30 +08:00
2022-01-21 10:44:30 +08:00
2022-04-02 17:04:05 +08:00