mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2026-05-03 01:19:15 +00:00
* place params on cpu after zero init context * polish code * bucketzed cpu gpu tensor transter * find a bug in sharded optim unittest * add offload unittest for ShardedOptimV2. * polish code and make it more robust