ver217
|
ce5a7dcab0
|
[zero] Update sharded model v2 using sharded param v2 (#323)
|
2022-03-08 18:18:06 +08:00 |
|
Jiarui Fang
|
cec05b25c9
|
[zero] update zero context init with the updated test utils (#327)
|
2022-03-08 14:45:01 +08:00 |
|
Jiarui Fang
|
d6abd933f2
|
[zero] zero init context (#321)
* add zero init context
* add more flags for zero init context
fix bug of repeated converting param to ShardedParamV2
* polish code
|
2022-03-07 16:14:40 +08:00 |
|
Jiarui Fang
|
29521cba0a
|
[zero] yet an improved sharded param (#311)
|
2022-03-04 15:49:23 +08:00 |
|
Jiarui Fang
|
2f6295bf78
|
[zero] polish shard strategy (#310)
* init shard param from shape tuple
* add more unitest for shard param
* add set_payload method for ShardedParam
* [zero] add shareded tensor class
* polish code
* add shard stratgy
* move shard and gather logic to shard strategy from shard tensor.
* polish code
|
2022-03-04 15:35:07 +08:00 |
|
ver217
|
2aa440358d
|
fix sharded param hook and unit test
|
2022-03-04 15:27:39 +08:00 |
|
Jiarui Fang
|
408cba655b
|
[zero] sharded tensor (#305)
* init shard param from shape tuple
* add more unitest for shard param
* add set_payload method for ShardedParam
* [zero] add shareded tensor class
* polish code
|
2022-03-04 10:46:13 +08:00 |
|
Jiarui Fang
|
6c78946fdd
|
Polish sharded parameter (#297)
* init shard param from shape tuple
* add more unitest for shard param
* add more unittests to shareded param
|
2022-03-03 12:42:57 +08:00 |
|