Baizhou Zhang
a14d352088
[pipeline] add pipeline forward for variants of gpt2 (#4238)
* add forward for GPT2LMHeadModel
* add test for gpt_lm
* arrange get_held_layers method
* arrange forward replacement
* add forward for GPT2ForTokenClassification
* add forward for GPT2ForSequenceClassification
* fix test_shard_gpt2.py
* add GPT2DoubleHeadsModel & fix bugs
* add id checking in get_shared_params
2023-08-15 23:25:14 +08:00