mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2025-09-08 04:24:47 +00:00
[gemini] hotfix NaN loss while using Gemini + tensor_parallel (#5150)
* fix aaa fix fix fix
* fix
* fix
* test ci
* fix ci fix
@@ -61,7 +61,7 @@ loss_fn = lambda x: x.loss
 config = transformers.GPTJConfig(
     n_layer=2,
-    n_head=16,
+    n_head=4,
     vocab_size=50258,
     attn_pdrop=0,
     embd_pdrop=0,