mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2025-09-07 12:01:39 +00:00
[Inference]Fused the gate and up proj in mlp,and optimized the autograd process. (#5365)
* fused the gate and up proj in mlp * fix code styles * opt auto_grad * rollback test_inference_engine.py * modifications based on the review feedback. * fix bugs in flash attn * Change reshape to view * fix test_rmsnorm_triton.py
This commit is contained in:
@@ -10,7 +10,7 @@ def greedy_sample(
|
||||
"""
|
||||
Sample tokens greedyly.
|
||||
"""
|
||||
results = torch.argmax(logprobs, dim=-1).cpu()
|
||||
results = torch.argmax(logprobs, dim=-1)
|
||||
return results
|
||||
|
||||
|
||||
|
Reference in New Issue
Block a user