Files
ColossalAI/colossalai/inference/modeling
yuehuayingxueluo bfff9254ac [inference] Adapted to Rotary Embedding and RMS Norm (#5283)
* adapted to rotary_embedding

* adapted to nopad rms norm

* fix bugs in benchmark

* fix flash_decoding.py
2024-01-22 10:55:34 +08:00
..