Mirror of https://github.com/hpcaitech/ColossalAI.git
ColossalAI/colossalai/kernel/triton
Tree: 1f8a75d470d548bfd4db877e73102b8fad5cdfa9
Latest commit: 1f8a75d470 by Jianghai, 2024-01-29 10:22:33 +08:00
[Inference] Update rms norm kernel, benchmark with vLLM (#5315)
(commit body: add, xi, del, del, fix)
File                         Last commit                                                                    Date
__init__.py                  [inference]Optimize the usage of the mid tensors space in flash attn (#5304)  2024-01-26 14:00:10 +08:00
context_attn_unpad.py        [inference]Optimize the usage of the mid tensors space in flash attn (#5304)  2024-01-26 14:00:10 +08:00
custom_autotune.py           add autotune (#4822)                                                           2023-09-28 13:47:35 +08:00
flash_decoding.py            [inference]Optimize the usage of the mid tensors space in flash attn (#5304)  2024-01-26 14:00:10 +08:00
fused_rotary_embedding.py    fix (#5311)                                                                    2024-01-26 15:02:12 +08:00
gptq_triton.py               [inference] add reference and fix some bugs (#4937)                            2023-10-20 13:39:34 +08:00
kvcache_copy.py              [inference] Adapted to Rotary Embedding and RMS Norm (#5283)                   2024-01-22 10:55:34 +08:00
llama_act_combine_kernel.py  [moe] merge moe into main (#4978)                                              2023-11-02 02:21:24 +00:00
no_pad_rotary_embedding.py   fix (#5311)                                                                    2024-01-26 15:02:12 +08:00
qkv_matmul_kernel.py         [misc] update pre-commit and run all files (#4752)                             2023-09-19 14:20:26 +08:00
rms_layernorm.py             [Inference] Update rms norm kernel, benchmark with vLLM (#5315)                2024-01-29 10:22:33 +08:00
rotary_cache_copy.py         fix (#5311)                                                                    2024-01-26 15:02:12 +08:00
softmax.py                   [misc] update pre-commit and run all files (#4752)                             2023-09-19 14:20:26 +08:00
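
For context, below is a minimal sketch of what a row-wise RMS-norm Triton kernel in this directory might look like. The kernel and wrapper names (_rms_norm_fwd, rms_layernorm) and the exact signature are illustrative assumptions, not the actual rms_layernorm.py implementation, which per the latest commit is benchmarked against vLLM and is likely more heavily optimized. Unlike LayerNorm, RMSNorm skips mean subtraction, so a single fused pass over each row suffices.

# Illustrative sketch only; names and signature are hypothetical,
# not the actual ColossalAI rms_layernorm.py kernel.
import torch
import triton
import triton.language as tl


@triton.jit
def _rms_norm_fwd(
    X,                       # pointer to input,  shape (n_rows, n_cols)
    Y,                       # pointer to output, shape (n_rows, n_cols)
    W,                       # pointer to learnable weight, shape (n_cols,)
    stride,                  # elements between consecutive rows of X/Y
    n_cols,                  # hidden size
    eps,                     # numerical-stability epsilon
    BLOCK_SIZE: tl.constexpr,
):
    # One program instance normalizes one row.
    row = tl.program_id(0)
    cols = tl.arange(0, BLOCK_SIZE)
    mask = cols < n_cols

    x = tl.load(X + row * stride + cols, mask=mask, other=0.0).to(tl.float32)
    # RMSNorm: y = x / sqrt(mean(x^2) + eps) * w  (no mean subtraction)
    rms = tl.sqrt(tl.sum(x * x, axis=0) / n_cols + eps)
    w = tl.load(W + cols, mask=mask, other=0.0).to(tl.float32)
    tl.store(Y + row * stride + cols, x / rms * w, mask=mask)


def rms_layernorm(x: torch.Tensor, weight: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    # x: 2D CUDA tensor; launch one kernel program per row.
    n_rows, n_cols = x.shape
    y = torch.empty_like(x)
    # BLOCK_SIZE must be a power of two covering the whole row.
    BLOCK_SIZE = triton.next_power_of_2(n_cols)
    _rms_norm_fwd[(n_rows,)](x, y, weight, x.stride(0), n_cols, eps, BLOCK_SIZE=BLOCK_SIZE)
    return y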