Mirror of https://github.com/hpcaitech/ColossalAI.git, synced 2026-02-21 06:22:09 +00:00
279300dc5f34db219c90a297c0996d00221eae96
ColossalAI/examples/inference/benchmark_ops
Steve Luo | ccf72797e3 | feat baichuan2 rmsnorm whose hidden size equals to 5120 (#5611) | 2024-04-19 15:34:53 +08:00
File | Last commit | Date
benchmark_context_attn_unpad.py | [Inference]Move benchmark-related code to the example directory. (#5408) | 2024-02-28 16:46:03 +08:00
benchmark_decoding_attn.py | [Inference/Kernel] Add Paged Decoding kernel, sequence split within the same thread block (#5531) | 2024-04-18 16:45:07 +08:00
benchmark_flash_decoding_attention.py | [Inference/Kernel] Add Paged Decoding kernel, sequence split within the same thread block (#5531) | 2024-04-18 16:45:07 +08:00
benchmark_fused_rotary_embdding_unpad.py | [Inference/kernel]Add Fused Rotary Embedding and KVCache Memcopy CUDA Kernel (#5418) | 2024-03-13 17:20:03 +08:00
benchmark_kv_cache_memcopy.py | [Inference]Add CUDA KVCache Kernel (#5406) | 2024-02-28 14:36:50 +08:00
benchmark_rmsnorm.py | feat baichuan2 rmsnorm whose hidden size equals to 5120 (#5611) | 2024-04-19 15:34:53 +08:00
benchmark_rotary_embedding.py | [Inference/kernel]Add Fused Rotary Embedding and KVCache Memcopy CUDA Kernel (#5418) | 2024-03-13 17:20:03 +08:00
benchmark_xine_copy.py | [Inference/kernel]Add Fused Rotary Embedding and KVCache Memcopy CUDA Kernel (#5418) | 2024-03-13 17:20:03 +08:00