Files
ColossalAI/colossalai/kernel/triton/flash_decoding.py
Yuanheng Zhao 3da9993b0d [Kernel/Fix] Revise flash attention triton kernel API and add benchmark (#5301)
* fix decoding kernel pytest

* revise and add triton context attn benchmark
2024-01-23 17:16:02 +08:00

11 KiB