Yuanheng Zhao
537a3cbc4d
[kernel] Support New KCache Layout - Triton Kernel (#5677)
* kvmemcpy triton for new kcache layout
* revise tests for new kcache layout
* naive triton flash decoding - new kcache layout
* rotary triton kernel - new kcache layout
* remove redundancy - triton decoding
* remove redundancy - triton kvcache copy
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
2024-05-03 17:20:45 +08:00