Yuanheng Zhao
d63c469f45
[Infer] Revise and Adapt Triton Kernels for Spec-Dec (#5401)
* [Infer/Fix] Fix Dependency in test - RMSNorm kernel (#5399)
fix dependency in pytest
* resolve conflicts for revising flash-attn
* adapt kv cache copy kernel for spec-dec
* fix seqlen-n kvcache copy kernel/tests
* test kvcache copy - use torch.equal
* add assertions
* (trivial) comment out
2024-04-10 11:07:51 +08:00