yuehuayingxueluo
b45000f839
[Inference]Add Streaming LLM (#5745)
* Add Streaming LLM
* add some parameters to llama_generation.py
* verify streamingllm config
* add test_streamingllm.py
* modified according to the opinions of review
* add Citation
* change _block_tables tolist
2024-06-05 10:51:19 +08:00
..
2024-06-03 15:26:01 +08:00
2024-03-05 21:52:30 +08:00
2024-01-09 10:20:05 +08:00
2024-04-29 10:40:11 +08:00
2024-04-18 18:15:50 +08:00
2024-05-29 15:35:54 +08:00
2024-04-28 10:51:27 +08:00
2024-03-05 15:35:54 +08:00
2024-05-14 13:52:45 +08:00
2024-04-25 14:45:52 +08:00
2024-05-14 13:52:45 +08:00
2024-06-03 15:26:01 +08:00
2024-06-05 10:51:19 +08:00
2024-05-14 13:52:45 +08:00
2024-05-21 22:12:15 +08:00
2024-05-17 18:18:59 +08:00
2024-05-20 15:50:53 +00:00
2023-09-19 14:20:26 +08:00
2024-03-05 21:52:30 +08:00
2024-05-24 17:24:16 +08:00
2024-04-28 10:51:27 +08:00
2024-04-28 10:51:27 +08:00
2024-06-03 11:25:18 +08:00
2024-05-14 13:52:45 +08:00
2024-04-18 16:10:18 +08:00
2024-01-29 13:49:39 +08:00
2024-05-28 05:16:02 +00:00
2024-04-08 15:09:40 +08:00
2024-04-29 10:40:11 +08:00