yuehuayingxueluo
b45000f839
[Inference]Add Streaming LLM (#5745)
* Add Streaming LLM
* add some parameters to llama_generation.py
* verify streamingllm config
* add test_streamingllm.py
* modified according to the opinions of review
* add Citation
* change _block_tables tolist
2024-06-05 10:51:19 +08:00
..
2024-05-08 15:20:53 +00:00
2024-05-31 19:40:26 +08:00
2024-05-08 11:30:15 +08:00
2024-05-08 11:30:15 +08:00
2024-01-11 13:39:56 +00:00
2024-02-23 10:51:35 +08:00
2024-05-08 11:30:15 +08:00
2024-05-24 19:34:15 +08:00
2024-05-08 11:30:15 +08:00
2024-05-08 11:30:15 +08:00
2024-05-08 15:20:53 +00:00
2024-05-05 16:28:56 +00:00
2024-05-05 16:28:56 +00:00
2024-05-31 19:40:26 +08:00
2024-06-05 10:51:19 +08:00