ColossalAI

mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-04-25 01:03:35 +00:00

Files

Jianghai ef4c14a5e2 [Inference] Fix bug in ChatGLM2 Tensor Parallelism (#5014 )

* fix bug

* fix

* fix multiquery

* fix multiquery

---------

Co-authored-by: CjhHa1 <cjh18671720497outlook.com>

2023-11-07 15:01:50 +08:00

2023-10-30 10:52:19 +08:00

_utils.py

2023-09-19 14:20:26 +08:00

test_bloom_infer.py

2023-10-30 14:04:37 +08:00

test_chatglm2_infer.py

2023-11-07 15:01:50 +08:00

test_infer_engine.py

2023-09-19 14:20:26 +08:00

test_kvcache_manager.py

2023-09-19 14:20:26 +08:00

test_llama2_infer.py

2023-10-30 14:04:37 +08:00

test_llama_infer.py

2023-10-30 14:04:37 +08:00

test_pipeline_infer.py

2023-11-06 17:08:12 +08:00