github/ColossalAI
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-04-25 01:03:35 +00:00
576a2f7b10711bcdb43b86da6a5afaa98f4ad867
ColossalAI/tests/test_infer
Jianghai ef4c14a5e2 [Inference] Fix bug in ChatGLM2 Tensor Parallelism (#5014)
* fix bug

* fix

* fix multiquery

* fix multiquery

---------

Co-authored-by: CjhHa1 <cjh18671720497outlook.com>
2023-11-07 15:01:50 +08:00
Name | Last commit | Date
test_dynamic_batching | [Inference] Dynamic Batching Inference, online and offline (#4953) | 2023-10-30 10:52:19 +08:00
_utils.py | [misc] update pre-commit and run all files (#4752) | 2023-09-19 14:20:26 +08:00
test_bloom_infer.py | [Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention (#4965) | 2023-10-30 14:04:37 +08:00
test_chatglm2_infer.py | [Inference] Fix bug in ChatGLM2 Tensor Parallelism (#5014) | 2023-11-07 15:01:50 +08:00
test_infer_engine.py | [misc] update pre-commit and run all files (#4752) | 2023-09-19 14:20:26 +08:00
test_kvcache_manager.py | [misc] update pre-commit and run all files (#4752) | 2023-09-19 14:20:26 +08:00
test_llama2_infer.py | [Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention (#4965) | 2023-10-30 14:04:37 +08:00
test_llama_infer.py | [Kernels]Updated Triton kernels into 2.1.0 and adding flash-decoding for llama token attention (#4965) | 2023-10-30 14:04:37 +08:00
test_pipeline_infer.py | [format] applied code formatting on changed files in pull request 4926 (#5007) | 2023-11-06 17:08:12 +08:00