github/ColossalAI
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-02-21 06:22:09 +00:00
04aca9e55bd91ea4dd8d1231aa66df7848b08f03
ColossalAI/tests/test_infer
yuehuayingxueluo 04aca9e55b [Inference/Kernel]Add get_cos_and_sin Kernel (#5528)
* Add get_cos_and_sin kernel

* fix code comments

* fix code typos

* merge common codes of get_cos_and_sin kernel.

* Fixed a typo

* Changed 'asset allclose' to 'assert equal'.
2024-04-01 13:47:14 +08:00
| Name | Last commit | Date |
| --- | --- | --- |
| test_models | [Infer] Optimize Blocked KVCache And Kernels Using It (#5325) | 2024-01-30 16:06:09 +08:00 |
| test_ops | [Inference/Kernel]Add get_cos_and_sin Kernel (#5528) | 2024-04-01 13:47:14 +08:00 |
| _utils.py | [Inference] Add the logic of the inference engine (#5173) | 2024-01-11 13:39:56 +00:00 |
| test_batch_bucket.py | [Fix/Inference] Fix format of input prompts and input model in inference engine (#5395) | 2024-02-23 10:51:35 +08:00 |
| test_config_and_struct.py | [Inference] Optimize and Refactor Inference Batching/Scheduling (#5367) | 2024-02-19 17:18:20 +08:00 |
| test_cuda_graph.py | [fix] | 2024-03-25 11:37:58 +08:00 |
| test_inference_engine.py | [Inference/kernel]Add Fused Rotary Embedding and KVCache Memcopy CUDA Kernel (#5418) | 2024-03-13 17:20:03 +08:00 |
| test_kvcache_manager.py | [Inference] Optimize and Refactor Inference Batching/Scheduling (#5367) | 2024-02-19 17:18:20 +08:00 |
| test_request_handler.py | [Inference] Optimize and Refactor Inference Batching/Scheduling (#5367) | 2024-02-19 17:18:20 +08:00 |