ColossalAI

mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-04-26 01:35:21 +00:00

Files

yuehuayingxueluo 12f10d5b0b [Fix/Inference]Fix CUDA Rotary Rmbedding GQA (#5623 )

* fix rotary embedding GQA

* change test_rotary_embdding_unpad.py KH

2024-04-23 13:44:49 +08:00

2024-04-15 16:53:02 +08:00

2024-04-23 13:44:49 +08:00

_utils.py

2024-01-11 13:39:56 +00:00

test_batch_bucket.py

2024-02-23 10:51:35 +08:00

test_config_and_struct.py

2024-02-19 17:18:20 +08:00

test_cuda_graph.py

2024-04-18 16:56:46 +08:00

test_drafter.py

2024-04-10 11:07:52 +08:00

test_inference_engine.py

2024-04-23 13:09:55 +08:00

test_kvcache_manager.py

2024-02-19 17:18:20 +08:00

test_request_handler.py

2024-02-19 17:18:20 +08:00