1
0
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2025-05-05 15:08:18 +00:00
ColossalAI/tests
Bin Jia 1db6727678
[Pipeline inference] Combine kvcache with pipeline inference ()
* merge kvcache with pipeline inference and refactor the code structure

* support ppsize > 2

* refactor pipeline code

* do pre-commit

* modify benchmark

* fix bench mark

* polish code

* add docstring and update readme

* refactor the code

* fix some logic bug of ppinfer

* polish readme

* fix typo

* skip infer test
2023-10-27 16:19:54 +08:00
..
kit [test] merge old components to test to model zoo () 2023-10-20 10:35:08 +08:00
test_analyzer [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_auto_parallel [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_autochunk [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_booster [gemini] support gradient accumulation () 2023-10-17 14:07:21 +08:00
test_checkpoint_io [hotfix] fix lr scheduler bug in torch 2.0 () 2023-10-12 14:04:24 +08:00
test_cluster [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_config [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_device [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_fx [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_gptq [feature] add gptq for inference () 2023-09-22 11:02:50 +08:00
test_infer [Pipeline inference] Combine kvcache with pipeline inference () 2023-10-27 16:19:54 +08:00
test_infer_ops [Refactor] Integrated some lightllm kernels into token-attention () 2023-10-19 22:22:47 +08:00
test_lazy [lazy] support from_pretrained () 2023-09-26 11:04:11 +08:00
test_legacy [test] merge old components to test to model zoo () 2023-10-20 10:35:08 +08:00
test_moe [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_optimizer [test] merge old components to test to model zoo () 2023-10-20 10:35:08 +08:00
test_pipeline [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_shardformer [hotfix] fix torch 2.0 compatibility () 2023-10-18 11:05:25 +08:00
test_smoothquant [inference] Add smmoothquant for llama () 2023-10-16 11:28:44 +08:00
test_tensor [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_utils [misc] update pre-commit and run all files () 2023-09-19 14:20:26 +08:00
test_zero [test] merge old components to test to model zoo () 2023-10-20 10:35:08 +08:00
__init__.py [zero] Update sharded model v2 using sharded param v2 () 2022-03-11 15:50:28 +08:00