mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2025-09-12 20:54:35 +00:00
[Pipeline inference] Combine kvcache with pipeline inference (#4938)
* merge kvcache with pipeline inference and refactor the code structure * support ppsize > 2 * refactor pipeline code * do pre-commit * modify benchmark * fix bench mark * polish code * add docstring and update readme * refactor the code * fix some logic bug of ppinfer * polish readme * fix typo * skip infer test
This commit is contained in:
@@ -1,3 +1,4 @@
|
||||
from .pipeline import PPInferEngine
|
||||
|
||||
__all__ = ["PPInferEngine"]
|
||||
|
||||
__all__ = ['PPInferEngine']
|
||||
|
Reference in New Issue
Block a user