Files
ColossalAI/colossalai/inference/tensor_parallel/__init__.py
Yuanheng Zhao 17cfa57140 [infer] Add Bloom inference policy and replaced methods (#4512)
* add bloom inference methods and policy

* enable pass BatchInferState from model forward

* revise bloom infer layers/policies

* add engine for inference (draft)

* add test for bloom infer

* fix bloom infer policy and flow

* revise bloom test

* fix bloom file path

* remove unused codes

* fix bloom modeling

* fix dir typo

* fix trivial

* fix policy

* clean pr

* trivial fix
2023-08-30 17:50:41 +08:00

5 lines
123 B
Python

from .engine import TPInferEngine
from .kvcache_manager import MemoryManager
__all__ = ['MemoryManager', 'TPInferEngine']