mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2025-08-11 12:51:55 +00:00
* refactor kvcache manager and rotary_embedding and kvcache_memcpy operator * refactor decode_kv_cache_memcpy * enable alibi in pagedattention |
||
---|---|---|
.. | ||
__init__.py | ||
inference_ops_cuda.py | ||
inference.cpp |