[inference] update examples and engine (#5073)

* update examples and engine

* fix choices

* update example
This commit is contained in:
Xu Kai
2023-11-20 19:44:52 +08:00
committed by GitHub
parent 0c7d8bebd5
commit fb103cfd6e
12 changed files with 107 additions and 273 deletions

View File

@@ -3,5 +3,4 @@ packaging
ninja
auto-gptq==0.5.0
git+https://github.com/ModelTC/lightllm.git@ece7b43f8a6dfa74027adc77c2c176cff28c76c8
git+https://github.com/facebookresearch/xformers.git@main#egg=xformers
git+https://github.com/Dao-AILab/flash-attention.git@017716451d446e464dde9aca3a3c1ed2209caaa9