ColossalAI/colossalai/inference/quant/smoothquant/models
Xu Kai c6295c3381
[Refactor] remove useless inference code (#5022)
* remove useless code

* fix quant model

* fix test import bug

* mv original inference legacy

* fix chatglm2
2023-11-10 14:47:06 +08:00
..
__init__.py [refactor] refactor gptq and smoothquant llama (#5012) 2023-11-09 10:12:11 +08:00
base_model.py [Refactor] remove useless inference code (#5022) 2023-11-10 14:47:06 +08:00
linear.py [refactor] refactor gptq and smoothquant llama (#5012) 2023-11-09 10:12:11 +08:00
llama.py [Refactor] remove useless inference code (#5022) 2023-11-10 14:47:06 +08:00
parallel_linear.py [refactor] refactor gptq and smoothquant llama (#5012) 2023-11-09 10:12:11 +08:00