1
0
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2025-04-29 04:05:35 +00:00
ColossalAI/examples/inference
Runyu Lu bcf0181ecd
[Feat] Distrifusion Acceleration Support for Diffusion Inference ()
* Distrifusion Support source

* comp comm overlap optimization

* sd3 benchmark

* pixart distrifusion bug fix

* sd3 bug fix and benchmark

* generation bug fix

* naming fix

* add docstring, fix counter and shape error

* add reference

* readme and requirement
2024-07-30 10:43:26 +08:00
..
benchmark_ops add paged-attetionv2: support seq length split across thread block () 2024-05-14 12:46:54 +08:00
client [Inference]Fix readme and example for API server () 2024-05-24 10:03:05 +08:00
llama [Fix] Fix spec-dec Glide LlamaModel for compatibility with transformers () 2024-06-19 15:37:53 +08:00
stable_diffusion [Feat] Distrifusion Acceleration Support for Diffusion Inference () 2024-07-30 10:43:26 +08:00