Mirror of https://github.com/hpcaitech/ColossalAI.git (synced 2025-07-16 08:27:44 +00:00)
Commit message:

* fix flash decoding mask during verification
* add spec-dec
* add test for spec-dec
* revise drafter init
* remove drafter sampling
* retire past kv in drafter
* (trivial) rename attrs
* (trivial) rename arg
* revise how we enable/disable spec-dec

Files:

* `__init__.py`
* `context_attn_unpad.py`
* `flash_decoding.py`
* `fused_rotary_embedding.py`
* `kvcache_copy.py`
* `llama_act_combine_kernel.py`
* `no_pad_rotary_embedding.py`
* `qkv_matmul_kernel.py`
* `rms_layernorm.py`
* `rotary_cache_copy.py`
* `softmax.py`