Logo
Explore Help
Register Sign In
github/ColossalAI
1
0
Fork 0
You've already forked ColossalAI
mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-04-10 22:25:04 +00:00
Code Issues Packages Projects Releases Wiki Activity
Files
73bdfd88910efeecc4f09025773ecc58305aa494
ColossalAI/colossalai/shardformer/layer
History
YeAnbang 16e68a071d fix logprob, add filtering, temperature annealing, lr descent
2025-08-05 13:59:02 +08:00
..
__init__.py
[Feature] Support Distributed LogProb for GRPO Training (#6247)
2025-08-05 13:59:02 +08:00
_operation.py
[Sharderformer] Support zbv in Sharderformer Policy (#6150)
2025-01-02 10:22:26 +08:00
attn.py
[hotfix] fix flash attn window_size err (#6132)
2024-11-14 17:11:35 +08:00
dropout.py
[misc] update pre-commit and run all files (#4752)
2023-09-19 14:20:26 +08:00
embedding.py
[fp8] support hybrid parallel plugin (#5982)
2024-08-12 18:17:05 +08:00
linear.py
[CI] Cleanup Dist Optim tests with shared helper funcs (#6125)
2025-02-12 13:42:34 +08:00
loss.py
fix logprob, add filtering, temperature annealing, lr descent
2025-08-05 13:59:02 +08:00
normalization.py
[Hotfix] hotfix normalization (#6163)
2024-12-23 16:29:48 +08:00
parallel_module.py
[shardformer] refactor embedding resize (#5603)
2024-04-18 16:10:18 +08:00
qkv_fused_linear.py
[Sharderformer] Support zbv in Sharderformer Policy (#6150)
2025-01-02 10:22:26 +08:00
utils.py
[Sharderformer] Support zbv in Sharderformer Policy (#6150)
2025-01-02 10:22:26 +08:00
Powered by Gitea Version: 1.25.2 Page: 397ms Template: 8ms
English
Bahasa Indonesia Deutsch English Español Français Gaeilge Italiano Latviešu Magyar nyelv Nederlands Polski Português de Portugal Português do Brasil Suomi Svenska Türkçe Čeština Ελληνικά Български Русский Українська فارسی മലയാളം 日本語 简体中文 繁體中文(台灣) 繁體中文(香港) 한국어
Licenses API