mirror of
https://github.com/hpcaitech/ColossalAI.git
synced 2025-04-28 11:45:23 +00:00
* update help information * update style * fix * minor fix * support PP training * add pp support * remove unused code * address conversation * fix memory leakage support tp+pp * move empty cache * move empty cache * add DAPO support * remove format reward * fix filtering, still buggy * small fix * add DAPO support * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * tested multi-node training; fix bind_batch bug * fix conversation; support sleep mode * support reusing excessive samples * add dynamic batching control flag * add dynamic batching control flag * refactored * fix logging --------- Co-authored-by: Tong Li <tong.li35271158@gmail.com> Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> |
||
---|---|---|
.. | ||
Colossal-LLaMA | ||
ColossalChat | ||
ColossalEval | ||
ColossalMoE | ||
ColossalQA | ||
README.md |
Applications
This directory contains the applications that are powered by Colossal-AI.
The list of applications include:
- Open-Sora: Revealing Complete Model Parameters, Training Details, and Everything for Sora-like Video Generation Models
- ColossalChat: Replication of ChatGPT with RLHF.
- Colossal-LLaMA: Continual Pre-training and Supervisied Fine-tuning of LLaMA2 / LLaMA3.
- ColossalEval: Evaluation Pipeline for LLMs.
- FastFold: Optimizing AlphaFold (Biomedicine) Training and Inference on GPU Clusters.
- ColossalQA: Document Retrieval Conversation System
- SwiftInfer: Breaks the Length Limit of LLM Inference for Multi-Round Conversations
Please note that the
Chatbot
application is migrated from the originalChatGPT
folder.
You can find more example code for base models and functions in the Examples directory.