
Applications

This directory contains the applications that are powered by Colossal-AI.

The applications include:

  • Open-Sora: Revealing Complete Model Parameters, Training Details, and Everything for Sora-like Video Generation Models.
  • ColossalChat: Replication of ChatGPT with RLHF.
  • Colossal-LLaMA: Continual Pre-training and Supervised Fine-tuning of LLaMA2 / LLaMA3.
  • ColossalEval: Evaluation Pipeline for LLMs.
  • FastFold: Optimizing AlphaFold (Biomedicine) Training and Inference on GPU Clusters.
  • ColossalQA: Document Retrieval Conversation System.
  • SwiftInfer: Breaks the Length Limit of LLM Inference for Multi-Round Conversations.

Please note that the Chatbot application has been migrated from the original ChatGPT folder.

You can find more example code for base models and functions in the Examples directory.