mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-07-13 12:05:48 +00:00

Applications

This directory contains the applications that are powered by Colossal-AI.

GPU Cloud Playground | Playground Document

The applications include:

Open-Sora: Revealing Complete Model Parameters, Training Details, and Everything for Sora-like Video Generation Models
ColossalChat: Replication of ChatGPT with RLHF.
Colossal-LLaMA: Continual Pre-training and Supervised Fine-tuning of LLaMA2 / LLaMA3.
ColossalEval: Evaluation Pipeline for LLMs.
FastFold: Optimizing AlphaFold (Biomedicine) Training and Inference on GPU Clusters.
ColossalQA: Document Retrieval Conversation System
SwiftInfer: Breaks the Length Limit of LLM Inference for Multi-Round Conversations

Please note that the Chatbot application has been migrated from the original ChatGPT folder.

You can find more example code for base models and functions in the Examples directory.