diff --git a/applications/ColossalChat/coati/distributed/README.md b/applications/ColossalChat/coati/distributed/README.md index e0773d838..68c5e5c68 100644 --- a/applications/ColossalChat/coati/distributed/README.md +++ b/applications/ColossalChat/coati/distributed/README.md @@ -2,6 +2,8 @@ This repository implements a distributed Reinforcement Learning (RL) training framework designed to fine-tune large language models using algorithms such as **GRPO** and **DAPO**. It supports multi-node and multi-GPU setups, scalable rollout generation, and policy optimization using libraries like VLLM. +**Please note that we are still under intensive development, stay tuned.** + --- ## 🚀 Features @@ -28,6 +30,15 @@ pip install -e . cd ./applications/ColossalChat pip install -e . ``` + +Install vllm and vllm-ascend +```bash +apt update -y +apt install -y libnuma-dev +pip install vllm==0.7.3 +pip install vllm-ascend==0.7.3 --extra-index https://download.pytorch.org/whl/cpu/ +``` + Install Fuyao Ray. Please update CANN before install fuyao ray ```bash @@ -85,6 +96,23 @@ export HCCL_SOCKET_IFNAME=eno0 export RAY_COLLECTIVE_MEET_TIMEOUT_SECONDS=7200 ``` + +## Architecture Design + +
+
+