ColossalAI

mirror of https://github.com/hpcaitech/ColossalAI.git synced 2026-03-18 19:03:50 +00:00

Author	SHA1	Message	Date
YeAnbang	99ba48fc40	Merge branch 'grpo-latest-rebase-main' of https://github.com/hpcaitech/ColossalAI into grpo-latest-rebase-main	2025-08-14 19:03:04 +08:00
YeAnbang	bbc5fb4ed8	fix ci	2025-08-14 18:59:54 +08:00
Hanks	c83dc66645	Update timeout	2025-08-14 09:39:49 +08:00
Hanks	9db9892f63	reduce memory consumption	2025-08-13 16:45:43 +08:00
BurkeHulk	c9cba49ab5	fix CI machine tag	2025-06-02 17:45:40 +08:00
flybird11111	4afff92138	fix	2025-05-28 11:13:44 +08:00
Wenxuan Tan	d383449fc4	[CI] Remove triton version for compatibility bug; update req torch >=2.2 (#6018 ) * remove triton version * remove torch 2.2 * remove torch 2.1 * debug * remove 2.1 build tests * require torch >=2.2 --------- Co-authored-by: Edenzzzz <wtan45@wisc.edu>	2024-08-27 10:12:21 +08:00
YeAnbang	b1031f7244	fix ci	2024-06-07 07:01:31 +00:00
YeAnbang	df5e9c53cf	[ColossalChat] Update RLHF V2 (#5286 ) * Add dpo. Fix sft, ppo, lora. Refactor all * fix and tested ppo * 2 nd round refactor * add ci tests * fix ci * fix ci * fix readme, style * fix readme style * fix style, fix benchmark * reproduce benchmark result, remove useless files * rename to ColossalChat * use new image * fix ci workflow * fix ci * use local model/tokenizer for ci tests * fix ci * fix ci * fix ci * fix ci timeout * fix rm progress bar. fix ci timeout * fix ci * fix ci typo * remove 3d plugin from ci temporary * test environment * cannot save optimizer * support chat template * fix readme * fix path * test ci locally * restore build_or_pr * fix ci data path * fix benchmark * fix ci, move ci tests to 3080, disable fast tokenizer * move ci to 85 * support flash attention 2 * add all-in-one data preparation script. Fix colossal-llama2-chat chat template * add hardware requirements * move ci test data * fix save_model, add unwrap * fix missing bos * fix missing bos; support grad accumulation with gemini * fix ci * fix ci * fix ci * fix llama2 chat template config * debug sft * debug sft * fix colossalai version requirement * fix ci * add sanity check to prevent NaN loss * fix requirements * add dummy data generation script * add dummy data generation script * add dummy data generation script * add dummy data generation script * update readme * update readme * update readme and ignore * fix logger bug * support parallel_output * modify data preparation logic * fix tokenization * update lr * fix inference * run pre-commit --------- Co-authored-by: Tong Li <tong.li352711588@gmail.com>	2024-03-29 14:12:29 +08:00
Frank Lee	73f4dc578e	[workflow] updated CI image (#5318 )	2024-01-29 11:53:07 +08:00
ver217	1c43bfd54e	[coati] update ci	2023-08-30 10:55:56 +08:00
Hongxin Liu	50793b35f4	[gemini] accelerate inference (#3641 ) * [gemini] support don't scatter after inference * [chat] update colossalai strategy * [chat] fix opt benchmark * [chat] update opt benchmark * [gemini] optimize inference * [test] add gemini inference test * [chat] fix unit test ci * [chat] fix ci * [chat] fix ci * [chat] skip checkpoint test	2023-04-26 16:32:40 +08:00
Hongxin Liu	179558a87a	[devops] fix chat ci (#3628 )	2023-04-24 10:55:14 +08:00
Frank Lee	169ed4d24e	[workflow] purged extension cache before GPT test (#3128 )	2023-03-14 10:11:32 +08:00
ver217	9c0943ecdb	[chatgpt] optimize generation kwargs (#2717 ) * [chatgpt] ppo trainer use default generate args * [chatgpt] example remove generation preparing fn * [chatgpt] benchmark remove generation preparing fn * [chatgpt] fix ci	2023-02-15 13:59:58 +08:00
ver217	f6b4ca4e6c	[devops] add chatgpt ci (#2713 )	2023-02-15 10:53:54 +08:00

16 Commits