Edenzzzz 
							
						 
					 
					
						
						
							
						
						f5c84af0b0 
					 
					
						
						
							
							[Feature] Zigzag Ring attention ( #5905 )  
						
						... 
						
						
						
						* halfway
* fix cross-PP-stage position id length diff bug
* fix typo
* fix typo
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci 
* unified cross entropy func for all shardformer models
* remove redundant lines
* add basic ring attn; debug cross entropy
* fwd bwd logic complete
* fwd bwd logic complete; add experimental triton rescale
* precision tests passed
* precision tests passed
* fix typos and remove misc files
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci 
* add sp_mode to benchmark; fix varlen interface
* update softmax_lse shape by new interface
* change tester name
* remove buffer clone; support packed seq layout
* add varlen tests
* fix typo
* all tests passed
* add dkv_group; fix mask
* remove debug statements
---------
Co-authored-by: Edenzzzz <wtan45@wisc.edu >
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> 
						
						
					 
					
						2024-08-16 13:56:38 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						7f8b16635b 
					 
					
						
						
							
							[misc] refactor launch API and tensor constructor ( #5666 )  
						
						... 
						
						
						
						* [misc] remove config arg from initialize
* [misc] remove old tensor contrusctor
* [plugin] add npu support for ddp
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci 
* [devops] fix doc test ci
* [test] fix test launch
* [doc] update launch doc
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> 
						
						
					 
					
						2024-04-29 10:40:11 +08:00 
						 
				 
			
				
					
						
							
							
								Edenzzzz 
							
						 
					 
					
						
						
							
						
						15055f9a36 
					 
					
						
						
							
							[hotfix] quick fixes to make legacy tutorials runnable ( #5559 )  
						
						... 
						
						
						
						Co-authored-by: Edenzzzz <wtan45@wisc.edu > 
						
						
					 
					
						2024-04-07 12:06:27 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						d202cc28c0 
					 
					
						
						
							
							[npu] change device to accelerator api ( #5239 )  
						
						... 
						
						
						
						* update accelerator
* fix timer
* fix amp
* update
* fix
* update bug
* add error raise
* fix autocast
* fix set device
* remove doc accelerator
* update doc
* update doc
* update doc
* use nullcontext
* update cpu
* update null context
* change time limit for example
* udpate
* update
* update
* update
* [npu] polish accelerator code
---------
Co-authored-by: Xuanlei Zhao <xuanlei.zhao@gmail.com >
Co-authored-by: zxl <43881818+oahzxl@users.noreply.github.com > 
						
						
					 
					
						2024-01-09 10:20:05 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						079bf3cb26 
					 
					
						
						
							
							[misc] update pre-commit and run all files ( #4752 )  
						
						... 
						
						
						
						* [misc] update pre-commit
* [misc] run pre-commit
* [misc] remove useless configuration files
* [misc] ignore cuda for clang-format 
						
						
					 
					
						2023-09-19 14:20:26 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						b5f9e37c70 
					 
					
						
						
							
							[legacy] clean up legacy code ( #4743 )  
						
						... 
						
						
						
						* [legacy] remove outdated codes of pipeline (#4692 )
* [legacy] remove cli of benchmark and update optim (#4690 )
* [legacy] remove cli of benchmark and update optim
* [doc] fix cli doc test
* [legacy] fix engine clip grad norm
* [legacy] remove outdated colo tensor (#4694 )
* [legacy] remove outdated colo tensor
* [test] fix test import
* [legacy] move outdated zero to legacy (#4696 )
* [legacy] clean up utils (#4700 )
* [legacy] clean up utils
* [example] update examples
* [legacy] clean up amp
* [legacy] fix amp module
* [legacy] clean up gpc (#4742 )
* [legacy] clean up context
* [legacy] clean core, constants and global vars
* [legacy] refactor initialize
* [example] fix examples ci
* [example] fix examples ci
* [legacy] fix tests
* [example] fix gpt example
* [example] fix examples ci
* [devops] fix ci installation
* [example] fix examples ci 
						
						
					 
					
						2023-09-18 16:31:06 +08:00 
						 
				 
			
				
					
						
							
							
								Tian Siyuan 
							
						 
					 
					
						
						
							
						
						f1ae8c9104 
					 
					
						
						
							
							[example] change accelerate version ( #4431 )  
						
						... 
						
						
						
						Co-authored-by: Siyuan Tian <siyuant@vmware.com >
Co-authored-by: Hongxin Liu <lhx0217@gmail.com > 
						
						
					 
					
						2023-08-30 22:56:13 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						27061426f7 
					 
					
						
						
							
							[gemini] improve compatibility and add static placement policy ( #4479 )  
						
						... 
						
						
						
						* [gemini] remove distributed-related part from colotensor (#4379 )
* [gemini] remove process group dependency
* [gemini] remove tp part from colo tensor
* [gemini] patch inplace op
* [gemini] fix param op hook and update tests
* [test] remove useless tests
* [test] remove useless tests
* [misc] fix requirements
* [test] fix model zoo
* [test] fix model zoo
* [test] fix model zoo
* [test] fix model zoo
* [test] fix model zoo
* [misc] update requirements
* [gemini] refactor gemini optimizer and gemini ddp (#4398 )
* [gemini] update optimizer interface
* [gemini] renaming gemini optimizer
* [gemini] refactor gemini ddp class
* [example] update gemini related example
* [example] update gemini related example
* [plugin] fix gemini plugin args
* [test] update gemini ckpt tests
* [gemini] fix checkpoint io
* [example] fix opt example requirements
* [example] fix opt example
* [example] fix opt example
* [example] fix opt example
* [gemini] add static placement policy (#4443 )
* [gemini] add static placement policy
* [gemini] fix param offload
* [test] update gemini tests
* [plugin] update gemini plugin
* [plugin] update gemini plugin docstr
* [misc] fix flash attn requirement
* [test] fix gemini checkpoint io test
* [example] update resnet example result (#4457 )
* [example] update bert example result (#4458 )
* [doc] update gemini doc (#4468 )
* [example] update gemini related examples (#4473 )
* [example] update gpt example
* [example] update dreambooth example
* [example] update vit
* [example] update opt
* [example] update palm
* [example] update vit and opt benchmark
* [hotfix] fix bert in model zoo (#4480 )
* [hotfix] fix bert in model zoo
* [test] remove chatglm gemini test
* [test] remove sam gemini test
* [test] remove vit gemini test
* [hotfix] fix opt tutorial example (#4497 )
* [hotfix] fix opt tutorial example
* [hotfix] fix opt tutorial example 
						
						
					 
					
						2023-08-24 09:29:25 +08:00 
						 
				 
			
				
					
						
							
							
								ver217 
							
						 
					 
					
						
						
							
						
						573af84184 
					 
					
						
						
							
							[example] update examples related to zero/gemini ( #3431 )  
						
						... 
						
						
						
						* [zero] update legacy import
* [zero] update examples
* [example] fix opt tutorial
* [example] fix opt tutorial
* [example] fix opt tutorial
* [example] fix opt tutorial
* [example] fix import 
						
						
					 
					
						2023-04-04 17:32:51 +08:00 
						 
				 
			
				
					
						
							
							
								ver217 
							
						 
					 
					
						
						
							
						
						26b7aac0be 
					 
					
						
						
							
							[zero] reorganize zero/gemini folder structure ( #3424 )  
						
						... 
						
						
						
						* [zero] refactor low-level zero folder structure
* [zero] fix legacy zero import path
* [zero] fix legacy zero import path
* [zero] remove useless import
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor gemini folder structure
* [zero] refactor legacy zero import path
* [zero] fix test import path
* [zero] fix test
* [zero] fix circular import
* [zero] update import 
						
						
					 
					
						2023-04-04 13:48:16 +08:00 
						 
				 
			
				
					
						
							
							
								Zheng Zeng 
							
						 
					 
					
						
						
							
						
						597914317b 
					 
					
						
						
							
							[doc] fix typo in opt inference tutorial ( #2849 )  
						
						
						
						
					 
					
						2023-02-21 17:16:13 +08:00 
						 
				 
			
				
					
						
							
							
								Frank Lee 
							
						 
					 
					
						
						
							
						
						4ae02c4b1c 
					 
					
						
						
							
							[tutorial] added energonai to opt inference requirements ( #2625 )  
						
						
						
						
					 
					
						2023-02-07 16:58:06 +08:00 
						 
				 
			
				
					
						
							
							
								binmakeswell 
							
						 
					 
					
						
						
							
						
						d7352bef2c 
					 
					
						
						
							
							[example] add example requirement ( #2345 )  
						
						
						
						
					 
					
						2023-01-06 09:03:29 +08:00 
						 
				 
			
				
					
						
							
							
								Jiarui Fang 
							
						 
					 
					
						
						
							
						
						f7e276fa71 
					 
					
						
						
							
							[Gemini] add GeminiAdamOptimizer ( #1960 )  
						
						
						
						
					 
					
						2022-11-16 14:44:28 +08:00 
						 
				 
			
				
					
						
							
							
								binmakeswell 
							
						 
					 
					
						
						
							
						
						9183e0dec5 
					 
					
						
						
							
							[tutorial] polish all README ( #1946 )  
						
						
						
						
					 
					
						2022-11-14 19:49:32 +08:00 
						 
				 
			
				
					
						
							
							
								binmakeswell 
							
						 
					 
					
						
						
							
						
						41868f7605 
					 
					
						
						
							
							[tutorial] polish README and OPT files ( #1930 )  
						
						... 
						
						
						
						* [tutorial] polish README and OPT files
* [tutorial] polish README and OPT files
* [tutorial] polish README and OPT files 
						
						
					 
					
						2022-11-13 13:09:58 +08:00 
						 
				 
			
				
					
						
							
							
								ver217 
							
						 
					 
					
						
						
							
						
						b0b7a786b7 
					 
					
						
						
							
							[tutorial] add synthetic dataset for opt ( #1924 )  
						
						
						
						
					 
					
						2022-11-13 03:26:11 +08:00 
						 
				 
			
				
					
						
							
							
								Frank Lee 
							
						 
					 
					
						
						
							
						
						abf4c27f6a 
					 
					
						
						
							
							[tutorial] removed huggingface model warning ( #1925 )  
						
						
						
						
					 
					
						2022-11-12 23:12:18 +08:00 
						 
				 
			
				
					
						
							
							
								Frank Lee 
							
						 
					 
					
						
						
							
						
						d43a671ad6 
					 
					
						
						
							
							Hotfix/tutorial readme index ( #1922 )  
						
						... 
						
						
						
						* [tutorial] removed tutorial index in readme
* [tutorial] removed tutorial index in readme 
						
						
					 
					
						2022-11-12 18:24:52 +08:00 
						 
				 
			
				
					
						
							
							
								BoxiangW 
							
						 
					 
					
						
						
							
						
						ca6e75bc28 
					 
					
						
						
							
							[tutorial] edited hands-on practices ( #1899 )  
						
						... 
						
						
						
						* Add handson to ColossalAI.
* Change names of handsons and edit sequence parallel example.
* Edit wrong folder name
* resolve conflict
* delete readme 
						
						
					 
					
						2022-11-11 17:08:17 +08:00