flybird11111 
							
						 
					 
					
						
						
							
						
						a1e39f4c0d 
					 
					
						
						
							
							[install]fix setup ( #5786 )  
						
						... 
						
						
						
						* fix
* [pre-commit.ci] auto fixes from pre-commit.com hooks
for more information, see https://pre-commit.ci 
---------
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> 
						
						
					 
					
						2024-06-06 11:47:48 +08:00 
						 
				 
			
				
					
						
							
							
								Charles Coulombe 
							
						 
					 
					
						
						
							
						
						c46e09715c 
					 
					
						
						
							
							Allow building cuda extension without a device. ( #5535 )  
						
						... 
						
						
						
						Added FORCE_CUDA environment variable support, to enable building extensions where a GPU device is not present but cuda libraries are. 
						
						
					 
					
						2024-06-05 14:26:30 +08:00 
						 
				 
			
				
					
						
							
							
								傅剑寒 
							
						 
					 
					
						
						
							
						
						279300dc5f 
					 
					
						
						
							
							[Inference/Refactor] Refactor compilation mechanism and unified multi hw ( #5613 )  
						
						... 
						
						
						
						* refactor compilation mechanism and unified multi hw
* fix file path bug
* add init.py to make pybind a module to avoid relative path error caused by softlink
* delete duplicated micros
* fix micros bug in gcc 
						
						
					 
					
						2024-04-24 14:17:54 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						19e1a5cf16 
					 
					
						
						
							
							[shardformer] update colo attention to support custom mask ( #5510 )  
						
						... 
						
						
						
						* [feature] refactor colo attention (#5462 )
* [extension] update api
* [feature] add colo attention
* [feature] update sdpa
* [feature] update npu attention
* [feature] update flash-attn
* [test] add flash attn test
* [test] update flash attn test
* [shardformer] update modeling to fit colo attention (#5465 )
* [misc] refactor folder structure
* [shardformer] update llama flash-attn
* [shardformer] fix llama policy
* [devops] update tensornvme install
* [test] update llama test
* [shardformer] update colo attn kernel dispatch
* [shardformer] update blip2
* [shardformer] update chatglm
* [shardformer] update gpt2
* [shardformer] update gptj
* [shardformer] update opt
* [shardformer] update vit
* [shardformer] update colo attention mask prep
* [shardformer] update whisper
* [test] fix shardformer tests (#5514 )
* [test] fix shardformer tests
* [test] fix shardformer tests 
						
						
					 
					
						2024-03-27 11:19:32 +08:00 
						 
				 
			
				
					
						
							
							
								Hongxin Liu 
							
						 
					 
					
						
						
							
						
						ffffc32dc7 
					 
					
						
						
							
							[checkpointio] fix gemini and hybrid parallel optim checkpoint ( #5347 )  
						
						... 
						
						
						
						* [checkpointio] fix hybrid parallel optim checkpoint
* [extension] fix cuda extension
* [checkpointio] fix gemini optimizer checkpoint
* polish code 
						
						
					 
					
						2024-02-01 16:13:06 +08:00 
						 
				 
			
				
					
						
							
							
								digger yu 
							
						 
					 
					
						
						
							
						
						6a3086a505 
					 
					
						
						
							
							fix typo under extensions/ ( #5330 )  
						
						
						
						
					 
					
						2024-01-30 09:55:16 +08:00 
						 
				 
			
				
					
						
							
							
								Frank Lee 
							
						 
					 
					
						
						
							
						
						7cfed5f076 
					 
					
						
						
							
							[feat] refactored extension module ( #5298 )  
						
						... 
						
						
						
						* [feat] refactored extension module
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish
* polish 
						
						
					 
					
						2024-01-25 17:01:48 +08:00