Cuiqing Li
|
4b977541a8
|
[Kernels] added Triton implementation of self-attention for colossal-ai (#4241)
* added softmax kernel
* added qkv_kernel
* added ops
* adding tests
* upload tests
* fix tests
* debugging
* debugging tests
* debugging
* added
* fixed errors
* added softmax kernel
* clean up code
* added tests
* update tests
* update tests
* added attention
* add
* fixed pytest checking
* add cuda check
* fix cuda version
* fix typo
|
2023-07-18 23:53:38 +08:00 |
|