Guangyao Zhang
|
53cb9606bd
|
[Feature] llama shardformer fp8 support (#5938)
* add llama shardformer fp8
* Llama Shardformer Parity
* fix typo
* fix all reduce
* fix pytest failure
* fix reduce op and move function to fp8.py
* fix typo
|
2024-08-05 10:05:47 +08:00 |
|