From 1c88dd43e261dfd54e9dfa2e5e7f3a3abe9ca4be Mon Sep 17 00:00:00 2001 From: Shen Chenhui Date: Thu, 10 Mar 2022 13:32:56 +0800 Subject: [PATCH] Fix/format (#366) --- README.md | 10 ++++------ 1 file changed, 4 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index 8528ac2a8..dd181341e 100644 --- a/README.md +++ b/README.md @@ -38,24 +38,22 @@ distributed training in a few lines. ## Examples ### ViT - - + - 14x larger batch size, and 5x faster training for Tensor Parallel = 64 ### GPT-3 - - + - Free 50% GPU resources, or 10.7% acceleration ### GPT-2 - + - 11x lower GPU RAM, or superlinear scaling ### BERT - + - 2x faster training, or 50% longer sequence length