[example] Update Llama Inference example (#5629)

* [example] add inference benchmark llama3

* revise inference config - arg

* remove unused args

* add llama generation demo script

* fix init rope in llama policy

* add benchmark-llama3 - cleanup
Yuanheng Zhao
2024-04-23 22:23:07 +08:00
committed by GitHub
parent 12f10d5b0b
commit 04863a9b14
4 changed files with 323 additions and 12 deletions

@@ -100,5 +100,5 @@ class NoPaddingLlamaModelInferPolicy(LlamaForCausalLMPolicy):
         return policy
 
     def postprocess(self):
-        init_to_get_rotary(self.model.model)
+        init_to_get_rotary(self.model.model, self.model.config.rope_theta)
         return self.model
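
Note on the rope fix: the hunk above only shows that `rope_theta` from the model config is now passed to `init_to_get_rotary`; the body of that function is not part of this diff. The sketch below is therefore not the repository's implementation. It is a minimal, standard-library illustration of why the RoPE base matters, assuming the usual formulation where inverse frequencies are `base ** (-2i / head_dim)`. The helper names `rotary_inv_freq` and `rotary_cos_sin` are hypothetical, and the two base values compared (10000 as the common default, 500000 as used in Llama 3 configs) are there for illustration.

```python
import math


def rotary_inv_freq(head_dim, base=10000.0):
    """Hypothetical helper: RoPE inverse frequencies base ** (-2i / head_dim)."""
    return [base ** (-(2 * i) / head_dim) for i in range(head_dim // 2)]


def rotary_cos_sin(seq_len, head_dim, base=10000.0):
    """Hypothetical helper: per-position cos/sin tables built from the inverse frequencies."""
    inv_freq = rotary_inv_freq(head_dim, base)
    cos = [[math.cos(pos * f) for f in inv_freq] for pos in range(seq_len)]
    sin = [[math.sin(pos * f) for f in inv_freq] for pos in range(seq_len)]
    return cos, sin


# Same head dimension, two different bases: the tables differ, so a cache
# built with a hard-coded default would not match a model whose config
# specifies a different rope_theta.
default_freqs = rotary_inv_freq(128, base=10000.0)
llama3_freqs = rotary_inv_freq(128, base=500000.0)
cos, sin = rotary_cos_sin(seq_len=4, head_dim=128, base=500000.0)
```

With a larger base, the lowest frequency shrinks, so the rotation angles at a given position differ from those computed with the default. That mismatch is presumably what passing `self.model.config.rope_theta` avoids.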