community[patch]: Invoke callback prior to yielding token fix [HuggingFaceTextGenInference] (#20426)

…gFaceTextGenInference)

- [x] **PR title**: community[patch]: Invoke callback prior to yielding token fix for [HuggingFaceTextGenInference]
- [x] **PR message**:
    - **Description:** Invoke callback prior to yielding token in stream method in [HuggingFaceTextGenInference]
    - **Issue:** https://github.com/langchain-ai/langchain/issues/16913
    - **Dependencies:** None
    - **Twitter handle:** @bolun_zhang

If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.

---------

Co-authored-by: Chester Curme <chester.curme@gmail.com>
2025-08-31 18:38:48 +00:00 · 2024-04-18 10:25:20 -04:00
parent 2d6d796040
commit e786da7774
1 changed file with 4 additions and 2 deletions
--- a/libs/community/langchain_community/llms/huggingface_text_gen_inference.py
+++ b/libs/community/langchain_community/llms/huggingface_text_gen_inference.py
@@ -259,9 +259,10 @@ class HuggingFaceTextGenInference(LLM):
            # yield text, if any
            if text:
                chunk = GenerationChunk(text=text)
-                yield chunk
+
                if run_manager:
                    run_manager.on_llm_new_token(chunk.text)
+                yield chunk

            # break if stop sequence found
            if stop_seq_found:
@@ -295,9 +296,10 @@ class HuggingFaceTextGenInference(LLM):
            # yield text, if any
            if text:
                chunk = GenerationChunk(text=text)
-                yield chunk
+
                if run_manager:
                    await run_manager.on_llm_new_token(chunk.text)
+                yield chunk

            # break if stop sequence found
            if stop_seq_found:
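
The reordering in both hunks matters because a Python generator pauses at `yield`: any statement placed after it runs only when the consumer asks for the next item, and never runs if the consumer stops early. The following is a minimal, self-contained sketch of that behavior. It does not use LangChain; `stream_yield_first`, `stream_callback_first`, and `take_first` are hypothetical names chosen for illustration, and a plain `list.append` stands in for `run_manager.on_llm_new_token`.

```python
from typing import Callable, Iterable, Iterator, List


def stream_yield_first(
    tokens: Iterable[str], on_new_token: Callable[[str], None]
) -> Iterator[str]:
    """Old ordering: yield the token, then notify the callback."""
    for tok in tokens:
        yield tok
        # Runs only when the consumer resumes the generator.
        on_new_token(tok)


def stream_callback_first(
    tokens: Iterable[str], on_new_token: Callable[[str], None]
) -> Iterator[str]:
    """New ordering: notify the callback, then yield the token."""
    for tok in tokens:
        on_new_token(tok)
        yield tok


def take_first(stream: Iterator[str]) -> str:
    """Consume exactly one token, then close the generator."""
    tok = next(stream)
    stream.close()  # raises GeneratorExit at the paused yield
    return tok


seen_old: List[str] = []
seen_new: List[str] = []

take_first(stream_yield_first(["a", "b"], seen_old.append))
take_first(stream_callback_first(["a", "b"], seen_new.append))

print(seen_old)  # [] : the callback never saw the token the consumer received
print(seen_new)  # ['a'] : the callback fired before the token was handed over
```

With the old ordering, a consumer that reads one token and stops leaves the callback with nothing recorded, and even a consumer that reads everything sees each token before the callback does. With the new ordering, the callback has always been invoked by the time a token reaches the consumer.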