community[patch]: OpenLLM Async Client Fixes and Timeout Parameter (#20007)

Same changes as this merged
[PR](https://github.com/langchain-ai/langchain/pull/17478), but for the
async client, as the same issues persist.

- Replaced the 'responses' attribute of OpenLLM's GenerationOutput schema
with 'outputs' (see the schema sketch after this list).
reference:
66de54eae7/openllm-core/src/openllm_core/_schemas.py (L135)

- Added a timeout parameter for the async client (a usage sketch follows
the diff below).

---------

Co-authored-by: Seray Arslan <seray.arslan@knime.com>

@@ -308,10 +308,12 @@ class OpenLLM(LLM):
             self._identifying_params["model_name"], **copied
         )
         if self._client:
-            async_client = openllm.client.AsyncHTTPClient(self.server_url)
+            async_client = openllm.client.AsyncHTTPClient(self.server_url, self.timeout)
             res = (
-                await async_client.generate(prompt, **config.model_dump(flatten=True))
-            ).responses[0]
+                (await async_client.generate(prompt, **config.model_dump(flatten=True)))
+                .outputs[0]
+                .text
+            )
         else:
             assert self._runner is not None
             (