langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Author	SHA1	Message	Date
Dat Nguyen	e71e6564b1	feat(ollama): add `dimensions` to `OllamaEmbeddings` (#36543 ) Fixes #34623 Add `dimensions` field to `OllamaEmbeddings` to allow users to specify output embedding size for models that support variable dimensions. The field is passed directly to the Ollama client's `embed()` call for both sync and async methods. How I verified it works: - Ran unit tests: `python -m pytest tests/unit_tests/ -v` - Ran integration tests against a live Ollama instance: `OLLAMA_HOST=http://ollama:11434 python -m pytest tests/integration_tests/ -v` - Confirmed that passing `dimensions=768` no longer raises `extra_forbidden` Pydantic validation error and returns embeddings of the expected size. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-04-06 21:50:54 -04:00
Yi Liu	19ddd42891	fix(ollama): raise error when clients are not initialized (#35185 ) ## Summary - When `self._client` is `None` in `_create_chat_stream()`, the method silently produces an empty generator instead of failing. - The error only surfaces later as a misleading `"No data received from Ollama stream"` ValueError, making it difficult to diagnose the actual root cause (uninitialized client). - Changed to raise `RuntimeError` immediately with a clear message when the sync client is not initialized. ## Why this matters Users who hit this path see a confusing error message that points them in the wrong direction. An explicit error at the point of failure makes debugging straightforward. ## Test plan - [x] Added `test_create_chat_stream_raises_when_client_none` - [x] Existing tests still pass > This PR was authored with the help of AI tools. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2026-02-12 11:56:53 -05:00
Mason Daugherty	18c25e9f10	chore: ban relative imports on all packages (#34691 )	2026-01-09 17:02:24 -05:00
Mason Daugherty	d40e340479	chore: attribute package change versions (#33854 ) Needed to disambiguate for within inherited docs	2025-11-06 16:57:30 -05:00
Mason Daugherty	6fc21afbc9	style: `.. code-block::` admonition translations (#33400 ) biiiiiiiiiiiiiiiigggggggg pass	2025-10-09 16:52:58 -04:00
Mason Daugherty	d8a680ee57	style: address Sphinx double-backtick snippet syntax (#33389 )	2025-10-09 13:35:51 -04:00
Mason Daugherty	3576e690fa	chore: update Sphinx links to markdown (#33386 )	2025-10-09 11:54:14 -04:00
Mason Daugherty	b6132fc23e	style: remove more `Optional` syntax (#33371 )	2025-10-08 23:28:43 -04:00
Mason Daugherty	31eeb50ce0	chore: drop UP045 (#33362 ) Python 3.9 EOL	2025-10-08 21:17:53 -04:00
Mason Daugherty	d13823043d	style: monorepo pass for refs (#33359 ) * Delete some double backticks previously used by Sphinx (not done everywhere yet) * Fix some code blocks / dropdowns Ignoring CLI CI for now	2025-10-08 18:41:39 -04:00
Mason Daugherty	ae5b105d11	docs: v1 docs updates (#33173 ) Co-authored-by: Mohammad Mohtashim <45242107+keenborder786@users.noreply.github.com> Co-authored-by: Caspar Broekhuizen <caspar@langchain.dev> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Sadra Barikbin <sadraqazvin1@yahoo.com> Co-authored-by: Vadym Barda <vadim.barda@gmail.com>	2025-10-02 18:46:26 -04:00
Mason Daugherty	a89c549cb0	feat(ollama): add basic auth support (#32328 ) support for URL authentication in the format `https://user:password@host:port` for all LangChain Ollama clients. Related to #32327 and #25055	2025-10-01 20:46:37 -04:00
Mason Daugherty	b92b394804	style: repo linting pass (#33089 ) enable docstring-code-format	2025-09-24 15:25:55 -04:00
Mason Daugherty	ee4c2510eb	feat: port various nit changes from `wip-v0.4` (#32506 ) Lots of work that wasn't directly related to core improvements/messages/testing functionality	2025-08-11 15:09:08 -04:00
Mason Daugherty	96cbd90cba	fix: formatting issues in docstrings (#32265 ) Ensures proper reStructuredText formatting by adding the required blank line before closing docstring quotes, which resolves the "Block quote ends without a blank line; unexpected unindent" warning.	2025-07-27 23:37:47 -04:00
Mason Daugherty	d65da13299	docs(ollama): add `validate_model_on_init` note, bump lock (#32172 )	2025-07-22 10:58:45 -04:00
Copilot	98c3bbbaf0	fix(ollama): `num_gpu` parameter not working in async OllamaEmbeddings method (#32074 ) The `num_gpu` parameter in `OllamaEmbeddings` was not being passed to the Ollama client in the async embedding method, causing GPU acceleration settings to be ignored when using async operations. ## Problem The issue was in the `aembed_documents` method where the `options` parameter (containing `num_gpu` and other configuration) was missing: ```python # Sync method (working correctly) return self._client.embed( self.model, texts, options=self._default_params, keep_alive=self.keep_alive )["embeddings"] # Async method (missing options parameter) return ( await self._async_client.embed( self.model, texts, keep_alive=self.keep_alive # ❌ No options! ) )["embeddings"] ``` This meant that when users specified `num_gpu=4` (or any other GPU configuration), it would work with sync calls but be ignored with async calls. ## Solution Added the missing `options=self._default_params` parameter to the async embed call to match the sync version: ```python # Fixed async method return ( await self._async_client.embed( self.model, texts, options=self._default_params, # ✅ Now includes num_gpu! keep_alive=self.keep_alive, ) )["embeddings"] ``` ## Validation - ✅ Added unit test to verify options are correctly passed in both sync and async methods - ✅ All existing tests continue to pass - ✅ Manual testing confirms `num_gpu` parameter now works correctly - ✅ Code passes linting and formatting checks The fix ensures that GPU configuration works consistently across both synchronous and asynchronous embedding operations. Fixes #32059.
--------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-16 18:42:52 -04:00
Mason Daugherty	1f829aacf4	ollama[patch]: ruff fixes and rules (#31924 ) * bump ruff deps * add more thorough ruff rules * fix said rules	2025-07-08 13:42:19 -04:00
Mason Daugherty	e686a70ee0	ollama: thinking, tool streaming, docs, tests (#31772 ) * New `reasoning` (bool) param to support toggling [Ollama thinking](https://ollama.com/blog/thinking) (#31573, #31700). If `reasoning=True`, Ollama's `thinking` content will be placed in the model responses' `additional_kwargs.reasoning_content`. * Supported by: * ChatOllama (class level, invocation level TODO) * OllamaLLM (TODO) * Added tests to ensure streaming tool calls is successful (#29129) * Refactored tests that relied on `extract_reasoning()` * Myriad docs additions and consistency/typo fixes * Improved type safety in some spots Closes #29129 Addresses #31573 and #31700 Supersedes #31701	2025-07-07 13:56:41 -04:00
Mason Daugherty	572020c4d8	ollama: add `validate_model_on_init`, catch more errors (#31784 ) * Ensure access to local model during `ChatOllama` instantiation (#27720). This adds a new param `validate_model_on_init` (default: `true`) * Catch a few more errors from the Ollama client to assist users	2025-07-03 11:07:11 -04:00
Mason Daugherty	e1aff00cc1	groq: support `reasoning_effort`, update docs for clarity (#31754 ) - There was some ambiguous wording that has been updated to hopefully clarify the functionality of `reasoning_format` in ChatGroq. - Added support for `reasoning_effort` - Added links to see models capable of `reasoning_format` and `reasoning_effort` - Other minor nits	2025-06-27 09:43:40 -04:00
Mason Daugherty	59c2b81627	docs: fix some inline links (#31748 )	2025-06-26 13:35:14 -04:00
Alexey Bondarenko	9efafe3337	ollama: Add separate kwargs parameter for async client (#31209 ) Description: Add an `async_client_kwargs` field to ollama chat/llm/embeddings adapters that is passed to the async httpx client constructor. Motivation: In my use-case: - chat/embedding model adapters may be created frequently, sometimes to be called just once or to never be called at all - they may be used in both sync and async mode (not known at the moment they are created) So, I want to keep a static transport instance maintaining a connection pool, so model adapters can be created and destroyed freely. But that doesn't work when both sync and async functions are in use, as I can only pass one transport instance for both the sync and async client, while transport types must be different for them. So I can't make both sync and async calls use a shared transport with the current model adapter interfaces. In this PR I add a separate `async_client_kwargs` that gets passed to the async client constructor, so it will be possible to pass a separate transport instance. For the sake of backwards compatibility, it is merged with `client_kwargs`, so nothing changes when it is not set. I am unable to run the linter right now, but the changes look ok.	2025-05-15 16:10:10 -04:00
Sydney Runkle	8c6734325b	partners[lint]: run `pyupgrade` to get code in line with 3.9 standards (#30781 ) Using `pyupgrade` to get all `partners` code up to 3.9 standards (mostly, fixing old `typing` imports).	2025-04-11 07:18:44 -04:00
Stavros Kontopoulos	ac22cde130	langchain_ollama: Support keep_alive in embeddings (#30251 ) - Description: Adds support for keep_alive in Ollama Embeddings, see https://github.com/ollama/ollama/issues/6401. Builds on top of https://github.com/langchain-ai/langchain/pull/29296. I have this use case where I want to keep the embeddings model in cpu forever. - Dependencies: no deps are being introduced. - Issue: haven't created an issue yet.	2025-03-14 14:56:50 -04:00
Syed Baqar Abbas	f175319303	[feat] Added backwards compatibility for OllamaEmbeddings initialization (migration from `langchain_community.embeddings` to `langchain_ollama.embeddings`) (#29296 ) - [feat] Added backwards compatibility for OllamaEmbeddings initialization (migration from `langchain_community.embeddings` to `langchain_ollama.embeddings`): "langchain_ollama" - Description: Given that `OllamaEmbeddings` from `langchain_community.embeddings` is deprecated, code is being shifted to `langchain_ollama.embeddings`. However, this does not offer backward compatibility for initializing the parameters and the `OllamaEmbeddings` object. - Issue: #29294 - Dependencies: None - Twitter handle: @BaqarAbbas2001 ## Additional Information Previously, `OllamaEmbeddings` from `langchain_community.embeddings` used to support the following options: `e9abe583b2/libs/community/langchain_community/embeddings/ollama.py (L125-L139)` However, in the new package `from langchain_ollama import OllamaEmbeddings`, there is no method to set these options. I have added these parameters to resolve this issue. This issue was also discussed in https://github.com/langchain-ai/langchain/discussions/29113	2025-01-20 11:16:29 -05:00
Erick Friis	aa7fa80e1e	partners/ollama: release 0.2.2rc1 (#28300 )	2024-11-22 22:25:05 +00:00
JiaranI	3952ee31b8	ollama: add pydocstyle linting for ollama (#27686 ) Description: add lint docstrings for ollama module Issue: the issue https://github.com/langchain-ai/langchain/issues/23188 @baskaryan test: ruff check passed. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-31 03:06:55 +00:00
Erick Friis	c2a3021bb0	multiple: pydantic 2 compatibility, v0.3 (#26443 ) Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Dan O'Donovan <dan.odonovan@gmail.com> Co-authored-by: Tom Daniel Grande <tomdgrande@gmail.com> Co-authored-by: Grande <Tom.Daniel.Grande@statsbygg.no> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: ZhangShenao <15201440436@163.com> Co-authored-by: Friso H. Kingma <fhkingma@gmail.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Morgante Pell <morgantep@google.com>	2024-09-13 14:38:45 -07:00
Eugene Yurtsev	f4196f1fb8	ollama[patch]: Update extra in ollama package (#25383 ) Backwards compatible change that converts pydantic extras to literals which is consistent with pydantic 2 usage.	2024-08-14 10:30:01 -04:00
Eugene Yurtsev	f679ed72ca	ollama[patch]: Update API Reference for ollama embeddings (#25315 ) Update API reference for OllamaEmbeddings Issue: https://github.com/langchain-ai/langchain/issues/24856	2024-08-12 21:31:48 -04:00
Isaac Francisco	63ddf0afb4	ollama: allow base_url, headers, and auth to be passed (#25078 )	2024-08-05 15:39:36 -07:00
Isaac Francisco	464a525a5a	[partner]: minor change to embeddings for Ollama (#24521 )	2024-07-24 00:00:13 +00:00
Isaac Francisco	838464de25	ollama: init package (#23615 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-20 00:43:29 +00:00

34 Commits