langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-21 02:19:31 +00:00

Author	SHA1	Message	Date
Yasien Dwieb	ebb1024966	Merge branch 'master' into fix/rag-tutorial-part-1	2025-07-19 12:36:57 +03:00
Yasien Dwieb	c3fc968f68	rename to vector_size for better semantics	2025-07-19 12:36:38 +03:00
Isaac Francisco	98bfd57a76	fix(core): better error message for empty var names (#32073 ) Previously, we hit an index out of range error with empty variable names (accessing tag[0]), now we through a slightly nicer error --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-18 17:00:02 -04:00
Gurram Siddarth Reddy	427d2d6397	fix(core): implement sleep delay in FakeMessagesListChatModel `_generate` (#32014 ) implement sleep delay in FakeMessagesListChatModel._generate so the sleep parameter is respected, matching the documented behavior. This adds artificial latency between responses for testing purposes. Issue: closes [#31974](https://github.com/langchain-ai/langchain/issues/31974) following [docs](https://python.langchain.com/api_reference/core/language_models/langchain_core.language_models.fake_chat_models.FakeMessagesListChatModel.html#langchain_core.language_models.fake_chat_models.FakeMessagesListChatModel.sleep) Dependencies: none Twitter handle: [@siddarthreddyg2](https://x.com/siddarthreddyg2) --------- Signed-off-by: Siddarthreddygsr <siddarthreddygsr@gmail.com>	2025-07-18 15:54:28 -04:00
Yasien Dwieb	cea3f8ed8a	ensure embedding dimension value is dynamically set	2025-07-18 19:31:03 +03:00
Yasien Dwieb	a8d5810d6e	fix error about test collection test not found	2025-07-18 19:14:39 +03:00
Kanav Bansal	50a12a7ee5	fix(docs): fix broken link in VertexAILLM and NVIDIA LLM integrations (#32096 ) ## Description: This PR updates the `link` values for the following integration metadata entries: 1. VertexAILLM - Changed from: `google_vertexai` - To: `google_vertex_ai_palm` 2. NVIDIA - Changed from: `NVIDIA` - To: `nvidia_ai_endpoints` These changes ensure that the documentation links correspond to the correct integration paths, improving documentation navigation and consistency with the integration structure. ## Issue: N/A ## Dependencies: None ## Twitter handle: N/A Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-18 14:00:49 +00:00
Kanav Bansal	72a0f425ec	docs(docs): correct package name from langchain-google_vertexai to langchain-google-vertexai for VertexAILLM (#32095 ) - Description: This PR updates the `package` field for the VertexAI integration in the documentation metadata. The original value was `langchain-google_vertexai`, which has been corrected to `langchain-google-vertexai` to reflect the actual package name used in PyPI and LangChain integrations. - Issue: N/A - Dependencies: None - Twitter handle: N/A	2025-07-18 09:45:28 -04:00
Sarah Guthals	22535eb4b3	docs: add tensorlake provider (#32046 )	2025-07-17 19:28:14 -04:00
open-swe[bot]	5da986c3f6	fix(core): JSON Schema reference resolution for list indices (#32088 ) Fixes #32042 ## Summary Fixes a critical bug in JSON Schema reference resolution that prevented correctly dereferencing numeric components in JSON pointer paths, specifically for list indices in `anyOf`, `oneOf`, and `allOf` arrays. ## Changes - Fixed `_retrieve_ref` function in `libs/core/langchain_core/utils/json_schema.py` to properly handle numeric components - Added comprehensive test function `test_dereference_refs_list_index()` in `libs/core/tests/unit_tests/utils/test_json_schema.py` - Resolved line length formatting issues - Improved type checking and index validation for list and dictionary references ## Key Improvements - Correctly handles list index references in JSON pointer paths - Maintains backward compatibility with existing dictionary numeric key functionality - Adds robust error handling for out-of-bounds and invalid indices - Passes all test cases covering various reference scenarios ## Test Coverage - Verified fix for `#/properties/payload/anyOf/1/properties/startDate` reference - Tested edge cases including out-of-bounds and negative indices - Ensured no regression in existing reference resolution functionality Resolves the reported issue with JSON Schema reference dereferencing for list indices. --------- Co-authored-by: open-swe-dev[bot] <open-swe-dev@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-17 15:54:38 -04:00
Mason Daugherty	6d449df8bb	chore: update PR lint (#32091 ) remove regex	2025-07-17 15:33:48 -04:00
ccurme	3f4d27fe21	fix(infra): update some notebook cassettes (#32087 )	2025-07-17 13:57:29 -04:00
Mason Daugherty	59407338dd	docs: remove AI21 embeddings section (#32084 ) // no longer exists	2025-07-17 11:32:34 -04:00
Mason Daugherty	a1519af513	fix(docs): fix broken links (#32083 )	2025-07-17 10:38:51 -04:00
Christophe Bornet	b61ce9178c	refactor(langchain): remove `model_rebuild` (#32080 ) Since #29963 BaseCache and Callbacks are imported in BaseLanguageModel so there's no need to import them and rebuild the models. Note: fix is available since `langchain-core==0.3.39` and the current langchain dependency on core is `>=0.3.66` so the fix will always be there.	2025-07-17 10:34:41 -04:00
Mason Daugherty	9165cde538	feat(docs): add Slack community link to footer (#32053 )	2025-07-17 10:12:09 -04:00
Kanav Bansal	2c0e8dce0d	docs(docs): fix broken link in Google Gemini text embedding integration (#32082 ) - Description: Corrected the `link` path in the Google Gemini integration entry from `/docs/integrations/text_embedding/google-generative-ai` to `/docs/integrations/text_embedding/google_generative_ai` to align with actual directory structure and prevent broken documentation links. - Issue: N/A - Dependencies: None - Twitter handle: N/A	2025-07-17 09:58:07 -04:00
Mason Daugherty	491f63ca82	release(ollama): release 0.3.5 (#32076 ) langchain-ollama==0.3.5	2025-07-16 18:45:32 -04:00
Mason Daugherty	587c213760	bump lcok	2025-07-16 18:44:56 -04:00
Copilot	98c3bbbaf0	fix(ollama): `num_gpu` parameter not working in async OllamaEmbeddings method (#32074 ) The `num_gpu` parameter in `OllamaEmbeddings` was not being passed to the Ollama client in the async embedding method, causing GPU acceleration settings to be ignored when using async operations. ## Problem The issue was in the `aembed_documents` method where the `options` parameter (containing `num_gpu` and other configuration) was missing: ```python # Sync method (working correctly) return self._client.embed( self.model, texts, options=self._default_params, keep_alive=self.keep_alive )["embeddings"] # Async method (missing options parameter) return ( await self._async_client.embed( self.model, texts, keep_alive=self.keep_alive # ❌ No options! ) )["embeddings"] ``` This meant that when users specified `num_gpu=4` (or any other GPU configuration), it would work with sync calls but be ignored with async calls. ## Solution Added the missing `options=self._default_params` parameter to the async embed call to match the sync version: ```python # Fixed async method return ( await self._async_client.embed( self.model, texts, options=self._default_params, # ✅ Now includes num_gpu! keep_alive=self.keep_alive, ) )["embeddings"] ``` ## Validation - ✅ Added unit test to verify options are correctly passed in both sync and async methods - ✅ All existing tests continue to pass - ✅ Manual testing confirms `num_gpu` parameter now works correctly - ✅ Code passes linting and formatting checks The fix ensures that GPU configuration works consistently across both synchronous and asynchronous embedding operations. Fixes #32059. <!-- START COPILOT CODING AGENT TIPS --> --- 💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more [Copilot coding agent tips](https://gh.io/copilot-coding-agent-tips) in the docs. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-16 18:42:52 -04:00
efj-amzn	d3072e2d2e	feat(core): update `_import_utils.py` to not mask the thrown exception (#32071 )	2025-07-16 17:11:56 -04:00
Lauren Hirata Singh	b49372595e	docs: update LangSmith links (#32070 )	2025-07-16 16:31:28 -04:00
Mason Daugherty	16664d3b68	fix(docs): make docs link absolute (#32068 )	2025-07-16 20:15:28 +00:00
Inácio Nery	ea8f2a05ba	feat(perplexity): expose `search_results` in chat model (#31468 ) Description The Perplexity chat model already returns a search_results field, but LangChain dropped it when mapping Perplexity responses to additional_kwargs. This patch adds "search_results" to the allowed attribute lists in both _stream and _generate, so downstream code can access it just like images, citations, or related_questions. Dependencies None. The change is purely internal; no new imports or optional dependencies required. https://community.perplexity.ai/t/new-feature-search-results-field-with-richer-metadata/398 --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-16 15:16:35 -04:00
zygimantas-jac	2df05f6f6a	docs: add oxylabs to web browsing table (#31931 ) Added Oxylabs to the web browsing table Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-16 14:00:14 -04:00
nikk0o046	b1c7de98f5	fix(deepseek): convert tool output arrays to strings (#31913 ) ## Description When ChatDeepSeek invokes a tool that returns a list, it results in an openai.UnprocessableEntityError due to a failure in deserializing the JSON body. The root of the problem is that ChatDeepSeek uses BaseChatOpenAI internally, but the APIs are not identical: OpenAI v1/chat/completions accepts arrays as tool results, but Deepseek API does not. As a solution added `_get_request_payload` method to ChatDeepSeek, which inherits the behavior from BaseChatOpenAI but adds a step to stringify tool message content in case the content is an array. I also add a unit test for this. From the linked issue you can find the full reproducible example the reporter of the issue provided. After the changes it works as expected. Source: [Deepseek docs](https://api-docs.deepseek.com/api/create-chat-completion/) ![image](https://github.com/user-attachments/assets/a59ed3e7-6444-46d1-9dcf-97e40e4e8952) Source: [OpenAI docs](https://platform.openai.com/docs/api-reference/chat/create) ![image](https://github.com/user-attachments/assets/728f4fc6-e1a3-4897-b39f-6f1ade07d3dc) ## Issue Fixes #31394 ## Dependencies: No new dependencies. ## Twitter handle: Don't have one. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-16 12:19:44 -04:00
Mohammad Mohtashim	96bf8262e2	fix: fixing missing Docstring Bug if no Docstring is provided in BaseModel class (#31608 ) - Description: Ensure that the tool description is an empty string when creating a Structured Tool from a Pydantic class in case no description is provided - Issue: Fixes #31606 --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-16 11:56:05 -04:00
Mason Daugherty	15103b0520	chore: add closing keyword to PR template (#32065 )	2025-07-16 11:54:26 -04:00
Casi	686a6b754c	fix: issue a warning if `np.nan` or `np.inf` are in `_cosine_similarity` argument Matrices (#31532 ) - Description: issues a warning if inf and nan are passed as inputs to langchain_core.vectorstores.utils._cosine_similarity - Issue: Fixes #31496 - Dependencies: no external dependencies added, only warnings module imported --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-16 11:50:09 -04:00
Michael Li	12d370a55a	fix(cli): exception to prevent swallowing unexpected errors (#31983 )	2025-07-16 10:23:43 -04:00
Michael Li	5a4c0c0816	fix(cli): handle exception in remove() (#31982 )	2025-07-16 10:23:02 -04:00
Krishna Somani	e2dc36b126	chore: update SECURITY.md (#32060 ) Made minor changes, making it neat	2025-07-16 10:20:59 -04:00
Kanav Bansal	c133eff6c8	docs(docs): fix product name in Google SQL for MySQL description (#32062 ) - Description: Corrected the service name from "Cloud Cloud SQL" to "Google Cloud SQL" to accurately reflect the official product branding.	2025-07-16 10:17:59 -04:00
Ahmad Elmalah	1892a67eef	docs: adding context for Textract linearization-config param (#32064 ) Before jumping into tech implementation, I added a context for linearization-config param, and explained what's linealization in this context. I also linked an AWS blog for more advanced use cases, as this single example doesn't cover all use cases. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-16 10:17:20 -04:00
Ahmad Elmalah	2ab2cab203	docs: update titles for Textract examples (#32063 ) On this PR I am doing two things: 1. Adding titles to the 4 example we have, to allow the reader to capture the essence of the paragraph quickly 2. Replacing 'samples' with 'examples', for more clarity, Why 'examples' could be a better terminology over 'samples' here? 1. On the page, we were using both 'samples' and 'examples' interchangeably which lead to confusion, now 'examples' are the use cases, while 'samples' are the the sample data being used 2. This is consistent with the rest of the docs, we typically use 'examples' for examples, for example https://python.langchain.com/docs/integrations/callbacks/fiddler/	2025-07-16 10:17:02 -04:00
Mason Daugherty	ad44f0688b	release(core): release 0.3.69 (#32056 ) langchain-core==0.3.69	2025-07-15 17:13:46 -04:00
Mason Daugherty	8ad12f3fcf	docs: add missing js providers to table (#32055 ) Update to show that Cerebras, xAI, and Cloudflare now have JS/TS equivalents	2025-07-15 17:09:35 -04:00
Jacob Lee	535ba43b0d	feat(core): add an option to make deserialization more permissive (#32054 ) ## Description Currently when deserializing objects that contain non-deserializable values, we throw an error. However, there are cases (e.g. proxies that return response fields containing extra fields like Python datetimes), where these values are not important and we just want to drop them. Twitter handle: @hacubu --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-15 17:00:01 -04:00
Eugene Yurtsev	3628dccbf3	chore(docs): add gtm tag to docs (#32048 ) Docusarus gtm langchain v2	2025-07-15 15:58:11 -04:00
Mason Daugherty	8d2135ad8a	chore: update links to LangChain how-to guides in issue templates (#32052 )	2025-07-15 15:38:35 -04:00
Mason Daugherty	8199a5562a	chore: update hyperlink (#32051 )	2025-07-15 14:41:27 -04:00
Mason Daugherty	0807711dad	chore: update contribution guidelines and templates to direct users to the LangChain Forum (#32050 )	2025-07-15 14:39:40 -04:00
Eugene Yurtsev	7a36d6b99c	chore(docs): bump langgraph in docs & reformat all docs (#32044 ) Trying to unblock documentation build pipeline * Bump langgraph dep in docs * Update langgraph in lock file (resolves an issue in API reference generation)	2025-07-15 15:06:59 +00:00
Mason Daugherty	3b9dd1eba0	docs(groq): cleanup (#32043 )	2025-07-15 10:37:37 -04:00
Eugene Yurtsev	02d0a9af6c	chore(core): unpin packaging dependency (#32032 ) Unpin packaging dependency --------- Co-authored-by: ntjohnson1 <24689722+ntjohnson1@users.noreply.github.com>	2025-07-14 21:42:32 +00:00
Christophe Bornet	953592d4f7	feat(langchain): add ruff rules G (#32029 ) https://docs.astral.sh/ruff/rules/#flake8-logging-format-g	2025-07-14 15:19:36 -04:00
Christophe Bornet	19fff8cba9	feat(langchain): add ruff rules DTZ (#32021 ) See https://docs.astral.sh/ruff/rules/#flake8-datetimez-dtz	2025-07-14 12:47:16 -04:00
Marco Vinciguerra	26c2c8f70a	docs: update ScrapeGraphAI tools (#32026 ) It was outdated --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-14 12:38:55 -04:00
Hunter Lovell	d96b75f9d3	chore: update readme with forum link (#32027 )	2025-07-14 09:15:26 -07:00
Ahmad Elmalah	2fdccd789c	docs: update Textract docs (#31992 ) I am modifying two things: 1. "This sample demonstrates" with "The following samples demonstrate" as we're talking about at least 4 samples 2. Bringing the sentence to after talking about the definition of textract to keep the document organized (textract definition then samples) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-14 15:36:29 +00:00

1 2 3 4 5 ...

13806 Commits