langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-04 02:33:05 +00:00

Author	SHA1	Message	Date
Chester Curme	881c6534a6	Merge branch 'master' into wip-v0.4 # Conflicts: # .github/workflows/_integration_test.yml # .github/workflows/_release.yml # .github/workflows/api_doc_build.yml # .github/workflows/people.yml # .github/workflows/run_notebooks.yml # .github/workflows/scheduled_test.yml # SECURITY.md # docs/docs/integrations/vectorstores/pgvectorstore.ipynb # libs/langchain_v1/langchain/chat_models/base.py # libs/langchain_v1/tests/integration_tests/chat_models/test_base.py # libs/langchain_v1/tests/unit_tests/chat_models/test_chat_models.py	2025-07-30 13:16:17 -04:00
ccurme	a9e52ca605	chore(openai): bump openai sdk (#32322 )	2025-07-30 10:58:18 -04:00
Kanav Bansal	e2bc8f19c0	docs(docs): update RAG tutorials link across multiple vector store docs (AstraDB, DatabricksVectorSearch, FAISS, Redis, etc.) (#32301 ) ## Description: This PR updates the internal documentation link for the RAG tutorials to reflect the updated path. Previously, the link pointed to the root `/docs/tutorials/`, which was generic. It now correctly routes to the RAG-specific tutorial page for the following vector store docs. 1. AstraDBVectorStore 2. Clickhouse 3. CouchbaseSearchVectorStore 4. DatabricksVectorSearch 5. ElasticsearchStore 6. FAISS 7. Milvus 8. MongoDBAtlasVectorSearch 9. openGauss 10. PGVector 11. PGVectorStore 12. PineconeVectorStore 13. QdrantVectorStore 14. Redis 15. SQLServer ## Issue: N/A ## Dependencies: None ## Twitter handle: N/A	2025-07-30 09:46:01 -04:00
Mason Daugherty	fbd5a238d8	fix(core): revert "fix: tool call streaming bug with inconsistent indices from Qwen3" (#32307 ) Reverts langchain-ai/langchain#32160 Original issue stems from using `ChatOpenAI` to interact with a `qwen` model. Recommended to use [langchain-qwq](https://python.langchain.com/docs/integrations/chat/qwq/) which is built for Qwen	2025-07-29 10:26:38 -04:00
HerrDings	fc2f66ca80	docs: fixed link to docs of unstructured (#32306 ) In the section [How to load documents from a directory](https://python.langchain.com/docs/how_to/document_loader_directory/) there is a link to the docs of unstructured. When you click this link, it tells you that it has moved. Accordingly this PR fixes this link in LangChain docs directly from: `https://unstructured-io.github.io/unstructured/#` to: `https://docs.unstructured.io/`	2025-07-29 10:12:22 -04:00
Mason Daugherty	0e287763cd	fix: lint	2025-07-28 18:49:43 -04:00
Copilot	0b56c1bc4b	fix: tool call streaming bug with inconsistent indices from Qwen3 (#32160 ) Fixes a streaming bug where models like Qwen3 (using OpenAI interface) send tool call chunks with inconsistent indices, resulting in duplicate/erroneous tool calls instead of a single merged tool call. ## Problem When Qwen3 streams tool calls, it sends chunks with inconsistent `index` values: - First chunk: `index=1` with tool name and partial arguments - Subsequent chunks: `index=0` with `name=None`, `id=None` and argument continuation The existing `merge_lists` function only merges chunks when their `index` values match exactly, causing these logically related chunks to remain separate, resulting in multiple incomplete tool calls instead of one complete tool call. ```python # Before fix: Results in 1 valid + 1 invalid tool call chunk1 = AIMessageChunk(tool_call_chunks=[ {"name": "search", "args": '{"query":', "id": "call_123", "index": 1} ]) chunk2 = AIMessageChunk(tool_call_chunks=[ {"name": None, "args": ' "test"}', "id": None, "index": 0} ]) merged = chunk1 + chunk2 # Creates 2 separate tool calls # After fix: Results in 1 complete tool call merged = chunk1 + chunk2 # Creates 1 merged tool call: search({"query": "test"}) ``` ## Solution Enhanced the `merge_lists` function in `langchain_core/utils/_merge.py` with intelligent tool call chunk merging: 1. Preserves existing behavior: Same-index chunks still merge as before 2. Adds special handling: Tool call chunks with `name=None`/`id=None` that don't match any existing index are now merged with the most recent complete tool call chunk 3. Maintains backward compatibility: All existing functionality works unchanged 4. Targeted fix: Only affects tool call chunks, doesn't change behavior for other list items The fix specifically handles the pattern where: - A continuation chunk has `name=None` and `id=None` (indicating it's part of an ongoing tool call) - No matching index is found in existing chunks - There exists a recent tool call chunk with a valid name or ID to merge with ## Testing Added comprehensive test coverage including: - ✅ Qwen3-style chunks with different indices now merge correctly - ✅ Existing same-index behavior preserved - ✅ Multiple distinct tool calls remain separate - ✅ Edge cases handled (empty chunks, orphaned continuations) - ✅ Backward compatibility maintained Fixes #31511. <!-- START COPILOT CODING AGENT TIPS --> --- 💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click [here](https://survey.alchemer.com/s3/8343779/Copilot-Coding-agent) to start the survey. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-28 22:31:41 +00:00
Copilot	ad88e5aaec	fix(core): resolve cache validation error by safely converting Generation to ChatGeneration objects (#32156 ) ## Problem ChatLiteLLM encounters a `ValidationError` when using cache on subsequent calls, causing the following error: ``` ValidationError(model='ChatResult', errors=[{'loc': ('generations', 0, 'type'), 'msg': "unexpected value; permitted: 'ChatGeneration'", 'type': 'value_error.const', 'ctx': {'given': 'Generation', 'permitted': ('ChatGeneration',)}}]) ``` This occurs because: 1. The cache stores `Generation` objects (with `type="Generation"`) 2. But `ChatResult` expects `ChatGeneration` objects (with `type="ChatGeneration"` and a required `message` field) 3. When cached values are retrieved, validation fails due to the type mismatch ## Solution Added graceful handling in both sync (`_generate_with_cache`) and async (`_agenerate_with_cache`) cache methods to: 1. Detect when cached values contain `Generation` objects instead of expected `ChatGeneration` objects 2. Convert them to `ChatGeneration` objects by wrapping the text content in an `AIMessage` 3. Preserve all original metadata (`generation_info`) 4. Allow `ChatResult` creation to succeed without validation errors ## Example ```python # Before: This would fail with ValidationError from langchain_community.chat_models import ChatLiteLLM from langchain_community.cache import SQLiteCache from langchain.globals import set_llm_cache set_llm_cache(SQLiteCache(database_path="cache.db")) llm = ChatLiteLLM(model_name="openai/gpt-4o", cache=True, temperature=0) print(llm.predict("test")) # Works fine (cache empty) print(llm.predict("test")) # Now works instead of ValidationError # After: Seamlessly handles both Generation and ChatGeneration objects ``` ## Changes - `libs/core/langchain_core/language_models/chat_models.py`: - Added `Generation` import from `langchain_core.outputs` - Enhanced cache retrieval logic in `_generate_with_cache` and `_agenerate_with_cache` methods - Added conversion from `Generation` to `ChatGeneration` objects when needed - `libs/core/tests/unit_tests/language_models/chat_models/test_cache.py`: - Added test case to validate the conversion logic handles mixed object types ## Impact - Backward Compatible: Existing code continues to work unchanged - Minimal Change: Only affects cache retrieval path, no API changes - Robust: Handles both legacy cached `Generation` objects and new `ChatGeneration` objects - Preserves Data: All original content and metadata is maintained during conversion Fixes #22389. <!-- START COPILOT CODING AGENT TIPS --> --- 💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more [Copilot coding agent tips](https://gh.io/copilot-coding-agent-tips) in the docs. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-28 22:28:16 +00:00
Mason Daugherty	30e3ed6a19	fix: add space in run-name for better readability	2025-07-28 17:46:27 -04:00
Mason Daugherty	8641a95c43	fix: update run-name in `scheduled_test.yml` to include dynamic inputs	2025-07-28 17:45:05 -04:00
Mason Daugherty	df70c5c186	chore: update actions run-names and add default inputs (#32293 )	2025-07-28 17:33:27 -04:00
Mason Daugherty	d5ca77e065	fix: remove erreneous rocket emoji in `run-name`	2025-07-28 17:11:14 -04:00
Mason Daugherty	b7e4797e8b	release(anthropic): 0.3.18 (#32292 )	2025-07-28 17:07:11 -04:00
Mason Daugherty	3a487bf720	refactor(anthropic): AnthropicLLM to use Messages API (#32290 ) re: #32189	2025-07-28 16:22:58 -04:00
Mason Daugherty	e5fd67024c	fix: update link text for reporting security vulnerabilities in SECURITY.md	2025-07-28 15:05:31 -04:00
Mason Daugherty	b86841ac40	fix: update alt attribute for GitHub Codespace badge in README	2025-07-28 15:04:57 -04:00
Mason Daugherty	8db16b5633	fix: use new Google model names in examples (#32288 )	2025-07-28 19:03:42 +00:00
Mason Daugherty	6f10160a45	fix: `scripts/` errors	2025-07-28 15:03:25 -04:00
Mason Daugherty	e79e0bd6b4	fix(openai): add `max_retries` parameter to ChatOpenAI for handling 503 capacity errors (#32286 ) Some integration tests were failing	2025-07-28 13:58:23 -04:00
ccurme	c55294ecb0	chore(core): add test for nested pydantic fields in schemas (#32285 )	2025-07-28 17:27:24 +00:00
Mason Daugherty	7a26c3d233	fix: update `bar_model` to use the correct model version `claude-3-7-sonnet-20250219` (#32284 )	2025-07-28 12:57:40 -04:00
Mason Daugherty	c6ffac3ce0	refactor: mdx lint (#32282 )	2025-07-28 12:56:22 -04:00
Mason Daugherty	a07d2c5016	refactor: remove references to unsupported model `claude-3-sonnet-20240229` (#32281 ) Addresses some (but not all) test issues brought about in #32280	2025-07-28 11:57:43 -04:00
Mason Daugherty	5e9eb19a83	chore: update branch with changes from master (#32277 ) Co-authored-by: Maxime Grenu <69890511+cluster2600@users.noreply.github.com> Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: jmaillefaud <jonathan.maillefaud@evooq.ch> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: tanwirahmad <tanwirahmad@users.noreply.github.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com> Co-authored-by: niceg <79145285+growmuye@users.noreply.github.com> Co-authored-by: Chaitanya varma <varmac301@gmail.com> Co-authored-by: dishaprakash <57954147+dishaprakash@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Kanav Bansal <13186335+bansalkanav@users.noreply.github.com> Co-authored-by: Aleksandr Filippov <71711753+alex-feel@users.noreply.github.com> Co-authored-by: Alex Feel <afilippov@spotware.com>	2025-07-28 10:39:41 -04:00
Aleksandr Filippov	f0b6baa0ef	fix(core): track within-batch deduplication in indexing num_skipped count (#32273 ) Description: Fixes incorrect `num_skipped` count in the LangChain indexing API. The current implementation only counts documents that already exist in RecordManager (cross-batch duplicates) but fails to count documents removed during within-batch deduplication via `_deduplicate_in_order()`. This PR adds tracking of the original batch size before deduplication and includes the difference in `num_skipped`, ensuring that `num_added + num_skipped` equals the total number of input documents. Issue: Fixes incorrect document count reporting in indexing statistics Dependencies: None Fixes #32272 --------- Co-authored-by: Alex Feel <afilippov@spotware.com>	2025-07-28 09:58:51 -04:00
Mason Daugherty	12c0e9b7d8	fix(docs): local API reference documentation build (#32271 ) ensure all relevant packages are correctly processed - cli wasn't included, also fix ValueError	2025-07-28 00:50:20 -04:00
Mason Daugherty	ed682ae62d	fix: explicitly tell uv to copy when using devcontainer (#32267 )	2025-07-28 00:01:06 -04:00
Mason Daugherty	caf1919217	fix: devcontainer to use volume to store the workspace (#32266 ) should resolve the file sharing issue for users on macOS.	2025-07-27 23:43:06 -04:00
Mason Daugherty	904066f1ec	feat: add VSCode configuration files for Python development (#32263 )	2025-07-27 23:37:59 -04:00
Mason Daugherty	96cbd90cba	fix: formatting issues in docstrings (#32265 ) Ensures proper reStructuredText formatting by adding the required blank line before closing docstring quotes, which resolves the "Block quote ends without a blank line; unexpected unindent" warning.	2025-07-27 23:37:47 -04:00
Mason Daugherty	a8a2cff129	Merge branch 'master' of github.com:langchain-ai/langchain	2025-07-27 23:34:59 -04:00
Mason Daugherty	f4ff4514ef	fix: update workspace folder path in devcontainer configuration	2025-07-27 23:34:57 -04:00
Mason Daugherty	d1679cec91	chore: add .editorconfig for consistent coding styles across files (#32261 ) Following existing codebase conventions	2025-07-27 23:25:30 -04:00
Mason Daugherty	5295f2add0	fix: update dev container name to match service name	2025-07-27 22:30:16 -04:00
Mason Daugherty	5f5b87e9a3	fix: update service name in devcontainer configuration	2025-07-27 22:28:47 -04:00
Mason Daugherty	e0ef98dac0	feat: add markdownlint configuration file (#32264 )	2025-07-27 22:24:58 -04:00
Mason Daugherty	62212c7ee2	fix: update links in SECURITY.md to use markdown format	2025-07-27 21:54:25 -04:00
Mason Daugherty	9d38f170ce	refactor: enhance workflow names and descriptions for clarity (#32262 )	2025-07-27 21:31:59 -04:00
Mason Daugherty	c6cb1fae61	fix: devcontainer (#32260 )	2025-07-27 20:24:16 -04:00
Kanav Bansal	e42b1d23dc	docs(docs): update RAG tutorials link to point to correct path (#32256 ) - Description: This PR updates the internal documentation link for the RAG tutorials to reflect the updated path. Previously, the link pointed to the root `/docs/tutorials/`, which was generic. It now correctly routes to the RAG-specific tutorial page. - Issue: N/A - Dependencies: None - Twitter handle: N/A	2025-07-27 20:00:41 -04:00
Mason Daugherty	53d0bfe9cd	refactor: markdownlint (#32259 )	2025-07-27 20:00:16 -04:00
Mason Daugherty	eafab52483	refactor: markdownlint `SECURITY.md` (#32258 )	2025-07-27 19:55:25 -04:00
Christophe Bornet	efdfa00d10	chore(langchain): add ruff rules ARG (#32110 ) See https://docs.astral.sh/ruff/rules/#flake8-unused-arguments-arg Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-26 18:32:34 -04:00
Christophe Bornet	a2ad5aca41	chore(langchain): add ruff rules TC (#31921 ) See https://docs.astral.sh/ruff/rules/#flake8-type-checking-tc	2025-07-26 18:27:26 -04:00
Mason Daugherty	5ecbb5f277	fix(docs): temporary workaround until the underlying dependency issues in the AI21 package ecosystem are resolved. (#32248 )	2025-07-25 15:12:44 -04:00
Mason Daugherty	c1028171af	fix(docs): update protobuf version constraint to <5.0 in vercel_overrides.txt (#32247 )	2025-07-25 15:08:44 -04:00
ccurme	f6236d9f12	fix(infra): add pypdf to vercel overrides (#32242 ) > × No solution found when resolving dependencies: ╰─▶ Because only langchain-neo4j==0.5.0 is available and langchain-neo4j==0.5.0 depends on neo4j-graphrag>=1.9.0, we can conclude that all versions of langchain-neo4j depend on neo4j-graphrag>=1.9.0. And because only neo4j-graphrag<=1.9.0 is available and neo4j-graphrag==1.9.0 depends on pypdf>=5.1.0,<6.0.0, we can conclude that all versions of langchain-neo4j depend on pypdf>=5.1.0,<6.0.0. And because langchain-upstage==0.6.0 depends on pypdf>=4.2.0,<5.0.0 and only langchain-upstage==0.6.0 is available, we can conclude that all versions of langchain-neo4j and all versions of langchain-upstage are incompatible. And because you require langchain-neo4j and langchain-upstage, we can conclude that your requirements are unsatisfiable. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-25 15:05:21 -04:00
Mason Daugherty	df20f111a8	fix(docs): add validation for repository format and name in API docs build workflow (#32246 ) for build	2025-07-25 15:05:06 -04:00
Eugene Yurtsev	db22311094	ci(infra): no need for `.` in the regexp (#32245 ) No need for allowing `.`	2025-07-25 15:02:02 -04:00
Mason Daugherty	f624ad489a	feat(docs): improve devx, fix `Makefile` targets (#32237 ) TL;DR much of the provided `Makefile` targets were broken, and any time I wanted to preview changes locally I either had to refer to a command Chester gave me or try waiting on a Vercel preview deployment. With this PR, everything should behave like normal. Significant updates to the `Makefile` and documentation files, focusing on improving usability, adding clear messaging, and fixing/enhancing documentation workflows. ### Updates to `Makefile`: #### Enhanced build and cleaning processes: - Added informative messages (e.g., "📚 Building LangChain documentation...") to makefile targets like `docs_build`, `docs_clean`, and `api_docs_build` for better user feedback during execution. - Introduced a `clean-cache` target to the `docs` `Makefile` to clear cached dependencies and ensure clean builds. #### Improved dependency handling: - Modified `install-py-deps` to create a `.venv/deps_installed` marker, preventing redundant/duplicate dependency installations and improving efficiency. #### Streamlined file generation and infrastructure setup: - Added caching for the LangServe README download and parallelized feature table generation - Added user-friendly completion messages for targets like `copy-infra` and `render`. #### Documentation server updates: - Enhanced the `start` target with messages indicating server start and URL for local documentation viewing. --- ### Documentation Improvements: #### Content clarity and consistency: - Standardized section titles for consistency across documentation files. [[1]](diffhunk://#diff-9b1a85ea8a9dcf79f58246c88692cd7a36316665d7e05a69141cfdc50794c82aL1-R1) [[2]](diffhunk://#diff-944008ad3a79d8a312183618401fcfa71da0e69c75803eff09b779fc8e03183dL1-R1) - Refined phrasing and formatting in sections like "Dependency management" and "Formatting and linting" for better readability. [[1]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L6-R6) [[2]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L84-R82) #### Enhanced workflows: - Updated instructions for building and viewing documentation locally, including tips for specifying server ports and handling API reference previews. [[1]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L60-R94) [[2]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L82-R126) - Expanded guidance on cleaning documentation artifacts and using linting tools effectively. [[1]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L82-R126) [[2]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L107-R142) #### API reference documentation: - Improved instructions for generating and formatting in-code documentation, highlighting best practices for docstring writing. [[1]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L107-R142) [[2]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L144-R186) --- ### Minor Changes: - Added support for a new package name (`langchain_v1`) in the API documentation generation script. - Fixed minor capitalization and formatting issues in documentation files. [[1]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L40-R40) [[2]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L166-R160) --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-25 14:49:03 -04:00

1 2 3 4 5 ...

13922 Commits