langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-08 06:23:20 +00:00

Author	SHA1	Message	Date
ccurme	088095b663	release(openai): release 0.3.29 (#32463 )	2025-08-08 11:04:33 -04:00
Mason Daugherty	c31236264e	chore: formatting across codebase (#32466 )	2025-08-08 10:20:10 -04:00
ccurme	02001212b0	fix(openai): revert some changes (#32462 ) Keep coverage on `output_version="v0"` (increasing coverage is being managed in v0.4 branch).	2025-08-08 08:51:18 -04:00
Mason Daugherty	00244122bd	feat(openai): `minimal` and `verbosity` (#32455 )	2025-08-08 02:24:21 +00:00
ccurme	6727d6e8c8	release(core): 0.3.74 (#32454 )	2025-08-07 16:39:01 -04:00
Michael Matloka	5036bd7adb	fix(openai): don't crash get_num_tokens_from_messages on gpt-5 (#32451 )	2025-08-07 16:33:19 -04:00
ccurme	ec2b34a02d	feat(openai): custom tools (#32449 )	2025-08-07 16:30:01 -04:00
Mason Daugherty	145d38f7dd	test(openai): add tests for `prompt_cache_key` parameter and update docs (#32363 ) Introduce tests to validate the behavior and inclusion of the `prompt_cache_key` parameter in request payloads for the `ChatOpenAI` model.	2025-08-07 15:29:47 -04:00
ccurme	68c70da33e	fix(openai): add in `output_text` (#32450 ) This property was deleted in `openai==1.99.2`.	2025-08-07 15:23:56 -04:00
Eugene Yurtsev	754528d23f	feat(langchain): add stuff and map reduce chains (#32333 ) * Add stuff and map reduce chains * We'll need to rename and add unit tests to the chains prior to official release	2025-08-07 15:20:05 -04:00
Christophe Bornet	a647073b26	feat(standard-tests): add a property to set the name of the parameter for the number of results to return (#32443 ) Not all retrievers use `k` as param name to set the number of results to return. Even in LangChain itself. Eg: `bc4251b9e0/libs/core/langchain_core/indexing/in_memory.py (L31)` So it's helpful to be able to change it for a given retriever. The change also adds hints to disable the tests if the retriever doesn't support setting the param in the constructor or in the invoke method (for instance, the `InMemoryDocumentIndex` in the link supports in the constructor but not in the invoke method). This change is backward compatible. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-08-07 11:22:24 -04:00
ccurme	06d8754b0b	release(core): 0.3.73 (#32446 )	2025-08-07 09:03:53 -04:00
ccurme	6e108c1cb4	feat(core): zero-out token costs for cache hits (#32437 )	2025-08-07 08:49:34 -04:00
John Bledsoe	bc4251b9e0	fix(core): fix index checking when merging lists (#32431 ) Description: fix an issue I discovered when attempting to merge messages in which one message has an `index` key in its content dictionary and another does not.	2025-08-06 12:47:33 -04:00
Mason Daugherty	ba83f58141	release(groq): 0.3.7 (#32417 )	2025-08-05 15:13:08 -04:00
Mason Daugherty	fb490b0c39	feat(groq): loosen restrictions on `reasoning_effort`, inject effort in meta, update tests (#32415 )	2025-08-05 15:03:38 -04:00
Mason Daugherty	419c173225	feat(groq): openai-oss (#32411 ) use new openai-oss for integration tests, set module-level testing model names and improve robustness of tool tests	2025-08-05 14:18:56 -04:00
Narasimha Badrinath	dd9f5d7cde	feat(docs): add langchain-gradientai as provider (#32202 ) langchain-gradientai is DigitalOcean's integration with LangChain. It helps users build LangChain applications on DigitalOcean's GradientAI platform. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-04 14:57:59 +00:00
ccurme	a9e52ca605	chore(openai): bump openai sdk (#32322 )	2025-07-30 10:58:18 -04:00
Mason Daugherty	fbd5a238d8	fix(core): revert "fix: tool call streaming bug with inconsistent indices from Qwen3" (#32307 ) Reverts langchain-ai/langchain#32160 Original issue stems from using `ChatOpenAI` to interact with a `qwen` model. Recommended to use [langchain-qwq](https://python.langchain.com/docs/integrations/chat/qwq/) which is built for Qwen	2025-07-29 10:26:38 -04:00
Mason Daugherty	0e287763cd	fix: lint	2025-07-28 18:49:43 -04:00
Copilot	0b56c1bc4b	fix: tool call streaming bug with inconsistent indices from Qwen3 (#32160 ) Fixes a streaming bug where models like Qwen3 (using OpenAI interface) send tool call chunks with inconsistent indices, resulting in duplicate/erroneous tool calls instead of a single merged tool call. ## Problem When Qwen3 streams tool calls, it sends chunks with inconsistent `index` values: - First chunk: `index=1` with tool name and partial arguments - Subsequent chunks: `index=0` with `name=None`, `id=None` and argument continuation The existing `merge_lists` function only merges chunks when their `index` values match exactly, causing these logically related chunks to remain separate, resulting in multiple incomplete tool calls instead of one complete tool call. ```python # Before fix: Results in 1 valid + 1 invalid tool call chunk1 = AIMessageChunk(tool_call_chunks=[ {"name": "search", "args": '{"query":', "id": "call_123", "index": 1} ]) chunk2 = AIMessageChunk(tool_call_chunks=[ {"name": None, "args": ' "test"}', "id": None, "index": 0} ]) merged = chunk1 + chunk2 # Creates 2 separate tool calls # After fix: Results in 1 complete tool call merged = chunk1 + chunk2 # Creates 1 merged tool call: search({"query": "test"}) ``` ## Solution Enhanced the `merge_lists` function in `langchain_core/utils/_merge.py` with intelligent tool call chunk merging: 1. Preserves existing behavior: Same-index chunks still merge as before 2. Adds special handling: Tool call chunks with `name=None`/`id=None` that don't match any existing index are now merged with the most recent complete tool call chunk 3. Maintains backward compatibility: All existing functionality works unchanged 4. Targeted fix: Only affects tool call chunks, doesn't change behavior for other list items The fix specifically handles the pattern where: - A continuation chunk has `name=None` and `id=None` (indicating it's part of an ongoing tool call) - No matching index is found in existing chunks - There exists a recent tool call chunk with a valid name or ID to merge with ## Testing Added comprehensive test coverage including: - ✅ Qwen3-style chunks with different indices now merge correctly - ✅ Existing same-index behavior preserved - ✅ Multiple distinct tool calls remain separate - ✅ Edge cases handled (empty chunks, orphaned continuations) - ✅ Backward compatibility maintained Fixes #31511. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-28 22:31:41 +00:00
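The merging rule described in 0b56c1bc4b (reverted by fbd5a238d8) can be sketched on plain dictionaries. This is only an illustration of the described behavior, not the `merge_lists` code in `langchain_core`; `merge_tool_call_chunks` is a hypothetical helper name.

```python
from typing import Any


def merge_tool_call_chunks(chunks: list[dict[str, Any]]) -> list[dict[str, Any]]:
    """Hypothetical helper: fold streamed chunks into complete tool calls."""
    merged: list[dict[str, Any]] = []
    for chunk in chunks:
        # Rule 1: chunks that share an index are merged, as before.
        target = next((m for m in merged if m["index"] == chunk["index"]), None)
        # Rule 2: a continuation chunk (no name, no id) with no matching index
        # is attached to the most recent chunk that has a name or an id.
        if target is None and chunk["name"] is None and chunk["id"] is None:
            target = next((m for m in reversed(merged) if m["name"] or m["id"]), None)
        if target is None:
            merged.append(dict(chunk))  # start a new tool call
        else:
            target["args"] += chunk["args"]
            target["name"] = target["name"] or chunk["name"]
            target["id"] = target["id"] or chunk["id"]
    return merged


# Qwen3-style stream from the commit message: the index changes mid-call.
qwen_style = [
    {"name": "search", "args": '{"query":', "id": "call_123", "index": 1},
    {"name": None, "args": ' "test"}', "id": None, "index": 0},
]
result = merge_tool_call_chunks(qwen_style)

# Two named chunks with different indices stay separate.
distinct = merge_tool_call_chunks([
    {"name": "search", "args": "{}", "id": "call_a", "index": 0},
    {"name": "lookup", "args": "{}", "id": "call_b", "index": 1},
])
```

Attaching orphaned continuations to the latest named chunk is a heuristic, which may be part of why the change was reverted in favor of a provider-specific integration.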
Copilot	ad88e5aaec	fix(core): resolve cache validation error by safely converting Generation to ChatGeneration objects (#32156 ) ## Problem ChatLiteLLM encounters a `ValidationError` when using cache on subsequent calls, causing the following error: ``` ValidationError(model='ChatResult', errors=[{'loc': ('generations', 0, 'type'), 'msg': "unexpected value; permitted: 'ChatGeneration'", 'type': 'value_error.const', 'ctx': {'given': 'Generation', 'permitted': ('ChatGeneration',)}}]) ``` This occurs because: 1. The cache stores `Generation` objects (with `type="Generation"`) 2. But `ChatResult` expects `ChatGeneration` objects (with `type="ChatGeneration"` and a required `message` field) 3. When cached values are retrieved, validation fails due to the type mismatch ## Solution Added graceful handling in both sync (`_generate_with_cache`) and async (`_agenerate_with_cache`) cache methods to: 1. Detect when cached values contain `Generation` objects instead of expected `ChatGeneration` objects 2. Convert them to `ChatGeneration` objects by wrapping the text content in an `AIMessage` 3. Preserve all original metadata (`generation_info`) 4. Allow `ChatResult` creation to succeed without validation errors ## Example ```python # Before: This would fail with ValidationError from langchain_community.chat_models import ChatLiteLLM from langchain_community.cache import SQLiteCache from langchain.globals import set_llm_cache set_llm_cache(SQLiteCache(database_path="cache.db")) llm = ChatLiteLLM(model_name="openai/gpt-4o", cache=True, temperature=0) print(llm.predict("test")) # Works fine (cache empty) print(llm.predict("test")) # Now works instead of ValidationError # After: Seamlessly handles both Generation and ChatGeneration objects ``` ## Changes - `libs/core/langchain_core/language_models/chat_models.py`: - Added `Generation` import from `langchain_core.outputs` - Enhanced cache retrieval logic in `_generate_with_cache` and `_agenerate_with_cache` methods - Added conversion from `Generation` to `ChatGeneration` objects when needed - `libs/core/tests/unit_tests/language_models/chat_models/test_cache.py`: - Added test case to validate the conversion logic handles mixed object types ## Impact - Backward Compatible: Existing code continues to work unchanged - Minimal Change: Only affects cache retrieval path, no API changes - Robust: Handles both legacy cached `Generation` objects and new `ChatGeneration` objects - Preserves Data: All original content and metadata is maintained during conversion Fixes #22389. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-28 22:28:16 +00:00
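The conversion step described in ad88e5aaec can be sketched as follows. The dataclasses are simplified stand-ins for the real `langchain_core` classes, defined here only so the example runs without dependencies, and `to_chat_generations` is a hypothetical helper name rather than a function from the commit.

```python
from dataclasses import dataclass
from typing import Any, Optional


# Simplified stand-ins for illustration; the real classes live in langchain_core.
@dataclass
class AIMessage:
    content: str


@dataclass
class Generation:
    text: str
    generation_info: Optional[dict[str, Any]] = None


@dataclass
class ChatGeneration:
    message: AIMessage
    generation_info: Optional[dict[str, Any]] = None


def to_chat_generations(cached: list[Any]) -> list[ChatGeneration]:
    """Hypothetical helper: upgrade cached Generation entries to ChatGeneration."""
    converted: list[ChatGeneration] = []
    for gen in cached:
        if isinstance(gen, ChatGeneration):
            converted.append(gen)  # already the expected type
        else:
            # Wrap the plain text in an AIMessage and keep the metadata.
            converted.append(
                ChatGeneration(
                    message=AIMessage(content=gen.text),
                    generation_info=gen.generation_info,
                )
            )
    return converted


cached_values = [
    Generation(text="hello", generation_info={"finish_reason": "stop"}),
    ChatGeneration(message=AIMessage(content="world")),
]
upgraded = to_chat_generations(cached_values)
```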
Mason Daugherty	b7e4797e8b	release(anthropic): 0.3.18 (#32292 )	2025-07-28 17:07:11 -04:00
Mason Daugherty	3a487bf720	refactor(anthropic): AnthropicLLM to use Messages API (#32290 ) re: #32189	2025-07-28 16:22:58 -04:00
Mason Daugherty	8db16b5633	fix: use new Google model names in examples (#32288 )	2025-07-28 19:03:42 +00:00
Mason Daugherty	e79e0bd6b4	fix(openai): add `max_retries` parameter to ChatOpenAI for handling 503 capacity errors (#32286 ) Some integration tests were failing	2025-07-28 13:58:23 -04:00
ccurme	c55294ecb0	chore(core): add test for nested pydantic fields in schemas (#32285 )	2025-07-28 17:27:24 +00:00
Mason Daugherty	7a26c3d233	fix: update `bar_model` to use the correct model version `claude-3-7-sonnet-20250219` (#32284 )	2025-07-28 12:57:40 -04:00
Mason Daugherty	a07d2c5016	refactor: remove references to unsupported model `claude-3-sonnet-20240229` (#32281 ) Addresses some (but not all) test issues brought about in #32280	2025-07-28 11:57:43 -04:00
Aleksandr Filippov	f0b6baa0ef	fix(core): track within-batch deduplication in indexing num_skipped count (#32273 ) Description: Fixes incorrect `num_skipped` count in the LangChain indexing API. The current implementation only counts documents that already exist in RecordManager (cross-batch duplicates) but fails to count documents removed during within-batch deduplication via `_deduplicate_in_order()`. This PR adds tracking of the original batch size before deduplication and includes the difference in `num_skipped`, ensuring that `num_added + num_skipped` equals the total number of input documents. Issue: Fixes incorrect document count reporting in indexing statistics Dependencies: None Fixes #32272 --------- Co-authored-by: Alex Feel <afilippov@spotware.com>	2025-07-28 09:58:51 -04:00
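The counting rule in f0b6baa0ef can be sketched with plain strings standing in for documents. This assumes a set plays the role of the RecordManager; `index_batch` and `deduplicate_in_order` are illustrative names, not the LangChain indexing API.

```python
def deduplicate_in_order(items: list[str]) -> list[str]:
    """Drop repeated items while keeping first-seen order."""
    seen: set[str] = set()
    unique: list[str] = []
    for item in items:
        if item not in seen:
            seen.add(item)
            unique.append(item)
    return unique


def index_batch(batch: list[str], already_indexed: set[str]) -> dict[str, int]:
    """Hypothetical indexing step that counts both kinds of skipped documents."""
    original_size = len(batch)
    unique = deduplicate_in_order(batch)
    # Within-batch duplicates: the part that was previously not counted.
    num_skipped = original_size - len(unique)
    num_added = 0
    for doc in unique:
        if doc in already_indexed:
            num_skipped += 1  # cross-batch duplicate, already recorded
        else:
            already_indexed.add(doc)
            num_added += 1
    return {"num_added": num_added, "num_skipped": num_skipped}


stats = index_batch(["a", "b", "a", "c", "b"], already_indexed={"c"})
```

With this accounting, `num_added + num_skipped` equals the number of input documents, which is the invariant the commit message states.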
Mason Daugherty	12c0e9b7d8	fix(docs): local API reference documentation build (#32271 ) ensure all relevant packages are correctly processed - cli wasn't included, also fix ValueError	2025-07-28 00:50:20 -04:00
Mason Daugherty	96cbd90cba	fix: formatting issues in docstrings (#32265 ) Ensures proper reStructuredText formatting by adding the required blank line before closing docstring quotes, which resolves the "Block quote ends without a blank line; unexpected unindent" warning.	2025-07-27 23:37:47 -04:00
Mason Daugherty	c6cb1fae61	fix: devcontainer (#32260 )	2025-07-27 20:24:16 -04:00
Christophe Bornet	efdfa00d10	chore(langchain): add ruff rules ARG (#32110 ) See https://docs.astral.sh/ruff/rules/#flake8-unused-arguments-arg Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-26 18:32:34 -04:00
Christophe Bornet	a2ad5aca41	chore(langchain): add ruff rules TC (#31921 ) See https://docs.astral.sh/ruff/rules/#flake8-type-checking-tc	2025-07-26 18:27:26 -04:00
Mason Daugherty	f624ad489a	feat(docs): improve devx, fix `Makefile` targets (#32237 ) TL;DR many of the provided `Makefile` targets were broken, and any time I wanted to preview changes locally I either had to refer to a command Chester gave me or try waiting on a Vercel preview deployment. With this PR, everything should behave like normal. Significant updates to the `Makefile` and documentation files, focusing on improving usability, adding clear messaging, and fixing/enhancing documentation workflows. ### Updates to `Makefile`: #### Enhanced build and cleaning processes: - Added informative messages (e.g., "📚 Building LangChain documentation...") to makefile targets like `docs_build`, `docs_clean`, and `api_docs_build` for better user feedback during execution. - Introduced a `clean-cache` target to the `docs` `Makefile` to clear cached dependencies and ensure clean builds. #### Improved dependency handling: - Modified `install-py-deps` to create a `.venv/deps_installed` marker, preventing redundant/duplicate dependency installations and improving efficiency. #### Streamlined file generation and infrastructure setup: - Added caching for the LangServe README download and parallelized feature table generation - Added user-friendly completion messages for targets like `copy-infra` and `render`. #### Documentation server updates: - Enhanced the `start` target with messages indicating server start and URL for local documentation viewing. --- ### Documentation Improvements: #### Content clarity and consistency: - Standardized section titles for consistency across documentation files. - Refined phrasing and formatting in sections like "Dependency management" and "Formatting and linting" for better readability. #### Enhanced workflows: - Updated instructions for building and viewing documentation locally, including tips for specifying server ports and handling API reference previews. - Expanded guidance on cleaning documentation artifacts and using linting tools effectively. #### API reference documentation: - Improved instructions for generating and formatting in-code documentation, highlighting best practices for docstring writing. --- ### Minor Changes: - Added support for a new package name (`langchain_v1`) in the API documentation generation script. - Fixed minor capitalization and formatting issues in documentation files. --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-25 14:49:03 -04:00
Christophe Bornet	12ae42c5e9	chore(langchain): add ruff rules D1 (except D100 and D104) (#32123 )	2025-07-25 11:59:48 -04:00
Christophe Bornet	e1238b8085	chore(langchain): add ruff rules SLF (#32112 ) See https://docs.astral.sh/ruff/rules/private-member-access/	2025-07-25 11:56:40 -04:00
Chaitanya varma	8f5ec20ccf	chore(langchain): `strip_ansi` function to remove ANSI escape sequences (#32200 ) Description: Fixes a bug in the file callback test where ANSI escape codes were causing test failures. The improved test now properly handles ANSI escape sequences by: - Using exact string comparison instead of substring checking - Applying the `strip_ansi` function consistently to all file contents - Adding descriptive assertion messages - Maintaining test coverage and backward compatibility The changes ensure tests pass reliably even when terminal control sequences are present in the output. Issue: Fixes #32150 Dependencies: None required - uses existing dependencies only. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-07-25 15:53:19 +00:00
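A helper of the kind named in 8f5ec20ccf might look like the sketch below. The regular expression is a commonly used pattern for ANSI escape sequences and is an assumption here; the body of the `strip_ansi` added in the commit is not shown in this log.

```python
import re

# Matches two-character escapes and CSI sequences such as "\x1b[31m".
ANSI_ESCAPE = re.compile(r"\x1B(?:[@-Z\\-_]|\[[0-?]*[ -/]*[@-~])")


def strip_ansi(text: str) -> str:
    """Remove ANSI escape sequences so file contents can be compared exactly."""
    return ANSI_ESCAPE.sub("", text)


colored = "\x1b[1;32mok\x1b[0m done"
plain = strip_ansi(colored)
```

Stripping the sequences first is what makes the exact string comparison described in the commit practical, since the expected text no longer depends on terminal styling.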
niceg	0d6f915442	fix: LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. (#32222 ) Description: This PR fixes an issue where the LLM would mimic Unicode escape sequences because non-ASCII characters in tool calls were forcibly converted to `\uXXXX` escapes. The fix disables the `ensure_ascii` flag in `json.dumps()` when converting tool calls to OpenAI format. Issue: input: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "你好啊集团"}'}}]} ``` output: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "\\u4f60\\u597d\\u554a\\u96c6\\u56e2"}'}}]} ``` The LLM then mimics the escaped Unicode in its own output. The escape sequences lengthen LLM responses, leading to slower performance. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 17:01:31 -04:00
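The effect of the `ensure_ascii` flag mentioned in 0d6f915442 can be reproduced with the standard library alone. The sketch below uses the argument payload from the commit message and does not touch any LangChain code.

```python
import json

arguments = {"customer_name": "你好啊集团"}

# Default behavior: non-ASCII characters become \uXXXX escape sequences.
escaped = json.dumps(arguments)

# With ensure_ascii=False the characters are written as-is.
readable = json.dumps(arguments, ensure_ascii=False)

print(escaped)
print(readable)
```

Both strings decode to the same object with `json.loads`, so the flag only changes the text the model sees, not the data.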
Mason Daugherty	d53ebf367e	fix(docs): capitalization, codeblock formatting, and hyperlinks, note blocks (#32235 ) widespread cleanup attempt	2025-07-24 16:55:04 -04:00
Copilot	54542b9385	docs(openai): add comprehensive documentation and examples for `extra_body` + others (#32149 ) This PR addresses the common issue where users struggle to pass custom parameters to OpenAI-compatible APIs like LM Studio, vLLM, and others. The problem occurs when users try to use `model_kwargs` for custom parameters, which causes API errors. ## Problem Users attempting to pass custom parameters (like LM Studio's `ttl` parameter) were getting errors: ```python # ❌ This approach fails llm = ChatOpenAI( base_url="http://localhost:1234/v1", model="mlx-community/QwQ-32B-4bit", model_kwargs={"ttl": 5} # Causes TypeError: unexpected keyword argument 'ttl' ) ``` ## Solution The `extra_body` parameter is the correct way to pass custom parameters to OpenAI-compatible APIs: ```python # ✅ This approach works correctly llm = ChatOpenAI( base_url="http://localhost:1234/v1", model="mlx-community/QwQ-32B-4bit", extra_body={"ttl": 5} # Custom parameters go in extra_body ) ``` ## Changes Made 1. Enhanced Documentation: Updated the `extra_body` parameter docstring with comprehensive examples for LM Studio, vLLM, and other providers 2. Added Documentation Section: Created a new "OpenAI-compatible APIs" section in the main class docstring with practical examples 3. Unit Tests: Added tests to verify `extra_body` functionality works correctly: - `test_extra_body_parameter()`: Verifies custom parameters are included in request payload - `test_extra_body_with_model_kwargs()`: Ensures `extra_body` and `model_kwargs` work together 4. Clear Guidance: Documented when to use `extra_body` vs `model_kwargs` ## Examples Added LM Studio with TTL (auto-eviction): ```python ChatOpenAI( base_url="http://localhost:1234/v1", api_key="lm-studio", model="mlx-community/QwQ-32B-4bit", extra_body={"ttl": 300} # Auto-evict after 5 minutes ) ``` vLLM with custom sampling: ```python ChatOpenAI( base_url="http://localhost:8000/v1", api_key="EMPTY", model="meta-llama/Llama-2-7b-chat-hf", extra_body={ "use_beam_search": True, "best_of": 4 } ) ``` ## Why This Works - `model_kwargs` parameters are passed directly to the OpenAI client's `create()` method, causing errors for non-standard parameters - `extra_body` parameters are included in the HTTP request body, which is exactly what OpenAI-compatible APIs expect for custom parameters Fixes #32115. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 16:43:16 -04:00
Christophe Bornet	0b34be4ce5	refactor(langchain): refactor unit test stub classes (#32209 ) See https://github.com/langchain-ai/langchain/pull/32098#discussion_r2225961563	2025-07-24 11:05:56 -04:00
Eugene Yurtsev	7995c719c5	chore(langchain_v1): clean anything uncertain (#32228 ) Further clean up of namespace: - Removed prompts (we'll re-add in a separate commit) - Remove LocalFileStore until we can review whether all the implementation details are necessary - Remove message processing logic from memory (we'll figure out where to expose it) - Remove `Tool` primitive (should be sufficient to use `BaseTool` for typing purposes) - Remove utilities to create kv stores. Unclear if they've had much usage outside MultiparentRetriever	2025-07-24 14:41:05 +00:00
Mason Daugherty	bdf1cd383c	fix(langchain): update deps	2025-07-24 10:37:08 -04:00
Mason Daugherty	77c981999e	fix(text-splitters): update langchain-core version to 0.3.72	2025-07-24 10:35:07 -04:00
Mason Daugherty	7f015b6f14	fix(text-splitters): update lock for release	2025-07-24 10:32:04 -04:00
Mason Daugherty	0e139fb9a6	release(langchain): 0.3.27 (#32227 )	2025-07-24 10:20:20 -04:00
tanwirahmad	622bb05751	fix(langchain): class HTMLSemanticPreservingSplitter ignores the text inside the div tag (#32213 ) Description: We collect the text from the "html", "body", "div", and "main" nodes, if they have any. Issue: Fixes #32206.	2025-07-24 10:09:03 -04:00


7386 Commits