langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-07-16 17:26:50 +00:00

Author	SHA1	Message	Date
Copilot	ad88e5aaec	fix(core): resolve cache validation error by safely converting Generation to ChatGeneration objects (#32156 )

## Problem

ChatLiteLLM encounters a `ValidationError` when using cache on subsequent calls, causing the following error:

```
ValidationError(model='ChatResult', errors=[{'loc': ('generations', 0, 'type'), 'msg': "unexpected value; permitted: 'ChatGeneration'", 'type': 'value_error.const', 'ctx': {'given': 'Generation', 'permitted': ('ChatGeneration',)}}])
```

This occurs because:

1. The cache stores `Generation` objects (with `type="Generation"`)
2. But `ChatResult` expects `ChatGeneration` objects (with `type="ChatGeneration"` and a required `message` field)
3. When cached values are retrieved, validation fails due to the type mismatch

## Solution

Added graceful handling in both sync (`_generate_with_cache`) and async (`_agenerate_with_cache`) cache methods to:

1. Detect when cached values contain `Generation` objects instead of expected `ChatGeneration` objects
2. Convert them to `ChatGeneration` objects by wrapping the text content in an `AIMessage`
3. Preserve all original metadata (`generation_info`)
4. Allow `ChatResult` creation to succeed without validation errors

## Example

```python
# Before: This would fail with ValidationError
from langchain_community.chat_models import ChatLiteLLM
from langchain_community.cache import SQLiteCache
from langchain.globals import set_llm_cache

set_llm_cache(SQLiteCache(database_path="cache.db"))

llm = ChatLiteLLM(model_name="openai/gpt-4o", cache=True, temperature=0)
print(llm.predict("test"))  # Works fine (cache empty)
print(llm.predict("test"))  # Now works instead of ValidationError

# After: Seamlessly handles both Generation and ChatGeneration objects
```

## Changes

- `libs/core/langchain_core/language_models/chat_models.py`:
  - Added `Generation` import from `langchain_core.outputs`
  - Enhanced cache retrieval logic in `_generate_with_cache` and `_agenerate_with_cache` methods
  - Added conversion from `Generation` to `ChatGeneration` objects when needed
- `libs/core/tests/unit_tests/language_models/chat_models/test_cache.py`:
  - Added test case to validate the conversion logic handles mixed object types

## Impact

- Backward Compatible: Existing code continues to work unchanged
- Minimal Change: Only affects cache retrieval path, no API changes
- Robust: Handles both legacy cached `Generation` objects and new `ChatGeneration` objects
- Preserves Data: All original content and metadata is maintained during conversion

Fixes #22389.

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>
Co-authored-by: Mason Daugherty <github@mdrxy.com>
Co-authored-by: Mason Daugherty <mason@langchain.dev>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-28 22:28:16 +00:00
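The conversion described in ad88e5aaec can be sketched in isolation. This is a minimal illustration under stated assumptions, not the code from `chat_models.py`: `AIMessage`, `Generation`, and `ChatGeneration` below are simplified stand-in dataclasses rather than the `langchain_core` classes, and `to_chat_generations` is a hypothetical helper name.

```python
from dataclasses import dataclass
from typing import Optional


# Simplified stand-ins for the langchain_core classes (assumptions for illustration).
@dataclass
class AIMessage:
    content: str


@dataclass
class Generation:
    text: str
    generation_info: Optional[dict] = None


@dataclass
class ChatGeneration:
    message: AIMessage
    generation_info: Optional[dict] = None

    @property
    def text(self) -> str:
        return self.message.content


def to_chat_generations(cached: list) -> list:
    """Hypothetical helper: upgrade plain Generation entries found in a cache hit."""
    converted = []
    for gen in cached:
        if isinstance(gen, ChatGeneration):
            # Already the expected type, keep it untouched.
            converted.append(gen)
        else:
            # Wrap the cached text in an AIMessage and carry the metadata over.
            converted.append(
                ChatGeneration(
                    message=AIMessage(content=gen.text),
                    generation_info=gen.generation_info,
                )
            )
    return converted


cached_value = [
    Generation(text="hello", generation_info={"finish_reason": "stop"}),
    ChatGeneration(message=AIMessage(content="world")),
]
result = to_chat_generations(cached_value)
```

Handling mixed lists entry by entry mirrors the commit's claim that both legacy `Generation` objects and new `ChatGeneration` objects are accepted.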
Mason Daugherty	30e3ed6a19	fix: add space in run-name for better readability	2025-07-28 17:46:27 -04:00
Mason Daugherty	8641a95c43	fix: update run-name in `scheduled_test.yml` to include dynamic inputs	2025-07-28 17:45:05 -04:00
Mason Daugherty	df70c5c186	chore: update actions run-names and add default inputs (#32293 )	2025-07-28 17:33:27 -04:00
Mason Daugherty	d5ca77e065	fix: remove erroneous rocket emoji in `run-name`	2025-07-28 17:11:14 -04:00
Mason Daugherty	b7e4797e8b	release(anthropic): 0.3.18 (#32292 ) langchain-anthropic==0.3.18	2025-07-28 17:07:11 -04:00
Mason Daugherty	3a487bf720	refactor(anthropic): AnthropicLLM to use Messages API (#32290 ) re: #32189	2025-07-28 16:22:58 -04:00
Mason Daugherty	e5fd67024c	fix: update link text for reporting security vulnerabilities in SECURITY.md	2025-07-28 15:05:31 -04:00
Mason Daugherty	b86841ac40	fix: update alt attribute for GitHub Codespace badge in README	2025-07-28 15:04:57 -04:00
Mason Daugherty	8db16b5633	fix: use new Google model names in examples (#32288 )	2025-07-28 19:03:42 +00:00
Mason Daugherty	6f10160a45	fix: `scripts/` errors	2025-07-28 15:03:25 -04:00
Mason Daugherty	e79e0bd6b4	fix(openai): add `max_retries` parameter to ChatOpenAI for handling 503 capacity errors (#32286 ) Some integration tests were failing	2025-07-28 13:58:23 -04:00
ccurme	c55294ecb0	chore(core): add test for nested pydantic fields in schemas (#32285 )	2025-07-28 17:27:24 +00:00
Mason Daugherty	7a26c3d233	fix: update `bar_model` to use the correct model version `claude-3-7-sonnet-20250219` (#32284 )	2025-07-28 12:57:40 -04:00
Mason Daugherty	c6ffac3ce0	refactor: mdx lint (#32282 )	2025-07-28 12:56:22 -04:00
Mason Daugherty	a07d2c5016	refactor: remove references to unsupported model `claude-3-sonnet-20240229` (#32281 ) Addresses some (but not all) test issues brought about in #32280	2025-07-28 11:57:43 -04:00
Aleksandr Filippov	f0b6baa0ef	fix(core): track within-batch deduplication in indexing num_skipped count (#32273 ) Description: Fixes incorrect `num_skipped` count in the LangChain indexing API. The current implementation only counts documents that already exist in RecordManager (cross-batch duplicates) but fails to count documents removed during within-batch deduplication via `_deduplicate_in_order()`. This PR adds tracking of the original batch size before deduplication and includes the difference in `num_skipped`, ensuring that `num_added + num_skipped` equals the total number of input documents. Issue: Fixes incorrect document count reporting in indexing statistics Dependencies: None Fixes #32272 --------- Co-authored-by: Alex Feel <afilippov@spotware.com>	2025-07-28 09:58:51 -04:00
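The accounting change in f0b6baa0ef can be illustrated with a small sketch. It assumes documents are plain strings and that an in-memory set stands in for the RecordManager; `deduplicate_in_order` and `index_batch` are hypothetical names, and the real indexing API also reports other counters that are omitted here.

```python
def deduplicate_in_order(docs: list) -> list:
    """Drop repeated documents while keeping first-seen order."""
    seen = set()
    unique = []
    for doc in docs:
        if doc not in seen:
            seen.add(doc)
            unique.append(doc)
    return unique


def index_batch(docs: list, already_indexed: set) -> dict:
    """Hypothetical indexing step that counts both kinds of skipped documents."""
    original_size = len(docs)
    unique = deduplicate_in_order(docs)
    # Within-batch duplicates: removed before the record lookup ever happens.
    num_skipped = original_size - len(unique)
    num_added = 0
    for doc in unique:
        if doc in already_indexed:
            # Cross-batch duplicate: already known to the record store.
            num_skipped += 1
        else:
            already_indexed.add(doc)
            num_added += 1
    return {"num_added": num_added, "num_skipped": num_skipped}


stats = index_batch(["a", "b", "a", "c", "b"], already_indexed={"c"})
print(stats)
```

With five input documents, two within-batch repeats, and one document already indexed, the two counters sum to the input size, which is the invariant the commit message describes.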
Mason Daugherty	12c0e9b7d8	fix(docs): local API reference documentation build (#32271 ) ensure all relevant packages are correctly processed - cli wasn't included, also fix ValueError	2025-07-28 00:50:20 -04:00
Mason Daugherty	ed682ae62d	fix: explicitly tell uv to copy when using devcontainer (#32267 )	2025-07-28 00:01:06 -04:00
Mason Daugherty	caf1919217	fix: devcontainer to use volume to store the workspace (#32266 ) should resolve the file sharing issue for users on macOS.	2025-07-27 23:43:06 -04:00
Mason Daugherty	904066f1ec	feat: add VSCode configuration files for Python development (#32263 )	2025-07-27 23:37:59 -04:00
Mason Daugherty	96cbd90cba	fix: formatting issues in docstrings (#32265 ) Ensures proper reStructuredText formatting by adding the required blank line before closing docstring quotes, which resolves the "Block quote ends without a blank line; unexpected unindent" warning.	2025-07-27 23:37:47 -04:00
Mason Daugherty	a8a2cff129	Merge branch 'master' of github.com:langchain-ai/langchain	2025-07-27 23:34:59 -04:00
Mason Daugherty	f4ff4514ef	fix: update workspace folder path in devcontainer configuration	2025-07-27 23:34:57 -04:00
Mason Daugherty	d1679cec91	chore: add .editorconfig for consistent coding styles across files (#32261 ) Following existing codebase conventions	2025-07-27 23:25:30 -04:00
Mason Daugherty	5295f2add0	fix: update dev container name to match service name	2025-07-27 22:30:16 -04:00
Mason Daugherty	5f5b87e9a3	fix: update service name in devcontainer configuration	2025-07-27 22:28:47 -04:00
Mason Daugherty	e0ef98dac0	feat: add markdownlint configuration file (#32264 )	2025-07-27 22:24:58 -04:00
Mason Daugherty	62212c7ee2	fix: update links in SECURITY.md to use markdown format	2025-07-27 21:54:25 -04:00
Mason Daugherty	9d38f170ce	refactor: enhance workflow names and descriptions for clarity (#32262 )	2025-07-27 21:31:59 -04:00
Mason Daugherty	c6cb1fae61	fix: devcontainer (#32260 )	2025-07-27 20:24:16 -04:00
Kanav Bansal	e42b1d23dc	docs(docs): update RAG tutorials link to point to correct path (#32256 ) - Description: This PR updates the internal documentation link for the RAG tutorials to reflect the updated path. Previously, the link pointed to the root `/docs/tutorials/`, which was generic. It now correctly routes to the RAG-specific tutorial page. - Issue: N/A - Dependencies: None - Twitter handle: N/A	2025-07-27 20:00:41 -04:00
Mason Daugherty	53d0bfe9cd	refactor: markdownlint (#32259 )	2025-07-27 20:00:16 -04:00
Mason Daugherty	eafab52483	refactor: markdownlint `SECURITY.md` (#32258 )	2025-07-27 19:55:25 -04:00
Christophe Bornet	efdfa00d10	chore(langchain): add ruff rules ARG (#32110 ) See https://docs.astral.sh/ruff/rules/#flake8-unused-arguments-arg Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-26 18:32:34 -04:00
Christophe Bornet	a2ad5aca41	chore(langchain): add ruff rules TC (#31921 ) See https://docs.astral.sh/ruff/rules/#flake8-type-checking-tc	2025-07-26 18:27:26 -04:00
Mason Daugherty	5ecbb5f277	fix(docs): temporary workaround until the underlying dependency issues in the AI21 package ecosystem are resolved. (#32248 )	2025-07-25 15:12:44 -04:00
Mason Daugherty	c1028171af	fix(docs): update protobuf version constraint to <5.0 in vercel_overrides.txt (#32247 )	2025-07-25 15:08:44 -04:00
ccurme	f6236d9f12	fix(infra): add pypdf to vercel overrides (#32242 ) > × No solution found when resolving dependencies: ╰─▶ Because only langchain-neo4j==0.5.0 is available and langchain-neo4j==0.5.0 depends on neo4j-graphrag>=1.9.0, we can conclude that all versions of langchain-neo4j depend on neo4j-graphrag>=1.9.0. And because only neo4j-graphrag<=1.9.0 is available and neo4j-graphrag==1.9.0 depends on pypdf>=5.1.0,<6.0.0, we can conclude that all versions of langchain-neo4j depend on pypdf>=5.1.0,<6.0.0. And because langchain-upstage==0.6.0 depends on pypdf>=4.2.0,<5.0.0 and only langchain-upstage==0.6.0 is available, we can conclude that all versions of langchain-neo4j and all versions of langchain-upstage are incompatible. And because you require langchain-neo4j and langchain-upstage, we can conclude that your requirements are unsatisfiable. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-25 15:05:21 -04:00
Mason Daugherty	df20f111a8	fix(docs): add validation for repository format and name in API docs build workflow (#32246 ) for build	2025-07-25 15:05:06 -04:00
Eugene Yurtsev	db22311094	ci(infra): no need for `.` in the regexp (#32245 ) No need for allowing `.`	2025-07-25 15:02:02 -04:00
Mason Daugherty	f624ad489a	feat(docs): improve devx, fix `Makefile` targets (#32237 ) TL;DR much of the provided `Makefile` targets were broken, and any time I wanted to preview changes locally I either had to refer to a command Chester gave me or try waiting on a Vercel preview deployment. With this PR, everything should behave like normal. Significant updates to the `Makefile` and documentation files, focusing on improving usability, adding clear messaging, and fixing/enhancing documentation workflows. ### Updates to `Makefile`: #### Enhanced build and cleaning processes: - Added informative messages (e.g., "📚 Building LangChain documentation...") to makefile targets like `docs_build`, `docs_clean`, and `api_docs_build` for better user feedback during execution. - Introduced a `clean-cache` target to the `docs` `Makefile` to clear cached dependencies and ensure clean builds. #### Improved dependency handling: - Modified `install-py-deps` to create a `.venv/deps_installed` marker, preventing redundant/duplicate dependency installations and improving efficiency. #### Streamlined file generation and infrastructure setup: - Added caching for the LangServe README download and parallelized feature table generation - Added user-friendly completion messages for targets like `copy-infra` and `render`. #### Documentation server updates: - Enhanced the `start` target with messages indicating server start and URL for local documentation viewing. --- ### Documentation Improvements: #### Content clarity and consistency: - Standardized section titles for consistency across documentation files. - Refined phrasing and formatting in sections like "Dependency management" and "Formatting and linting" for better readability. #### Enhanced workflows: - Updated instructions for building and viewing documentation locally, including tips for specifying server ports and handling API reference previews. - Expanded guidance on cleaning documentation artifacts and using linting tools effectively. #### API reference documentation: - Improved instructions for generating and formatting in-code documentation, highlighting best practices for docstring writing. --- ### Minor Changes: - Added support for a new package name (`langchain_v1`) in the API documentation generation script. - Fixed minor capitalization and formatting issues in documentation files. --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-25 14:49:03 -04:00
Eugene Yurtsev	549ecd3e78	chore(infra): harden api docs build workflow (#32243 ) Harden permissions for api docs build workflow	2025-07-25 14:40:20 -04:00
dishaprakash	a0671676ae	feat(docs): add PGVectorStore (#30950 ) Thank you for contributing to LangChain! - Adding documentation for PGVectorStore: docs: Adding documentation for the new PGVectorStore as a part of langchain-postgres - Add docs: The notebook for PGVectorStore is now added to the directory `docs/docs/integrations`. As a part of this change, we've also updated the VectorStore features table and VectorStoreTabs --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-07-25 13:22:58 -04:00
Christophe Bornet	12ae42c5e9	chore(langchain): add ruff rules D1 (except D100 and D104) (#32123 )	2025-07-25 11:59:48 -04:00
Christophe Bornet	e1238b8085	chore(langchain): add ruff rules SLF (#32112 ) See https://docs.astral.sh/ruff/rules/private-member-access/	2025-07-25 11:56:40 -04:00
Chaitanya varma	8f5ec20ccf	chore(langchain): `strip_ansi` function to remove ANSI escape sequences (#32200 ) Description: Fixes a bug in the file callback test where ANSI escape codes were causing test failures. The improved test now properly handles ANSI escape sequences by: - Using exact string comparison instead of substring checking - Applying the `strip_ansi` function consistently to all file contents - Adding descriptive assertion messages - Maintaining test coverage and backward compatibility The changes ensure tests pass reliably even when terminal control sequences are present in the output. Issue: Fixes #32150 Dependencies: None required - uses existing dependencies only. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-07-25 15:53:19 +00:00
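One way such a helper could look is sketched below. The regular expression is an assumption that covers CSI sequences (colors, bold, cursor movement); the `strip_ansi` actually added in 8f5ec20ccf may use a different pattern.

```python
import re

# Assumed pattern: ESC [ followed by parameter bytes, intermediate bytes, and a final byte.
ANSI_CSI = re.compile(r"\x1b\[[0-9;?]*[ -/]*[@-~]")


def strip_ansi(text: str) -> str:
    """Remove ANSI CSI escape sequences from text."""
    return ANSI_CSI.sub("", text)


colored = "\x1b[1m> Entering new chain...\x1b[0m"
cleaned = strip_ansi(colored)

# Exact comparison on the cleaned text, rather than a substring check on raw output.
assert cleaned == "> Entering new chain...", f"unexpected file contents: {cleaned!r}"
```

Comparing the cleaned string exactly, as the commit describes, catches stray control sequences that a substring check would let through.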
niceg	0d6f915442	fix: LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. (#32222 ) - Description: This PR fixes an issue where the LLM would mimic Unicode responses due to forced Unicode conversion of non-ASCII characters in tool calls. The fix involves disabling the `ensure_ascii` flag in `json.dumps()` when converting tool calls to OpenAI format. - Issue: Fixes the following behavior. Input: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "你好啊集团"}'}}]} ``` Output: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "\\u4f60\\u597d\\u554a\\u96c6\\u56e2"}'}}]} ``` Result: the LLM mimics the escaped form and outputs Unicode escape sequences itself, which lengthen its responses and slow performance. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 17:01:31 -04:00
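The effect of the `ensure_ascii` flag can be reproduced with the standard library alone. The sketch below reuses the `customer_name` value from the commit message; the variable names are illustrative only.

```python
import json

arguments = {"customer_name": "你好啊集团"}

# Default behavior (ensure_ascii=True): non-ASCII characters become \uXXXX escapes.
escaped = json.dumps(arguments)

# With ensure_ascii=False the characters are written as-is.
readable = json.dumps(arguments, ensure_ascii=False)

print(escaped)
print(readable)

# Both strings decode to the same data; only the serialized text differs.
assert json.loads(escaped) == json.loads(readable) == arguments
```

The escaped form is several times longer for the same payload, which is the extra text the model was observed to imitate.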
Mason Daugherty	d53ebf367e	fix(docs): capitalization, codeblock formatting, and hyperlinks, note blocks (#32235 ) widespread cleanup attempt	2025-07-24 16:55:04 -04:00
Copilot	54542b9385	docs(openai): add comprehensive documentation and examples for `extra_body` + others (#32149 )

This PR addresses the common issue where users struggle to pass custom parameters to OpenAI-compatible APIs like LM Studio, vLLM, and others. The problem occurs when users try to use `model_kwargs` for custom parameters, which causes API errors.

## Problem

Users attempting to pass custom parameters (like LM Studio's `ttl` parameter) were getting errors:

```python
# ❌ This approach fails
llm = ChatOpenAI(
    base_url="http://localhost:1234/v1",
    model="mlx-community/QwQ-32B-4bit",
    model_kwargs={"ttl": 5}  # Causes TypeError: unexpected keyword argument 'ttl'
)
```

## Solution

The `extra_body` parameter is the correct way to pass custom parameters to OpenAI-compatible APIs:

```python
# ✅ This approach works correctly
llm = ChatOpenAI(
    base_url="http://localhost:1234/v1",
    model="mlx-community/QwQ-32B-4bit",
    extra_body={"ttl": 5}  # Custom parameters go in extra_body
)
```

## Changes Made

1. Enhanced Documentation: Updated the `extra_body` parameter docstring with comprehensive examples for LM Studio, vLLM, and other providers
2. Added Documentation Section: Created a new "OpenAI-compatible APIs" section in the main class docstring with practical examples
3. Unit Tests: Added tests to verify `extra_body` functionality works correctly:
   - `test_extra_body_parameter()`: Verifies custom parameters are included in request payload
   - `test_extra_body_with_model_kwargs()`: Ensures `extra_body` and `model_kwargs` work together
4. Clear Guidance: Documented when to use `extra_body` vs `model_kwargs`

## Examples Added

LM Studio with TTL (auto-eviction):

```python
ChatOpenAI(
    base_url="http://localhost:1234/v1",
    api_key="lm-studio",
    model="mlx-community/QwQ-32B-4bit",
    extra_body={"ttl": 300}  # Auto-evict after 5 minutes
)
```

vLLM with custom sampling:

```python
ChatOpenAI(
    base_url="http://localhost:8000/v1",
    api_key="EMPTY",
    model="meta-llama/Llama-2-7b-chat-hf",
    extra_body={
        "use_beam_search": True,
        "best_of": 4
    }
)
```

## Why This Works

- `model_kwargs` parameters are passed directly to the OpenAI client's `create()` method, causing errors for non-standard parameters
- `extra_body` parameters are included in the HTTP request body, which is exactly what OpenAI-compatible APIs expect for custom parameters

Fixes #32115.

---------

Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>
Co-authored-by: Mason Daugherty <github@mdrxy.com>
Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 16:43:16 -04:00

13914 Commits