langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-01 09:04:03 +00:00

Author	SHA1	Message	Date
niceg	0d6f915442	fix: LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. (#32222 ) - Description: This PR fixes an issue where the LLM would mimic Unicode escape sequences because non-ASCII characters in tool calls were forcibly converted to escapes. The fix disables the `ensure_ascii` flag in `json.dumps()` when converting tool calls to OpenAI format. - Issue: Fixes the behavior shown below. Input: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "你好啊集团"}'}}]} ``` Output: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "\\u4f60\\u597d\\u554a\\u96c6\\u56e2"}'}}]} ``` The LLM then mimics the escaped output. Escape sequences take many more symbols than the original characters, which can lengthen LLM responses and slow them down. --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 17:01:31 -04:00
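The effect of `ensure_ascii` described in commit 0d6f915442 can be reproduced with the standard library alone. This is a minimal sketch, not the LangChain code path: the `arguments` dict is a hypothetical stand-in for a tool call's arguments, modeled on the example in the commit message.

```python
import json

# Hypothetical tool-call arguments, modeled on the commit message example.
arguments = {"customer_name": "你好啊集团"}

# Default behavior: ensure_ascii=True escapes every non-ASCII character.
escaped = json.dumps(arguments)

# With ensure_ascii=False the original characters are kept as-is.
readable = json.dumps(arguments, ensure_ascii=False)

print(escaped)
print(readable)

# Both strings decode to the same data; only the serialized form differs.
same_data = json.loads(escaped) == json.loads(readable)
```

The escaped form is considerably longer than the readable one, which is the length overhead the commit message refers to.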
Mason Daugherty	d53ebf367e	fix(docs): capitalization, codeblock formatting, and hyperlinks, note blocks (#32235 ) widespread cleanup attempt	2025-07-24 16:55:04 -04:00
Copilot	54542b9385	docs(openai): add comprehensive documentation and examples for `extra_body` + others (#32149 ) This PR addresses the common issue where users struggle to pass custom parameters to OpenAI-compatible APIs like LM Studio, vLLM, and others. The problem occurs when users try to use `model_kwargs` for custom parameters, which causes API errors. ## Problem Users attempting to pass custom parameters (like LM Studio's `ttl` parameter) were getting errors: ```python # ❌ This approach fails llm = ChatOpenAI( base_url="http://localhost:1234/v1", model="mlx-community/QwQ-32B-4bit", model_kwargs={"ttl": 5} # Causes TypeError: unexpected keyword argument 'ttl' ) ``` ## Solution The `extra_body` parameter is the correct way to pass custom parameters to OpenAI-compatible APIs: ```python # ✅ This approach works correctly llm = ChatOpenAI( base_url="http://localhost:1234/v1", model="mlx-community/QwQ-32B-4bit", extra_body={"ttl": 5} # Custom parameters go in extra_body ) ``` ## Changes Made 1. Enhanced Documentation: Updated the `extra_body` parameter docstring with comprehensive examples for LM Studio, vLLM, and other providers 2. Added Documentation Section: Created a new "OpenAI-compatible APIs" section in the main class docstring with practical examples 3. Unit Tests: Added tests to verify `extra_body` functionality works correctly: - `test_extra_body_parameter()`: Verifies custom parameters are included in request payload - `test_extra_body_with_model_kwargs()`: Ensures `extra_body` and `model_kwargs` work together 4. 
Clear Guidance: Documented when to use `extra_body` vs `model_kwargs` ## Examples Added LM Studio with TTL (auto-eviction): ```python ChatOpenAI( base_url="http://localhost:1234/v1", api_key="lm-studio", model="mlx-community/QwQ-32B-4bit", extra_body={"ttl": 300} # Auto-evict after 5 minutes ) ``` vLLM with custom sampling: ```python ChatOpenAI( base_url="http://localhost:8000/v1", api_key="EMPTY", model="meta-llama/Llama-2-7b-chat-hf", extra_body={ "use_beam_search": True, "best_of": 4 } ) ``` ## Why This Works - `model_kwargs` parameters are passed directly to the OpenAI client's `create()` method, causing errors for non-standard parameters - `extra_body` parameters are included in the HTTP request body, which is exactly what OpenAI-compatible APIs expect for custom parameters Fixes #32115. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 16:43:16 -04:00
Mason Daugherty	7d2a13f519	fix: various typos (#32231 )	2025-07-24 12:35:08 -04:00
Christophe Bornet	0b34be4ce5	refactor(langchain): refactor unit test stub classes (#32209 ) See https://github.com/langchain-ai/langchain/pull/32098#discussion_r2225961563	2025-07-24 11:05:56 -04:00
Mason Daugherty	6f3169eb49	chore: update copilot development guidelines for clarity and structure (#32230 )	2025-07-24 15:05:09 +00:00
Eugene Yurtsev	7995c719c5	chore(langchain_v1): clean anything uncertain (#32228 ) Further clean up of namespace: - Removed prompts (we'll re-add in a separate commit) - Remove LocalFileStore until we can review whether all the implementation details are necessary - Remove message processing logic from memory (we'll figure out where to expose it) - Remove `Tool` primitive (should be sufficient to use `BaseTool` for typing purposes) - Remove utilities to create kv stores. Unclear if they've had much usage outside MultiparentRetriever	2025-07-24 14:41:05 +00:00
Mason Daugherty	bdf1cd383c	fix(langchain): update deps	2025-07-24 10:37:08 -04:00
Mason Daugherty	77c981999e	fix(text-splitters): update langchain-core version to 0.3.72	2025-07-24 10:35:07 -04:00
Mason Daugherty	7f015b6f14	fix(text-splitters): update lock for release	2025-07-24 10:32:04 -04:00
Mason Daugherty	71ad451e1f	Merge branch 'master' of github.com:langchain-ai/langchain	2025-07-24 10:24:17 -04:00
Mason Daugherty	2c42893703	fix(langchain): update langchain-core version to 0.3.72	2025-07-24 10:24:04 -04:00
Mason Daugherty	0e139fb9a6	release(langchain): 0.3.27 (#32227 )	2025-07-24 10:20:20 -04:00
tanwirahmad	622bb05751	fix(langchain): class HTMLSemanticPreservingSplitter ignores the text inside the div tag (#32213 ) Description: We collect the text from the "html", "body", "div", and "main" nodes, if they have any. Issue: Fixes #32206.	2025-07-24 10:09:03 -04:00
Eugene Yurtsev	56dde3ade3	feat(langchain): v1 scaffolding (#32166 ) This PR adds scaffolding for langchain 1.0 entry package. Most contents have been removed. Currently remaining entrypoints for: * chat models * embedding models * memory -> trimming messages, filtering messages and counting tokens [we may remove this] * prompts -> we may remove some prompts * storage: primarily to support cache backed embeddings, may remove the kv store * tools -> report tool primitives Things to be added: * Selected agent implementations * Selected workflows * Common primitives: messages, Document * Primitives for type hinting: BaseChatModel, BaseEmbeddings * Selected retrievers * Selected text splitters Things to be removed: * Globals needs to be removed (needs an update in langchain core) Todos: * TBD indexing api (requires sqlalchemy which we don't want as a dependency) * Be explicit about public/private interfaces (e.g., likely rename chat_models.base.py to something more internal) * Remove dockerfiles * Update module doc-strings and README.md	2025-07-24 09:47:48 -04:00
Mason Daugherty	bd3d6496f3	release(core): 0.3.72 (#32214 ) fixes #32170	2025-07-23 20:33:48 -04:00
jmaillefaud	fb5da8384e	fix(core): Dereference Refs for pydantic schema fails in tool schema generation (#32203 ) The `_dereference_refs_helper` in `langchain_core.utils.json_schema` incorrectly handled objects with a reference and other fields. Issue: #32170 # Description We change the check so that it accepts other keys in the object.	2025-07-23 20:28:27 -04:00
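The case named in commit fb5da8384e, a JSON Schema object that carries a `$ref` together with other keys, can be sketched in plain Python. This is an illustrative assumption, not the `_dereference_refs_helper` implementation: `resolve_pointer` and `dereference` are hypothetical helpers, only local `#/...` references are handled, sibling keys are merged over the resolved target, and the schema is assumed to be acyclic.

```python
from typing import Any


def resolve_pointer(root: dict, ref: str) -> Any:
    """Follow a local reference such as '#/$defs/Name' (hypothetical helper)."""
    if not ref.startswith("#/"):
        raise ValueError(f"only local references are supported: {ref!r}")
    node: Any = root
    for part in ref[2:].split("/"):
        node = node[part]
    return node


def dereference(node: Any, root: dict) -> Any:
    """Inline every '$ref', keeping any sibling keys next to it.

    Assumes the schema has no reference cycles.
    """
    if isinstance(node, dict):
        ref = node.get("$ref")
        if isinstance(ref, str):
            target = dereference(resolve_pointer(root, ref), root)
            siblings = {k: dereference(v, root) for k, v in node.items() if k != "$ref"}
            # Sibling keys (e.g. a description) override the referenced ones.
            return {**target, **siblings}
        return {k: dereference(v, root) for k, v in node.items()}
    if isinstance(node, list):
        return [dereference(item, root) for item in node]
    return node


schema = {
    "$defs": {"Name": {"type": "string"}},
    "type": "object",
    "properties": {
        "name": {"$ref": "#/$defs/Name", "description": "Customer name"},
    },
}
resolved = dereference(schema, schema)
print(resolved["properties"]["name"])
```

Whether sibling keys should override or merely accompany the referenced definition is a design choice; the sketch lets siblings win.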
Maxime Grenu	a7d0e42f3f	docs: fix typos in documentation (#32201 ) ## Summary - Fixed redundant word "done" in SECURITY.md line 69 - Fixed grammar errors in Fireworks README.md line 77: "how it fares compares" → "how it compares" and "in terms just" → "in terms of" ## Test plan - [x] Verified changes improve readability and correct grammar - [x] No functional changes, documentation only 🤖 Generated with [Claude Code](https://claude.ai/code) Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude <noreply@anthropic.com>	2025-07-23 10:43:25 -04:00
Christophe Bornet	3496e1739e	feat(langchain): add ruff rules PL (#32079 ) See https://docs.astral.sh/ruff/rules/#pylint-pl	2025-07-22 23:55:32 -04:00
Jacob Lee	0f39155f62	docs: Specify environment variables for BedrockConverse (#32194 )	2025-07-22 17:37:47 -04:00
ccurme	6aeda24a07	docs(chroma): update feature table (#32193 ) Supports multi-tenancy.	2025-07-22 20:55:07 +00:00
Mason Daugherty	3ed804a5f3	fix(perplexity): undo xfails (#32192 )	2025-07-22 16:29:37 -04:00
Mason Daugherty	ca137bfe62	.	2025-07-22 16:25:02 -04:00
Mason Daugherty	fa487fb62d	fix(perplexity): temp xfail int tests (#32191 ) It appears the API has changed since the 2025-04-15 release, leading to failed integration tests.	2025-07-22 16:20:51 -04:00
ccurme	053fb16a05	revert: drop anthropic from core test matrix (#32190 ) Reverts langchain-ai/langchain#32185	2025-07-22 20:13:02 +00:00
ccurme	3672bbc71e	fix(anthropic): update integration test models (#32189 ) Multiple models were [retired](https://docs.anthropic.com/en/docs/about-claude/model-deprecations#model-status) yesterday. Tests remain broken until we figure out what to do with the legacy Anthropic LLM integration— currently uses their (legacy) text completions API, for which there appear to be no remaining supported models.	2025-07-22 19:51:39 +00:00
Mason Daugherty	a02ad3d192	docs: formatting cleanup (#32188 ) * formatting cleaning * make `init_chat_model` more prominent in list of guides	2025-07-22 15:46:15 -04:00
ccurme	0c4054a7fc	release(core): 0.3.71 (#32186 )	2025-07-22 15:44:36 -04:00
ccurme	75517c3ea9	chore(infra): drop anthropic from core test matrix (#32185 )	2025-07-22 19:38:58 +00:00
ccurme	ebf2e11bcb	fix(core): exclude api_key from tracing metadata (#32184 ) (standard param)	2025-07-22 15:32:12 -04:00
ccurme	e41e6ec6aa	release(chroma): 0.2.5 (#32183 )	2025-07-22 15:24:03 -04:00
itaismith	09769373b3	feat(chroma): Add Chroma Cloud support (#32125 ) * Adding support for more Chroma client options (`HttpClient` and `CloudClient`). This includes adding arguments necessary for instantiating these clients. * Adding support for Chroma's new persisted collection configuration (we moved index configuration into this new construct). * Delegate `Settings` configuration to Chroma's client constructors.	2025-07-22 15:14:15 -04:00
ccurme	3fc27e7a95	docs: update feature table for Chroma (#32182 )	2025-07-22 18:21:17 +00:00
ccurme	8acfd677bc	fix(core): add type key when tracing in some cases (#31825 )	2025-07-22 18:08:16 +00:00
Mason Daugherty	af3789b9ed	fix(deepseek): release openai version (#32181 ) used sdk version instead of langchain by accident	2025-07-22 13:29:52 -04:00
Mason Daugherty	a6896794ca	release(ollama): 0.3.6 (#32180 )	2025-07-22 13:24:17 -04:00
Copilot	d40fd5a3ce	feat(ollama): warn on empty `load` responses (#32161 ) ## Problem When using `ChatOllama` with `create_react_agent`, agents would sometimes terminate prematurely with empty responses when Ollama returned `done_reason: 'load'` responses with no content. This caused agents to return empty `AIMessage` objects instead of actual generated text. ```python from langchain_ollama import ChatOllama from langgraph.prebuilt import create_react_agent from langchain_core.messages import HumanMessage llm = ChatOllama(model='qwen2.5:7b', temperature=0) agent = create_react_agent(model=llm, tools=[]) result = agent.invoke(HumanMessage('Hello'), {"configurable": {"thread_id": "1"}}) # Before fix: AIMessage(content='', response_metadata={'done_reason': 'load'}) # Expected: AIMessage with actual generated content ``` ## Root Cause The `_iterate_over_stream` and `_aiterate_over_stream` methods treated any response with `done: True` as final, regardless of `done_reason`. When Ollama returns `done_reason: 'load'` with empty content, it indicates the model was loaded but no actual generation occurred - this should not be considered a complete response. ## Solution Modified the streaming logic to skip responses when: - `done: True` - `done_reason: 'load'` - Content is empty or contains only whitespace This ensures agents only receive actual generated content while preserving backward compatibility for load responses that do contain content. ## Changes - `_iterate_over_stream`: Skip empty load responses instead of yielding them - `_aiterate_over_stream`: Apply same fix to async streaming - Tests: Added comprehensive test cases covering all edge cases ## Testing All scenarios now work correctly: - ✅ Empty load responses are skipped (fixes original issue) - ✅ Load responses with actual content are preserved (backward compatibility) - ✅ Normal stop responses work unchanged - ✅ Streaming behavior preserved - ✅ `create_react_agent` integration fixed Fixes #31482. 
--------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-22 13:21:11 -04:00
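The skip condition described in commit d40fd5a3ce (`done: True`, `done_reason: 'load'`, and empty or whitespace-only content) can be sketched as a small generator. This is an assumption-laden illustration, not the `_iterate_over_stream` code: `skip_empty_load_responses` is a hypothetical name, and the chunk layout (a dict with `done`, `done_reason`, and `message.content`) is assumed for the example.

```python
from typing import Any, Dict, Iterable, Iterator


def skip_empty_load_responses(
    chunks: Iterable[Dict[str, Any]],
) -> Iterator[Dict[str, Any]]:
    """Yield stream chunks, dropping 'load' responses that carry no content."""
    for chunk in chunks:
        # Assumed chunk shape: {"done": bool, "done_reason": str, "message": {"content": str}}
        content = (chunk.get("message") or {}).get("content") or ""
        is_empty_load = (
            chunk.get("done") is True
            and chunk.get("done_reason") == "load"
            and not content.strip()
        )
        if is_empty_load:
            continue
        yield chunk


stream = [
    {"done": True, "done_reason": "load", "message": {"content": "  "}},
    {"done": False, "message": {"content": "Hello"}},
    {"done": True, "done_reason": "load", "message": {"content": "kept"}},
    {"done": True, "done_reason": "stop", "message": {"content": ""}},
]
kept = list(skip_empty_load_responses(stream))
print([c["message"]["content"] for c in kept])
```

Only the first chunk is dropped; a load response with real content and a normal stop response both pass through, matching the backward-compatibility cases listed in the commit message.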
Mason Daugherty	116b758498	fix: bump deps for release (#32179 ) forgot to bump the `pyproject.toml` files	2025-07-22 13:12:14 -04:00
Mason Daugherty	10996a2821	release(perplexity): 0.1.2 (#32176 )	2025-07-22 13:02:19 -04:00
Mason Daugherty	2aed07efb6	release(deepseek): 0.1.4 (#32178 )	2025-07-22 13:01:54 -04:00
Mason Daugherty	64dac1faf7	release(huggingface): 0.3.1 (#32177 )	2025-07-22 13:01:34 -04:00
Mason Daugherty	58768d8aef	release(xai): 0.2.5 (#32174 )	2025-07-22 13:01:26 -04:00
Mason Daugherty	d65da13299	docs(ollama): add `validate_model_on_init` note, bump lock (#32172 )	2025-07-22 10:58:45 -04:00
Mason Daugherty	a5cecf77f0	unused-ignore	2025-07-22 10:33:14 -04:00
Mason Daugherty	da536abde6	Merge branch 'master' into copilot/fix-31398	2025-07-22 10:30:18 -04:00
Kanav Bansal	c14bd1fcfe	fix(docs): update RAG tutorials link to point to correct path (#32169 ) ## Description: This PR updates the internal documentation link for the RAG tutorials to reflect the updated path. Previously, the link pointed to the root `/docs/tutorials/`, which was generic. It now correctly routes to the RAG-specific tutorial page for the following text-embedding models. 1. DatabricksEmbeddings 2. IBM watsonx.ai 3. OpenAIEmbeddings 4. NomicEmbeddings 5. CohereEmbeddings 6. MistralAIEmbeddings 7. FireworksEmbeddings 8. TogetherEmbeddings 9. LindormAIEmbeddings 10. ModelScopeEmbeddings 11. ClovaXEmbeddings 12. NetmindEmbeddings 13. SambaNovaCloudEmbeddings 14. SambaStudioEmbeddings 15. ZhipuAIEmbeddings ## Issue: N/A ## Dependencies: None ## Twitter handle: N/A	2025-07-22 10:24:50 -04:00
Byeongjin Kang	a1ccabf85d	docs: add documentation about how to use extended thinking with ChatBedrockConverse (#32168 )	2025-07-22 08:44:08 -04:00
Mason Daugherty	0a07cde3a2	fix: add type ignore for asyncio.create_task to support Python 3.9 and 3.10	2025-07-21 21:22:16 -04:00
Copilot	2104cf0d9a	fix: replace deprecated `Pydantic .schema()` calls with v1/v2 compatible pattern (#32162 ) This PR addresses deprecation warnings users encounter when using LangChain tools with Pydantic v2: ``` PydanticDeprecatedSince20: The `schema` method is deprecated; use `model_json_schema` instead. Deprecated in Pydantic V2.0 to be removed in V3.0. ``` ## Root Cause Several LangChain components were still using the deprecated `.schema()` method directly instead of the Pydantic v1/v2 compatible approach. While users calling `.schema()` on returned models will still see warnings (which is correct), LangChain's internal code should not generate these warnings. ## Changes Made Updated 3 files to use the standard compatibility pattern: ```python # Before (deprecated) schema = model.schema() # After (compatible with both v1 and v2) if hasattr(model, "model_json_schema"): schema = model.model_json_schema() # Pydantic v2 else: schema = model.schema() # Pydantic v1 ``` ### Files Updated: - `evaluation/parsing/json_schema.py`: Fixed `_parse_json()` method to handle Pydantic models correctly - `output_parsers/yaml.py`: Fixed `get_format_instructions()` to use compatible schema access - `chains/openai_functions/citation_fuzzy_match.py`: Fixed direct `.schema()` call on QuestionAnswer model ## Verification ✅ Zero breaking changes - all existing functionality preserved ✅ No deprecation warnings from LangChain internal code ✅ Backward compatible with Pydantic v1 ✅ Forward compatible with Pydantic v2 ✅ Edge cases handled (strings, plain objects, etc.) ## User Impact LangChain users will no longer see deprecation warnings from internal LangChain code. 
Users who directly call `.schema()` on schemas returned by LangChain should adopt the same compatibility pattern: ```python # User code should use this pattern input_schema = tool.get_input_schema() if hasattr(input_schema, "model_json_schema"): schema_result = input_schema.model_json_schema() else: schema_result = input_schema.schema() ``` Fixes #31458. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-21 21:19:53 -04:00
Mason Daugherty	0e0b1f39ca	lint	2025-07-21 21:18:59 -04:00

13924 Commits