langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-21 18:39:57 +00:00

Author	SHA1	Message	Date
Chester Curme	844b8b87d7	Merge branch 'standard_outputs' into cc/openai_v1 # Conflicts: # libs/core/langchain_core/language_models/v1/chat_models.py # libs/core/langchain_core/messages/utils.py # libs/core/langchain_core/messages/v1.py # libs/partners/openai/langchain_openai/chat_models/_compat.py # libs/partners/openai/langchain_openai/chat_models/base.py	2025-07-28 12:38:32 -04:00
Chester Curme	61e329637b	lint	2025-07-28 11:02:37 -04:00
Chester Curme	b8fed06409	move get_num_tokens_from_messages to BaseChatModel and BaseChatModelV1	2025-07-28 10:58:57 -04:00
Mason Daugherty	ef9b5a9e18	add back standard_outputs	2025-07-28 10:47:26 -04:00
Mason Daugherty	5e9eb19a83	chore: update branch with changes from master (#32277 ) Co-authored-by: Maxime Grenu <69890511+cluster2600@users.noreply.github.com> Co-authored-by: Claude <claude@anthropic.com> Co-authored-by: Claude <noreply@anthropic.com> Co-authored-by: jmaillefaud <jonathan.maillefaud@evooq.ch> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: tanwirahmad <tanwirahmad@users.noreply.github.com> Co-authored-by: Christophe Bornet <cbornet@hotmail.com> Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com> Co-authored-by: niceg <79145285+growmuye@users.noreply.github.com> Co-authored-by: Chaitanya varma <varmac301@gmail.com> Co-authored-by: dishaprakash <57954147+dishaprakash@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com> Co-authored-by: Kanav Bansal <13186335+bansalkanav@users.noreply.github.com> Co-authored-by: Aleksandr Filippov <71711753+alex-feel@users.noreply.github.com> Co-authored-by: Alex Feel <afilippov@spotware.com>	2025-07-28 10:39:41 -04:00
Chester Curme	c409f723a2	Merge branch 'standard_outputs' into cc/openai_v1 # Conflicts: # libs/core/langchain_core/messages/utils.py	2025-07-28 10:19:50 -04:00
ccurme	3d9e694f73	feat(core): start on v1 chat model (#32276 ) Co-authored-by: Nuno Campos <nuno@langchain.dev>	2025-07-28 10:17:06 -04:00
Mason Daugherty	c921d08b18	feat(docs): add docstring to `_convert_from_v1_message()`	2025-07-25 11:01:48 -04:00
Mason Daugherty	3f653011e6	nit: use `block` instead of `content_block` for consistency in `convert_to_openai_image_block()`	2025-07-25 10:57:22 -04:00
Mason Daugherty	ee13a3b6fa	nit: rearrange `index` to be grouped with other always-present fields	2025-07-25 10:16:35 -04:00
Chester Curme	61129557c0	x	2025-07-24 17:17:33 -04:00
Chester Curme	4899857042	start on openai	2025-07-24 17:12:22 -04:00
Chester Curme	041b196145	Revert "copy BaseChatModel to language_models.v1" This reverts commit `2d031031e3`.	2025-07-24 13:33:41 -04:00
Chester Curme	dd8057a034	remove type ignores for eugene	2025-07-24 13:31:50 -04:00
Chester Curme	b94f23883f	move best-effort v1 conversion	2025-07-24 13:31:27 -04:00
Chester Curme	2d031031e3	copy BaseChatModel to language_models.v1	2025-07-24 09:56:45 -04:00
Chester Curme	0bb7a823c5	x	2025-07-23 15:17:46 -04:00
Chester Curme	df0a8562a9	openai: lint	2025-07-23 13:47:24 -04:00
ccurme	e9b0b84675	feat: new message formats (v0.4) (#32208 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-07-23 13:30:21 -04:00
Chester Curme	79bc8259e5	openai: format	2025-07-23 11:52:50 -04:00
Chester Curme	7c0d1cb324	openai: fix lint and tests	2025-07-23 09:53:46 -04:00
Chester Curme	eb8d32aff2	output_version -> str	2025-07-23 09:38:01 -04:00
Chester Curme	78d036a093	Merge branch 'wip-v0.4' into standard_outputs	2025-07-23 09:34:20 -04:00
Christophe Bornet	3496e1739e	feat(langchain): add ruff rules PL (#32079 ) See https://docs.astral.sh/ruff/rules/#pylint-pl	2025-07-22 23:55:32 -04:00
Chester Curme	6572656cd2	core: support both old and new data content blocks	2025-07-22 18:19:09 -04:00
Chester Curme	e1f034c795	openai: support web search and code interpreter content blocks	2025-07-22 16:58:43 -04:00
Chester Curme	b1a02f971b	fix tests	2025-07-22 16:45:19 -04:00
Mason Daugherty	3ed804a5f3	fix(perplexity): undo xfails (#32192 )	2025-07-22 16:29:37 -04:00
Mason Daugherty	ca137bfe62	.	2025-07-22 16:25:02 -04:00
Mason Daugherty	fa487fb62d	fix(perplexity): temp xfail int tests (#32191 ) It appears the API has changes since the 2025-04-15 release, leading to failed integration tests.	2025-07-22 16:20:51 -04:00
ccurme	3672bbc71e	fix(anthropic): update integration test models (#32189 ) Multiple models were [retired](https://docs.anthropic.com/en/docs/about-claude/model-deprecations#model-status) yesterday. Tests remain broken until we figure out what to do with the legacy Anthropic LLM integration— currently uses their (legacy) text completions API, for which there appear to be no remaining supported models.	2025-07-22 19:51:39 +00:00
Mason Daugherty	a02ad3d192	docs: formatting cleanup (#32188 ) * formatting cleaning * make `init_chat_model` more prominent in list of guides	2025-07-22 15:46:15 -04:00
ccurme	0c4054a7fc	release(core): 0.3.71 (#32186 )	2025-07-22 15:44:36 -04:00
ccurme	ebf2e11bcb	fix(core): exclude api_key from tracing metadata (#32184 ) (standard param)	2025-07-22 15:32:12 -04:00
ccurme	e41e6ec6aa	release(chroma): 0.2.5 (#32183 )	2025-07-22 15:24:03 -04:00
itaismith	09769373b3	feat(chroma): Add Chroma Cloud support (#32125 ) * Adding support for more Chroma client options (`HttpClient` and `CloundClient`). This includes adding arguments necessary for instantiating these clients. * Adding support for Chroma's new persisted collection configuration (we moved index configuration into this new construct). * Delegate `Settings` configuration to Chroma's client constructors.	2025-07-22 15:14:15 -04:00
ccurme	8acfd677bc	fix(core): add type key when tracing in some cases (#31825 )	2025-07-22 18:08:16 +00:00
Mason Daugherty	af3789b9ed	fix(deepseek): release openai version (#32181 ) used sdk version instead of langchain by accident	2025-07-22 13:29:52 -04:00
Mason Daugherty	a6896794ca	release(ollama): 0.3.6 (#32180 )	2025-07-22 13:24:17 -04:00
Copilot	d40fd5a3ce	feat(ollama): warn on empty `load` responses (#32161 ) ## Problem When using `ChatOllama` with `create_react_agent`, agents would sometimes terminate prematurely with empty responses when Ollama returned `done_reason: 'load'` responses with no content. This caused agents to return empty `AIMessage` objects instead of actual generated text. ```python from langchain_ollama import ChatOllama from langgraph.prebuilt import create_react_agent from langchain_core.messages import HumanMessage llm = ChatOllama(model='qwen2.5:7b', temperature=0) agent = create_react_agent(model=llm, tools=[]) result = agent.invoke(HumanMessage('Hello'), {"configurable": {"thread_id": "1"}}) # Before fix: AIMessage(content='', response_metadata={'done_reason': 'load'}) # Expected: AIMessage with actual generated content ``` ## Root Cause The `_iterate_over_stream` and `_aiterate_over_stream` methods treated any response with `done: True` as final, regardless of `done_reason`. When Ollama returns `done_reason: 'load'` with empty content, it indicates the model was loaded but no actual generation occurred - this should not be considered a complete response. ## Solution Modified the streaming logic to skip responses when: - `done: True` - `done_reason: 'load'` - Content is empty or contains only whitespace This ensures agents only receive actual generated content while preserving backward compatibility for load responses that do contain content. ## Changes - `_iterate_over_stream`: Skip empty load responses instead of yielding them - `_aiterate_over_stream`: Apply same fix to async streaming - Tests: Added comprehensive test cases covering all edge cases ## Testing All scenarios now work correctly: - ✅ Empty load responses are skipped (fixes original issue) - ✅ Load responses with actual content are preserved (backward compatibility) - ✅ Normal stop responses work unchanged - ✅ Streaming behavior preserved - ✅ `create_react_agent` integration fixed Fixes #31482. <!-- START COPILOT CODING AGENT TIPS --> --- 💡 You can make Copilot smarter by setting up custom instructions, customizing its development environment and configuring Model Context Protocol (MCP) servers. Learn more [Copilot coding agent tips](https://gh.io/copilot-coding-agent-tips) in the docs. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-22 13:21:11 -04:00
Mason Daugherty	116b758498	fix: bump deps for release (#32179 ) forgot to bump the `pyproject.toml` files	2025-07-22 13:12:14 -04:00
Mason Daugherty	10996a2821	release(perplexity): 0.1.2 (#32176 )	2025-07-22 13:02:19 -04:00
Mason Daugherty	2aed07efb6	release(deepseek): 0.1.4 (#32178 )	2025-07-22 13:01:54 -04:00
Mason Daugherty	64dac1faf7	release(huggingface): 0.3.1 (#32177 )	2025-07-22 13:01:34 -04:00
Mason Daugherty	58768d8aef	release(xai): 0.2.5 (#32174 )	2025-07-22 13:01:26 -04:00
Mason Daugherty	d65da13299	docs(ollama): add `validate_model_on_init` note, bump lock (#32172 )	2025-07-22 10:58:45 -04:00
Mason Daugherty	b24f90dabe	refactor(core): standard content blocks (#32085 )	2025-07-22 09:17:55 -04:00
Copilot	2104cf0d9a	fix: replace deprecated `Pydantic .schema()` calls with v1/v2 compatible pattern (#32162 ) This PR addresses deprecation warnings users encounter when using LangChain tools with Pydantic v2: ``` PydanticDeprecatedSince20: The `schema` method is deprecated; use `model_json_schema` instead. Deprecated in Pydantic V2.0 to be removed in V3.0. ``` ## Root Cause Several LangChain components were still using the deprecated `.schema()` method directly instead of the Pydantic v1/v2 compatible approach. While users calling `.schema()` on returned models will still see warnings (which is correct), LangChain's internal code should not generate these warnings. ## Changes Made Updated 3 files to use the standard compatibility pattern: ```python # Before (deprecated) schema = model.schema() # After (compatible with both v1 and v2) if hasattr(model, "model_json_schema"): schema = model.model_json_schema() # Pydantic v2 else: schema = model.schema() # Pydantic v1 ``` ### Files Updated: - `evaluation/parsing/json_schema.py`: Fixed `_parse_json()` method to handle Pydantic models correctly - `output_parsers/yaml.py`: Fixed `get_format_instructions()` to use compatible schema access - `chains/openai_functions/citation_fuzzy_match.py`: Fixed direct `.schema()` call on QuestionAnswer model ## Verification ✅ Zero breaking changes - all existing functionality preserved ✅ No deprecation warnings from LangChain internal code ✅ Backward compatible with Pydantic v1 ✅ Forward compatible with Pydantic v2 ✅ Edge cases handled (strings, plain objects, etc.) ## User Impact LangChain users will no longer see deprecation warnings from internal LangChain code. Users who directly call `.schema()` on schemas returned by LangChain should adopt the same compatibility pattern: ```python # User code should use this pattern input_schema = tool.get_input_schema() if hasattr(input_schema, "model_json_schema"): schema_result = input_schema.model_json_schema() else: schema_result = input_schema.schema() ``` Fixes #31458. <!-- START COPILOT CODING AGENT TIPS --> --- 💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click [here](https://survey.alchemer.com/s3/8343779/Copilot-Coding-agent) to start the survey. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-21 21:19:53 -04:00
Copilot	18c64aed6d	feat(core): add `sanitize_for_postgres` utility to fix PostgreSQL NUL byte DataError (#32157 ) This PR fixes the PostgreSQL NUL byte issue that causes `psycopg.DataError` when inserting documents containing `\x00` bytes into PostgreSQL-based vector stores. ## Problem PostgreSQL text fields cannot contain NUL (0x00) bytes. When documents with such characters are processed by PGVector or langchain-postgres implementations, they fail with: ``` (psycopg.DataError) PostgreSQL text fields cannot contain NUL (0x00) bytes ``` This commonly occurs when processing PDFs, documents from various loaders, or text extracted by libraries like unstructured that may contain embedded NUL bytes. ## Solution Added `sanitize_for_postgres()` utility function to `langchain_core.utils.strings` that removes or replaces NUL bytes from text content. ### Key Features - Simple API: `sanitize_for_postgres(text, replacement="")` - Configurable: Replace NUL bytes with empty string (default) or space for readability - Comprehensive: Handles all problematic examples from the original issue - Well-tested: Complete unit tests with real-world examples - Backward compatible: No breaking changes, purely additive ### Usage Example ```python from langchain_core.utils import sanitize_for_postgres from langchain_core.documents import Document # Before: This would fail with DataError problematic_content = "Getting\x00Started with embeddings" # After: Clean the content before database insertion clean_content = sanitize_for_postgres(problematic_content) # Result: "GettingStarted with embeddings" # Or preserve readability with spaces readable_content = sanitize_for_postgres(problematic_content, " ") # Result: "Getting Started with embeddings" # Use in Document processing doc = Document(page_content=clean_content, metadata={...}) ``` ### Integration Pattern PostgreSQL vector store implementations should sanitize content before insertion: ```python def add_documents(self, documents: List[Document]) -> List[str]: # Sanitize documents before insertion sanitized_docs = [] for doc in documents: sanitized_content = sanitize_for_postgres(doc.page_content, " ") sanitized_doc = Document( page_content=sanitized_content, metadata=doc.metadata, id=doc.id ) sanitized_docs.append(sanitized_doc) return self._insert_documents_to_db(sanitized_docs) ``` ## Changes Made - Added `sanitize_for_postgres()` function in `langchain_core/utils/strings.py` - Updated `langchain_core/utils/__init__.py` to export the new function - Added comprehensive unit tests in `tests/unit_tests/utils/test_strings.py` - Validated against all examples from the original issue report ## Testing All tests pass, including: - Basic NUL byte removal and replacement - Multiple consecutive NUL bytes - Empty string handling - Real examples from the GitHub issue - Backward compatibility with existing string utilities This utility enables PostgreSQL integrations in both langchain-community and langchain-postgres packages to handle documents with NUL bytes reliably. Fixes #26033. <!-- START COPILOT CODING AGENT TIPS --> --- 💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click [here](https://survey.alchemer.com/s3/8343779/Copilot-Coding-agent) to start the survey. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-07-21 20:33:20 -04:00
Christophe Bornet	64261449b8	feat(langchain): add ruff rules TRY (#32047 ) See https://docs.astral.sh/ruff/rules/#tryceratops-try * TRY004 (replace by TypeError) in main code is escaped with `noqa` to not break backward compatibility. The rule is still interesting for new code. * TRY301 ignored at the moment. This one is quite hard to fix and I'm not sure it's very interesting to activate it. Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-21 13:41:20 -04:00

1 2 3 4 5 ...

7378 Commits