langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 10:17:00 +00:00

Author	SHA1	Message	Date
ccurme	b8cdbc4eca	fix(anthropic): sanitize tool use block when taking directly from content (#32574 )	2025-08-18 09:06:57 -04:00
Mason Daugherty	d3d23e2372	fix(anthropic): streaming token counting to defer input tokens until completion (#32518 ) Supersedes #32461 Fixed incorrect input token reporting during streaming when tools are used. Previously, input tokens were counted at `message_start` before tool execution, leading to inaccurate counts. Now input tokens are properly deferred until `message_delta` (completion), aligning with Anthropic's billing model and SDK expectations. Before Fix: - Streaming with tools: Input tokens = 0 ❌ - Non-streaming with tools: Input tokens = 472 ✅ After Fix: - Streaming with tools: Input tokens = 472 ✅ - Non-streaming with tools: Input tokens = 472 ✅ Aligns with Anthropic's SDK expectations. The SDK handles input token updates in `message_delta` events: ```python # https://github.com/anthropics/anthropic-sdk-python/blob/main/src/anthropic/lib/streaming/_messages.py if event.usage.input_tokens is not None: current_snapshot.usage.input_tokens = event.usage.input_tokens ```	2025-08-15 17:49:46 -04:00
Jack	b9dcce95be	fix(anthropic): Add proxy (#32409 ) Thank you for contributing to LangChain! Follow these steps to mark your pull request as ready for review. If any of these steps are not completed, your PR will not be considered for review. - [x] PR title: Follows the format: {TYPE}({SCOPE}): {DESCRIPTION} - [x] PR message: *Delete this entire checklist* and replace with fix #30146 - [x] Add tests and docs: If you're adding a new integration, you must include: - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. We will not consider a PR unless these three are passing in CI. See [contribution guidelines](https://python.langchain.com/docs/contributing/) for more. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to `pyproject.toml` files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-12 21:21:26 +00:00
ccurme	be83ce74a7	feat(anthropic): support `cache_control` as a kwarg (#31523 ) ```python from langchain_anthropic import ChatAnthropic llm = ChatAnthropic(model="claude-3-5-haiku-latest") caching_llm = llm.bind(cache_control={"type": "ephemeral"}) caching_llm.invoke( [ HumanMessage("..."), AIMessage("..."), HumanMessage("..."), # <-- final message / content block gets cache annotation ] ) ``` Potentially useful given's Anthropic's [incremental caching](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching#continuing-a-multi-turn-conversation) capabilities: > During each turn, we mark the final block of the final message with cache_control so the conversation can be incrementally cached. The system will automatically lookup and use the longest previously cached prefix for follow-up messages. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-12 16:18:24 -04:00
Mason Daugherty	1167e7458e	fix(anthropic): update test model names and adjust token count assertions in integration tests (#32422 )	2025-08-12 19:39:35 +00:00
Mason Daugherty	d5fd0bca35	docs(anthropic): add documentation for extended context windows in Claude Sonnet 4 (#32517 )	2025-08-12 19:16:26 +00:00
Mason Daugherty	ee4c2510eb	feat: port various nit changes from `wip-v0.4` (#32506 ) Lots of work that wasn't directly related to core improvements/messages/testing functionality	2025-08-11 15:09:08 -04:00
Mason Daugherty	b7e4797e8b	release(anthropic): 0.3.18 (#32292 )	2025-07-28 17:07:11 -04:00
Mason Daugherty	3a487bf720	refactor(anthropic): AnthropicLLM to use Messages API (#32290 ) re: #32189	2025-07-28 16:22:58 -04:00
Mason Daugherty	a07d2c5016	refactor: remove references to unsupported model `claude-3-sonnet-20240229` (#32281 ) Addresses some (but not all) test issues brought about in #32280	2025-07-28 11:57:43 -04:00
Mason Daugherty	f624ad489a	feat(docs): improve devx, fix `Makefile` targets (#32237 ) TL;DR much of the provided `Makefile` targets were broken, and any time I wanted to preview changes locally I either had to refer to a command Chester gave me or try waiting on a Vercel preview deployment. With this PR, everything should behave like normal. Significant updates to the `Makefile` and documentation files, focusing on improving usability, adding clear messaging, and fixing/enhancing documentation workflows. ### Updates to `Makefile`: #### Enhanced build and cleaning processes: - Added informative messages (e.g., "📚 Building LangChain documentation...") to makefile targets like `docs_build`, `docs_clean`, and `api_docs_build` for better user feedback during execution. - Introduced a `clean-cache` target to the `docs` `Makefile` to clear cached dependencies and ensure clean builds. #### Improved dependency handling: - Modified `install-py-deps` to create a `.venv/deps_installed` marker, preventing redundant/duplicate dependency installations and improving efficiency. #### Streamlined file generation and infrastructure setup: - Added caching for the LangServe README download and parallelized feature table generation - Added user-friendly completion messages for targets like `copy-infra` and `render`. #### Documentation server updates: - Enhanced the `start` target with messages indicating server start and URL for local documentation viewing. --- ### Documentation Improvements: #### Content clarity and consistency: - Standardized section titles for consistency across documentation files. [[1]](diffhunk://#diff-9b1a85ea8a9dcf79f58246c88692cd7a36316665d7e05a69141cfdc50794c82aL1-R1) [[2]](diffhunk://#diff-944008ad3a79d8a312183618401fcfa71da0e69c75803eff09b779fc8e03183dL1-R1) - Refined phrasing and formatting in sections like "Dependency management" and "Formatting and linting" for better readability. [[1]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L6-R6) [[2]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L84-R82) #### Enhanced workflows: - Updated instructions for building and viewing documentation locally, including tips for specifying server ports and handling API reference previews. [[1]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L60-R94) [[2]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L82-R126) - Expanded guidance on cleaning documentation artifacts and using linting tools effectively. [[1]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L82-R126) [[2]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L107-R142) #### API reference documentation: - Improved instructions for generating and formatting in-code documentation, highlighting best practices for docstring writing. [[1]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L107-R142) [[2]](diffhunk://#diff-048deddcfd44b242e5b23aed9f2e9ec73afc672244ce14df2a0a316d95840c87L144-R186) --- ### Minor Changes: - Added support for a new package name (`langchain_v1`) in the API documentation generation script. - Fixed minor capitalization and formatting issues in documentation files. [[1]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L40-R40) [[2]](diffhunk://#diff-2069d4f956ab606ae6d51b191439283798adaf3a6648542c409d258131617059L166-R160) --------- Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-07-25 14:49:03 -04:00
niceg	0d6f915442	fix: LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. (#32222 ) fix: Fix LLM mimicking Unicode responses due to forced Unicode conversion of non-ASCII characters. - Description: This PR fixes an issue where the LLM would mimic Unicode responses due to forced Unicode conversion of non-ASCII characters in tool calls. The fix involves disabling the `ensure_ascii` flag in `json.dumps()` when converting tool calls to OpenAI format. - Issue: Fixes ↓↓↓ input： ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "你好啊集团"}'}}]} ``` output: ```json {'role': 'assistant', 'tool_calls': [{'type': 'function', 'id': 'call_nv9trcehdpihr21zj9po19vq', 'function': {'name': 'create_customer', 'arguments': '{"customer_name": "\\u4f60\\u597d\\u554a\\u96c6\\u56e2"}'}}]} ``` then: llm will mimic outputting unicode. Unicode's vast number of symbols can lengthen LLM responses, leading to slower performance. <img width="686" height="277" alt="image" src="https://github.com/user-attachments/assets/28f3b007-3964-4455-bee2-68f86ac1906d" /> --------- Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 17:01:31 -04:00
Mason Daugherty	d53ebf367e	fix(docs): capitalization, codeblock formatting, and hyperlinks, note blocks (#32235 ) widespread cleanup attempt	2025-07-24 16:55:04 -04:00
Copilot	54542b9385	docs(openai): add comprehensive documentation and examples for `extra_body` + others (#32149 ) This PR addresses the common issue where users struggle to pass custom parameters to OpenAI-compatible APIs like LM Studio, vLLM, and others. The problem occurs when users try to use `model_kwargs` for custom parameters, which causes API errors. ## Problem Users attempting to pass custom parameters (like LM Studio's `ttl` parameter) were getting errors: ```python # ❌ This approach fails llm = ChatOpenAI( base_url="http://localhost:1234/v1", model="mlx-community/QwQ-32B-4bit", model_kwargs={"ttl": 5} # Causes TypeError: unexpected keyword argument 'ttl' ) ``` ## Solution The `extra_body` parameter is the correct way to pass custom parameters to OpenAI-compatible APIs: ```python # ✅ This approach works correctly llm = ChatOpenAI( base_url="http://localhost:1234/v1", model="mlx-community/QwQ-32B-4bit", extra_body={"ttl": 5} # Custom parameters go in extra_body ) ``` ## Changes Made 1. Enhanced Documentation: Updated the `extra_body` parameter docstring with comprehensive examples for LM Studio, vLLM, and other providers 2. Added Documentation Section: Created a new "OpenAI-compatible APIs" section in the main class docstring with practical examples 3. Unit Tests: Added tests to verify `extra_body` functionality works correctly: - `test_extra_body_parameter()`: Verifies custom parameters are included in request payload - `test_extra_body_with_model_kwargs()`: Ensures `extra_body` and `model_kwargs` work together 4. Clear Guidance: Documented when to use `extra_body` vs `model_kwargs` ## Examples Added LM Studio with TTL (auto-eviction): ```python ChatOpenAI( base_url="http://localhost:1234/v1", api_key="lm-studio", model="mlx-community/QwQ-32B-4bit", extra_body={"ttl": 300} # Auto-evict after 5 minutes ) ``` vLLM with custom sampling: ```python ChatOpenAI( base_url="http://localhost:8000/v1", api_key="EMPTY", model="meta-llama/Llama-2-7b-chat-hf", extra_body={ "use_beam_search": True, "best_of": 4 } ) ``` ## Why This Works - `model_kwargs` parameters are passed directly to the OpenAI client's `create()` method, causing errors for non-standard parameters - `extra_body` parameters are included in the HTTP request body, which is exactly what OpenAI-compatible APIs expect for custom parameters Fixes #32115. <!-- START COPILOT CODING AGENT TIPS --> --- 💬 Share your feedback on Copilot coding agent for the chance to win a $200 gift card! Click [here](https://survey.alchemer.com/s3/8343779/Copilot-Coding-agent) to start the survey. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-24 16:43:16 -04:00
ccurme	3672bbc71e	fix(anthropic): update integration test models (#32189 ) Multiple models were [retired](https://docs.anthropic.com/en/docs/about-claude/model-deprecations#model-status) yesterday. Tests remain broken until we figure out what to do with the legacy Anthropic LLM integration— currently uses their (legacy) text completions API, for which there appear to be no remaining supported models.	2025-07-22 19:51:39 +00:00
ccurme	2ef9465893	fix(anthropic): fix test (#32145 )	2025-07-21 14:49:40 +00:00
Mason Daugherty	71b361936d	ruff: restore stacklevels, disable autofixing (#31919 )	2025-07-08 12:55:47 -04:00
Mason Daugherty	ae210c1590	ruff: add bugbear across packages (#31917 ) WIP, other packages will get in next PRs	2025-07-08 12:22:55 -04:00
Mason Daugherty	750721b4c3	huggingface[patch]: ruff fixes and rules (#31912 ) * bump ruff deps * add more thorough ruff rules * fix said rules	2025-07-08 10:07:57 -04:00
Mason Daugherty	2a7645300c	anthropic[patch]: ruff fixes and rules (#31899 ) * bump ruff deps * add more thorough ruff rules * fix said rules	2025-07-07 18:32:27 -04:00
Mason Daugherty	e7eac27241	ruff: more rules across the board & fixes (#31898 ) * standardizes ruff dep version across all `pyproject.toml` files * cli: ruff rules and corrections * langchain: rules and corrections	2025-07-07 17:48:01 -04:00
Mason Daugherty	706a66eccd	fix: automatically fix issues with ruff (#31897 ) * Perform safe automatic fixes instead of only selecting [isort](https://docs.astral.sh/ruff/rules/#isort-i)	2025-07-07 14:13:10 -04:00
ccurme	3f4b355eef	anthropic[patch]: pass back in citations in multi-turn conversations (#31882 ) Also adds VCR cassettes for some heavy tests.	2025-07-05 17:33:22 -04:00
ccurme	ade642b7c5	Revert "infra: temporarily skip tests" (#31854 ) Reverts langchain-ai/langchain#31853	2025-07-03 13:55:29 -04:00
ccurme	c9f45dc323	infra: temporarily skip tests (#31853 ) Tests failed twice with different timeout errors.	2025-07-03 13:39:14 -04:00
ccurme	f88fff0b8a	anthropic: release 0.3.17 (#31852 )	2025-07-03 13:18:43 -04:00
Mason Daugherty	1a3a8db3c9	docs: anthropic formatting cleanup (#31847 ) inline URLs, capitalization, code blocks	2025-07-03 14:50:23 +00:00
Mason Daugherty	645e25f624	langchain-anthropic[patch]: Add ruff bandit rules (#31789 )	2025-06-30 14:00:53 -04:00
ccurme	0ae434be21	anthropic: release 0.3.16 (#31744 )	2025-06-26 09:09:29 -04:00
ccurme	b02bd67788	anthropic[patch]: cache clients (#31659 )	2025-06-25 14:49:02 -04:00
ccurme	e09abf8170	anthropic[patch]: add benchmark (#31718 ) Account for lazy loading of clients in init time benchmark	2025-06-24 15:17:22 -04:00
ccurme	ee83993b91	docs: document Anthropic cache TTL count details (#31708 )	2025-06-23 20:16:42 +00:00
Bagatur	f7f52cab12	anthropic[patch]: cache tokens nit (#31484 ) if you pass in beta headers directly cache_creation is a dict	2025-06-05 16:15:03 -04:00
ccurme	14c561e15d	infra: relax types-requests version range (#31504 )	2025-06-05 18:57:08 +00:00
Bagatur	ec8bab83f8	anthropic[fix]: bump langchain-core dep (#31483 )	2025-06-03 10:56:48 -04:00
Bagatur	310e643842	release[anthropic]: 0.3.15 (#31479 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-06-03 10:38:11 -04:00
ccurme	d3be4a0c56	infra: remove use of --vcr-record=none (#31452 ) This option is specific to `pytest-vcr`. `pytest-recording` runs in this mode by default.	2025-06-01 10:49:59 -04:00
ccurme	3db1aa0ba6	standard-tests: migrate to pytest-recording (#31425 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-05-31 15:21:15 -04:00
ccurme	49eeb0f3c3	standard-tests: add benchmarks (#31302 ) Co-authored-by: Sydney Runkle <sydneymarierunkle@gmail.com>	2025-05-29 15:21:37 +00:00
ccurme	0e3f35effe	anthropic: store cache ttl details on usage metadata (#31393 )	2025-05-28 13:52:37 -04:00
ccurme	443341a20d	anthropic: release 0.3.14 (#31378 )	2025-05-27 17:31:05 +00:00
ccurme	580986b260	anthropic: support for code execution, MCP connector, files API features (#31340 ) Support for the new [batch of beta features](https://www.anthropic.com/news/agent-capabilities-api) released yesterday: - [Code execution](https://docs.anthropic.com/en/docs/agents-and-tools/tool-use/code-execution-tool) - [MCP connector](https://docs.anthropic.com/en/docs/agents-and-tools/mcp-connector) - [Files API](https://docs.anthropic.com/en/docs/build-with-claude/files) Also verified support for [prompt cache TTL](https://docs.anthropic.com/en/docs/build-with-claude/prompt-caching#1-hour-cache-duration-beta).	2025-05-27 12:45:45 -04:00
mathislindner	e1af509966	anthropic: emit informative error message if there are only system messages in a prompt (#30822 ) PR message: Not sure if I put the check at the right spot, but I thought throwing the error before the loop made sense to me. Description: Checks if there are only system messages using AnthropicChat model and throws an error if it's the case. Check Issue for more details Issue: #30764 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-16 20:43:59 +00:00
ccurme	9aac8923a3	docs: add web search to anthropic docs (#31169 )	2025-05-08 16:20:11 -04:00
ccurme	2d202f9762	anthropic[patch]: split test into two (#31167 )	2025-05-08 09:23:36 -04:00
ccurme	d4555ac924	anthropic: release 0.3.13 (#31162 )	2025-05-08 03:13:15 +00:00
ccurme	e34f9fd6f7	anthropic: update streaming usage metadata (#31158 ) Anthropic updated how they report token counts during streaming today. See changes to `MessageDeltaUsage` in [this commit](`2da00f26c5 (diff-1a396eba0cd9cd8952dcdb58049d3b13f6b7768ead1411888d66e28211f7bfc5)`). It's clean and simple to grab these fields from the final `message_delta` event. However, some of them are typed as Optional, and language [here](`e42451ab3f/src/anthropic/lib/streaming/_messages.py (L462)`) suggests they may not always be present. So here we take the required field from the `message_delta` event as we were doing previously, and ignore the rest.	2025-05-07 23:09:56 -04:00
ccurme	682f338c17	anthropic[patch]: support web search (#31157 )	2025-05-07 18:04:06 -04:00
ccurme	b5b90b5929	anthropic[patch]: be robust to null fields when translating usage metadata (#31151 )	2025-05-07 18:30:21 +00:00
Ben Gladwell	da59eb7eb4	anthropic: Allow kwargs to pass through when counting tokens (#31082 ) - Description: `ChatAnthropic.get_num_tokens_from_messages` does not currently receive `kwargs` and pass those on to `self._client.beta.messages.count_tokens`. This is a problem if you need to pass specific options to `count_tokens`, such as the `thinking` option. This PR fixes that. - Issue: N/A - Dependencies: None - Twitter handle: @bengladwell Co-authored-by: ccurme <chester.curme@gmail.com>	2025-04-30 17:56:22 -04:00

... 3 4 5 6 7 ...

434 Commits