langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-11 22:04:37 +00:00

Author	SHA1	Message	Date
Eugene Yurtsev	9f345d64fd	core[patch]: Remove old accidental commit (#30483 ) Remove commented out file that was accidentally added Co-authored-by: ccurme <chester.curme@gmail.com>	2025-03-25 15:37:20 -07:00
ccurme	4b9e2e51f3	core[patch]: add token counting callback handler (#30481 ) Stripped-down version of [OpenAICallbackHandler](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/callbacks/openai_info.py) that just tracks `AIMessage.usage_metadata`. ```python from langchain_core.callbacks import get_usage_metadata_callback from langgraph.prebuilt import create_react_agent def get_weather(location: str) -> str: """Get the weather at a location.""" return "It's sunny." tools = [get_weather] agent = create_react_agent("openai:gpt-4o-mini", tools) with get_usage_metadata_callback() as cb: result = await agent.ainvoke({"messages": "What's the weather in Boston?"}) print(cb.usage_metadata) ```	2025-03-25 18:16:39 -04:00
pulvedu	1d2b1d8e5e	docs: fix typos in Tavily Docs (#30484 ) Thank you for contributing to LangChain! Small changes to docs --------- Co-authored-by: pulvedu <dustin@tavily.com>	2025-03-25 18:16:09 -04:00
Christian Jung	19104db7c5	Docs: Fix typo in cookbook (#30485 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: fix typo - Issue: - - Dependencies: - - Twitter handle: - - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17.	2025-03-25 18:15:29 -04:00
Eugene Yurtsev	0acca6b9c8	core[patch]: Fix handling of `title` when tool schema is specified manually via JSONSchema (#30479 ) Fix issue: https://github.com/langchain-ai/langchain/issues/30456	2025-03-25 15:15:24 -04:00
Ben Chambers	c5e42a4027	community: deprecate graph vector store (#30328 ) - Description: mark GraphVectorStore `@deprecated` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-25 13:52:54 +00:00
Ian Muge	a8ce63903d	community: Add edge properties to the gremlin graph schema (#30449 ) Description: Extend the gremlin graph schema to include the edge properties, grouped by its triples; i.e: `inVLabel` and `outVLabel`. This should give more context when crafting queries to run against a gremlin graph db	2025-03-24 19:03:01 -04:00
ccurme	b60e6f6efa	community[patch]: update API ref for AmazonTextractPDFParser (#30468 )	2025-03-24 23:02:52 +00:00
David Sánchez Sánchez	3ba0d28d8e	community: update perplexity docstring (#30451 ) This pull request includes extensive documentation updates for the `ChatPerplexity` class in the `libs/community/langchain_community/chat_models/perplexity.py` file. The changes provide detailed setup instructions, key initialization arguments, and usage examples for various functionalities of the `ChatPerplexity` class. Documentation improvements: * Added setup instructions for installing the `openai` package and setting the `PPLX_API_KEY` environment variable. * Documented key initialization arguments for completion parameters and client parameters, including `model`, `temperature`, `max_tokens`, `streaming`, `pplx_api_key`, `request_timeout`, and `max_retries`. * Provided examples for instantiating the `ChatPerplexity` class, invoking it with messages, using structured output, invoking with perplexity-specific parameters, streaming responses, and accessing token usage and response metadata.Thank you for contributing to LangChain!	2025-03-24 15:01:02 -04:00
Vadym Barda	97dec30eea	docs[patch]: update trim_messages doc (#30462 )	2025-03-24 18:50:48 +00:00
ccurme	c2dd8d84ff	infra[patch]: remove pyspark from langchain-community extended testing requirements (#30466 )	2025-03-24 14:41:54 -04:00
ccurme	aa30d2d57f	standard-tests: release 0.3.16 (#30464 )	2025-03-24 18:35:12 +00:00
ccurme	b09e7c125c	cli: use pytest-watcher (#30465 ) pytest-watch is no longer maintained.	2025-03-24 18:06:31 +00:00
David Sánchez Sánchez	d7b13e12ee	community: update perplexity documentation (#30450 ) This pull request includes updates to the `docs/docs/integrations/chat/perplexity.ipynb` file to enhance the documentation for `ChatPerplexity`. The changes focus on demonstrating the use of Perplexity-specific parameters and supporting structured outputs for Tier 3+ users. Enhancements to documentation: * Added a new markdown cell explaining the use of Perplexity-specific parameters through the `ChatPerplexity` class, including parameters like `search_domain_filter`, `return_images`, `return_related_questions`, and `search_recency_filter` using the `extra_body` parameter. * Added a new code cell demonstrating how to invoke `ChatPerplexity` with the `extra_body` parameter to filter search recency. Support for structured outputs: * Added a new markdown cell explaining that `ChatPerplexity` supports structured outputs for Tier 3+ users. * Added a new code cell demonstrating how to use `ChatPerplexity` with structured outputs by defining a `BaseModel` class and invoking the chat with structured output.[Copilot is generating a summary...]Thank you for contributing to LangChain! --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-24 13:49:59 -04:00
ccurme	50ec4a1a4f	openai[patch]: attempt to make test less flaky (#30463 )	2025-03-24 17:36:36 +00:00
ccurme	8486e0ae80	openai[patch]: bump openai sdk (#30461 ) [New required field](https://github.com/openai/openai-python/pull/2223/files#diff-530fd17eb1cc43440c82630df0ddd9b0893cf14b04065a95e6eef6cd2f766a44R26) for `ResponseUsage` released in 1.66.5.	2025-03-24 12:10:00 -04:00
ccurme	cbbc968903	openai: release 0.3.10 (#30460 )	2025-03-24 15:37:53 +00:00
ccurme	ed5e589191	openai[patch]: support multi-turn computer use (#30410 ) Here we accept ToolMessages of the form ```python ToolMessage( content=<representation of screenshot> (see below), tool_call_id="abc123", additional_kwargs={"type": "computer_call_output"}, ) ``` and translate them to `computer_call_output` items for the Responses API. We also propagate `reasoning_content` items from AIMessages. ## Example ### Load screenshots ```python import base64 def load_png_as_base64(file_path): with open(file_path, "rb") as image_file: encoded_string = base64.b64encode(image_file.read()) return encoded_string.decode('utf-8') screenshot_1_base64 = load_png_as_base64("/path/to/screenshot/of/application.png") screenshot_2_base64 = load_png_as_base64("/path/to/screenshot/of/desktop.png") ``` ### Initial message and response ```python from langchain_core.messages import HumanMessage, ToolMessage from langchain_openai import ChatOpenAI llm = ChatOpenAI( model="computer-use-preview", model_kwargs={"truncation": "auto"}, ) tool = { "type": "computer_use_preview", "display_width": 1024, "display_height": 768, "environment": "browser" } llm_with_tools = llm.bind_tools([tool]) input_message = HumanMessage( content=[ { "type": "text", "text": ( "Click the red X to close and reveal my Desktop. " "Proceed, no confirmation needed." ) }, { "type": "input_image", "image_url": f"data:image/png;base64,{screenshot_1_base64}", } ] ) response = llm_with_tools.invoke( [input_message], reasoning={ "generate_summary": "concise", }, ) response.additional_kwargs["tool_outputs"] ``` ### Construct ToolMessage ```python tool_call_id = response.additional_kwargs["tool_outputs"][0]["call_id"] tool_message = ToolMessage( content=[ { "type": "input_image", "image_url": f"data:image/png;base64,{screenshot_2_base64}" } ], # content=f"data:image/png;base64,{screenshot_2_base64}", # <-- also acceptable tool_call_id=tool_call_id, additional_kwargs={"type": "computer_call_output"}, ) ``` ### Invoke again ```python messages = [ input_message, response, tool_message, ] response_2 = llm_with_tools.invoke( messages, reasoning={ "generate_summary": "concise", }, ) ```	2025-03-24 15:25:36 +00:00
Vadym Barda	7bc50730aa	core[patch]: release 0.3.48 (#30458 )	2025-03-24 09:48:03 -04:00
Mohammad Mohtashim	33f1ab1528	Youtube Loader `load` method Fixed (#30314 ) - Description: Fixed the `YoutubeLoader` loading method not returning the correct object - Issue: #30309 --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-03-23 14:48:03 -04:00
Simon Paredes	df4448dfac	langchain-groq: Add response metadata when streaming (#30379 ) - Description: Add missing `model_name` and `system_fingerprint` metadata when streaming. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-23 14:34:41 -04:00
Changyong Um	e2d9fe766f	community[tool]: Integrate a tool for the naver_search (#30392 ) Hello! I have reopened a pull request for tool integration. Please refer to the previous [PR](https://github.com/langchain-ai/langchain/pull/30248). I understand that for the tool integration, a separate package should be created, and only the documentation should be added under docs/docs/. If there are any other procedures, please let me know. [langchain-naver-community](https://github.com/e7217/langchain-naver-community) cc: @ccurme --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-23 14:05:24 -04:00
Jonathan Feng	3848a1371d	langchain-contextual: update provider documentation and add reranker documentation (#30415 ) Hi @ccurme! Thanks so much for helping with getting the Contextual documentation merged last time. We added the reranker to our provider's documentation! Please let me know if there's any issues with it! Would love to also work with your team on an announcement for this! 🙏 Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [x] PR message: *Delete this entire checklist* and replace with - Description: updates contextual provider documentation to include information about our reranker, also includes documentation for contextual's reranker in the retrievers section - Twitter handle: https://x.com/ContextualAI/highlights docs have been added - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2025-03-22 18:09:09 -04:00
ccurme	d867afff1c	docs: update package table ordering (#30437 ) Update download counts (only impacts ordering, counts in rendered page are updated automatically).	2025-03-22 18:07:08 -04:00
Brandon Luu	bbbd4e1db8	docs: Update VectorStoreTab vector store initializations (#30413 ) Description: Update vector store tab inits to match either the docs or api_ref (whichever was more comprehensive) List of changes per vector stores: - In-memory - no change - AstraDB - match to docs - docs/api_refs match (excluding embeddings) - Chroma - match to docs - api_refs is less descriptive - FAISS - match to docs - docs/api_refs match (excluding embeddings) - Milvus - match to docs to use Milvus Lite with Flat index - api_refs does not have index_param for generalization - MongoDB - match to docs - api_refs are sparser - PGVector - match to api_ref - changed to include docker cmd directly in code - docs/api_ref has comment to view docker command in separate code block - Pinecone - match to api_refs - docs have code dispersed - Qdrant - match to api_ref - docs has size=3072, api_ref has size=1536 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-22 17:29:45 -04:00
Matthew Farrellee	e7032901c3	langchain-tests: allow test_serdes for packages outside the default valid namespaces (#30343 ) Description: a third party package not listed in the default valid namespaces cannot pass test_serdes because the load() does not allow for extending the valid_namespaces. test_serdes will fail with - ValueError: Invalid namespace: {'lc': 1, 'type': 'constructor', 'id': ['langchain_other', 'chat_models', 'ChatOther'], 'kwargs': {'model_name': '...', 'api_key': '...'}, 'name': 'ChatOther'} this change has test_serdes automatically extend valid_namespaces based off the ChatModel under test's namespace.	2025-03-22 17:27:39 -04:00
Jiwon Kang	699475a01d	community: uuidv1 is unsafe (#30432 ) this_row_id previously used UUID v1. However, since UUID v1 can be predicted if the MAC address and timestamp are known, it poses a potential security risk. Therefore, it has been changed to UUID v4.	2025-03-22 15:27:49 -04:00
Dhruvajyoti Sarma	31551dab40	feature: added warning when duckdb is used as a vectorstore without pandas (#30435 ) added warning when duckdb is used as a vectorstore without pandas being installed (currently used for similarity search result processing) Thank you for contributing to LangChain! - [ ] PR title: "community: added warning when duckdb is used as a vectorstore without pandas" - [ ] PR message: *Delete this entire checklist* and replace with - Description: displays a warning when using duckdb as a vector store without pandas being installed, as it is used by the `similarity_search` function - Issue: #29933 - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-22 19:27:21 +00:00
ccurme	e81b82ee0b	docs: update cassettes (#30434 ) Following updates to `draw_mermaid_png`	2025-03-22 12:57:36 -04:00
ccurme	6484635ac3	docs: update cassettes for response metadata guide (#30431 ) As of langchain-groq 0.3 ChatGroq requires a model name. Also update other models.	2025-03-22 07:52:08 -04:00
Cesar Sanz	5383abfeee	Fix incorrect import path for AzureAIChatCompletionsModel (#30417 ) Fixes #30416 Correct the import path for `AzureAIChatCompletionsModel` in the `_init_chat_model_helper` function. * Update the import statement in `libs/langchain/langchain/chat_models/base.py` to `from langchain_azure_ai.chat_models import AzureAIChatCompletionsModel`. --- For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/langchain-ai/langchain/pull/30417?shareId=6ff6d5de-e3d1-4972-8d24-5e74838e9945).	2025-03-22 07:44:51 -04:00
Misakar	7750ad588b	community：ChatLiteLLM support output reasoning content (#30430 )	2025-03-22 07:43:33 -04:00
Adrián Panella	b75573e858	core: add tool_call exclusion in filter_message (#30289 ) Extend functionallity to allow to filter pairs of tool calls (ai + tool). --------- Co-authored-by: vbarda <vadym@langchain.dev>	2025-03-21 23:05:29 +00:00
Vadym Barda	673ec00030	docs[patch]: add warning to token counter docstring (#30426 )	2025-03-21 18:59:40 -04:00
Adrián Panella	3933a4abc3	core(mermaid): allow greater customization (#29939 ) Adds greater style customization by allowing a custom frontmatter config. This allows to set a `theme` and `look` or to adjust theme by setting `themeVariables` Example: ```python node_colors = NodeStyles( default="fill:#e2e2e2,line-height:1.2,stroke:#616161", first="fill:#cfeab8,fill-opacity:0", last="fill:#eac3b8", ) frontmatter_config = { "config": { "theme": "neutral", "look": "handDrawn" } } graph.get_graph().draw_mermaid_png(node_colors=node_colors, frontmatter_config=frontmatter_config) ``` ![image](https://github.com/user-attachments/assets/11b56d30-3be2-482f-8432-3ce704a09552) --------- Co-authored-by: vbarda <vadym@langchain.dev>	2025-03-21 18:25:26 -04:00
Vadym Barda	07823cd41c	core[patch]: optimize trim_messages (#30327 ) Refactored w/ Claude Up to 20x speedup! (with theoretical max improvement of `O(n / log n)`)	2025-03-21 17:08:26 -04:00
ccurme	b78ae7817e	openai[patch]: trace strict in structured_output_kwargs (#30425 )	2025-03-21 14:37:28 -04:00
axiangcoding	428de88398	docs: Update a note about how to track azure openai's token usage when streaming (#30409 ) - Description: Update a note about how to track azure openai's token usage when streaming - Issue: #30390 - Dependencies: None - Twitter handle: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 14:18:50 -04:00
ccurme	1de7fa8f3a	Revert "deepseek: temporarily bypass tests" (#30424 ) Reverts langchain-ai/langchain#30423	2025-03-21 17:14:31 +00:00
ccurme	c74dfff836	deepseek: temporarily bypass tests (#30423 ) Deepseek infra is not stable enough to get through integration tests. Previous two attempts had two tests time out, they both pass locally.	2025-03-21 17:08:35 +00:00
ccurme	7147903724	deepseek: release 0.1.3 (#30422 )	2025-03-21 16:39:50 +00:00
Andras L Ferenczi	b5f49df86a	partner: ChatDeepSeek on openrouter not returning reasoning (#30240 ) Deepseek model does not return reasoning when hosted on openrouter (Issue [30067](https://github.com/langchain-ai/langchain/issues/30067)) the following code did not return reasoning: ```python llm = ChatDeepSeek( model = 'deepseek/deepseek-r1:nitro', api_base="https://openrouter.ai/api/v1", api_key=os.getenv("OPENROUTER_API_KEY")) messages = [ {"role": "system", "content": "You are an assistant."}, {"role": "user", "content": "9.11 and 9.8, which is greater? Explain the reasoning behind this decision."} ] response = llm.invoke(messages, extra_body={"include_reasoning": True}) print(response.content) print(f"REASONING: {response.additional_kwargs.get('reasoning_content', '')}") print(response) ``` The fix is to extract reasoning from response.choices[0].message["model_extra"] and from choices[0].delta["reasoning"]. and place in response additional_kwargs. Change is really just the addition of a couple one-sentence if statements. --------- Co-authored-by: andrasfe <andrasf94@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 16:35:37 +00:00
Vadym Barda	4852ab8d0a	core[patch]: more tests for trim_messages (#30421 )	2025-03-21 16:19:52 +00:00
ccurme	e8e3b2bfae	ollama: release 0.3.0 (#30420 )	2025-03-21 15:50:08 +00:00
Jojo	8f300740ed	docs: fix several typos in docs/docs/how_to/split_html.ipynb (#30407 ) Fix several typos in docs/docs/how_to/split_html.ipynb * `structered` should be `structured` * `signifcant` should be `significant` * `seperator` should be `separator`	2025-03-21 11:46:26 -04:00
Jojo	c77ee99980	docs: fix typo in chat_history.ipynb (#30406 ) `peristence` should be `persistence`	2025-03-21 11:45:52 -04:00
Jojo	f657b19a24	docs: Fix typo in chat_history.ipynb (#30405 ) `repsonse` should be `response`	2025-03-21 11:45:31 -04:00
Bob Merkus	5700646cc5	ollama: add reasoning model support (e.g. deepseek) (#29689 ) # Description This PR adds reasoning model support for `langchain-ollama` by extracting reasoning token blocks, like those used in deepseek. It was inspired by [ollama-deep-researcher](https://github.com/langchain-ai/ollama-deep-researcher), specifically the parsing of [thinking blocks](`6d1aaf2139/src/assistant/graph.py (L91)`): ```python # TODO: This is a hack to remove the <think> tags w/ Deepseek models # It appears very challenging to prompt them out of the responses while "<think>" in running_summary and "</think>" in running_summary: start = running_summary.find("<think>") end = running_summary.find("</think>") + len("</think>") running_summary = running_summary[:start] + running_summary[end:] ``` This notes that it is very hard to remove the reasoning block from prompting, but we actually want the model to reason in order to increase model performance. This implementation extracts the thinking block, so the client can still expect a proper message to be returned by `ChatOllama` (and use the reasoning content separately when desired). This implementation takes the same approach as [ChatDeepseek](`5d581ba22c/libs/partners/deepseek/langchain_deepseek/chat_models.py (L215)`), which adds the reasoning content to chunk.additional_kwargs.reasoning_content; ```python if hasattr(response.choices[0].message, "reasoning_content"): # type: ignore rtn.generations[0].message.additional_kwargs["reasoning_content"] = ( response.choices[0].message.reasoning_content # type: ignore ) ``` This should probably be handled upstream in ollama + ollama-python, but this seems like a reasonably effective solution. This is a standalone example of what is happening; ```python async def deepseek_message_astream( llm: BaseChatModel, messages: list[BaseMessage], config: RunnableConfig \| None = None, , model_target: str = "deepseek-r1", kwargs: Any, ) -> AsyncIterator[BaseMessageChunk]: """Stream responses from Deepseek models, filtering out <think> tags. Args: llm: The language model to stream from messages: The messages to send to the model Yields: Filtered chunks from the model response """ # check if the model is deepseek based if (llm.name and model_target not in llm.name) or (hasattr(llm, "model") and model_target not in llm.model): async for chunk in llm.astream(messages, config=config, kwargs): yield chunk return # Yield with a buffer, upon completing the <think></think> tags, move them to the reasoning content and start over buffer = "" async for chunk in llm.astream(messages, config=config, *kwargs): # start or append if not buffer: buffer = chunk.content else: buffer += chunk.content if hasattr(chunk, "content") else chunk # Process buffer to remove <think> tags if "<think>" in buffer or "</think>" in buffer: if hasattr(chunk, "tool_calls") and chunk.tool_calls: raise NotImplementedError("tool calls during reasoning should be removed?") if "<think>" in chunk.content or "</think>" in chunk.content: continue chunk.additional_kwargs["reasoning_content"] = chunk.content chunk.content = "" # upon block completion, reset the buffer if "<think>" in buffer and "</think>" in buffer: buffer = "" yield chunk ``` # Issue Integrating reasoning models (e.g. deepseek-r1) into existing LangChain based workflows is hard due to the thinking blocks that are included in the message contents. To avoid this, we could match the `ChatOllama` integration with `ChatDeepseek` to return the reasoning content inside `message.additional_arguments.reasoning_content` instead. # Dependenices None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 15:44:54 +00:00
ccurme	d8145dda95	xai: release 0.2.2 (#30403 )	2025-03-20 20:25:16 +00:00
ccurme	e194902994	mistral: release 0.2.9 (#30402 )	2025-03-20 20:22:24 +00:00

... 3 4 5 6 7 ...

13194 Commits