langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-10 13:27:36 +00:00

Author	SHA1	Message	Date
Brandon Luu	bbbd4e1db8	docs: Update VectorStoreTab vector store initializations (#30413 ) Description: Update vector store tab inits to match either the docs or api_ref (whichever was more comprehensive) List of changes per vector stores: - In-memory - no change - AstraDB - match to docs - docs/api_refs match (excluding embeddings) - Chroma - match to docs - api_refs is less descriptive - FAISS - match to docs - docs/api_refs match (excluding embeddings) - Milvus - match to docs to use Milvus Lite with Flat index - api_refs does not have index_param for generalization - MongoDB - match to docs - api_refs are sparser - PGVector - match to api_ref - changed to include docker cmd directly in code - docs/api_ref has comment to view docker command in separate code block - Pinecone - match to api_refs - docs have code dispersed - Qdrant - match to api_ref - docs has size=3072, api_ref has size=1536 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-22 17:29:45 -04:00
Matthew Farrellee	e7032901c3	langchain-tests: allow test_serdes for packages outside the default valid namespaces (#30343 ) Description: a third party package not listed in the default valid namespaces cannot pass test_serdes because the load() does not allow for extending the valid_namespaces. test_serdes will fail with - ValueError: Invalid namespace: {'lc': 1, 'type': 'constructor', 'id': ['langchain_other', 'chat_models', 'ChatOther'], 'kwargs': {'model_name': '...', 'api_key': '...'}, 'name': 'ChatOther'} this change has test_serdes automatically extend valid_namespaces based off the ChatModel under test's namespace.	2025-03-22 17:27:39 -04:00
Jiwon Kang	699475a01d	community: uuidv1 is unsafe (#30432 ) this_row_id previously used UUID v1. However, since UUID v1 can be predicted if the MAC address and timestamp are known, it poses a potential security risk. Therefore, it has been changed to UUID v4.	2025-03-22 15:27:49 -04:00
Dhruvajyoti Sarma	31551dab40	feature: added warning when duckdb is used as a vectorstore without pandas (#30435 ) Displays a warning when DuckDB is used as a vector store without pandas installed, since pandas is currently used by the `similarity_search` function to process results. - Issue: #29933 - Dependencies: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-22 19:27:21 +00:00
ccurme	e81b82ee0b	docs: update cassettes (#30434 ) Following updates to `draw_mermaid_png`	2025-03-22 12:57:36 -04:00
ccurme	6484635ac3	docs: update cassettes for response metadata guide (#30431 ) As of langchain-groq 0.3 ChatGroq requires a model name. Also update other models.	2025-03-22 07:52:08 -04:00
Cesar Sanz	5383abfeee	Fix incorrect import path for AzureAIChatCompletionsModel (#30417 ) Fixes #30416 Correct the import path for `AzureAIChatCompletionsModel` in the `_init_chat_model_helper` function. * Update the import statement in `libs/langchain/langchain/chat_models/base.py` to `from langchain_azure_ai.chat_models import AzureAIChatCompletionsModel`. --- For more details, open the [Copilot Workspace session](https://copilot-workspace.githubnext.com/langchain-ai/langchain/pull/30417?shareId=6ff6d5de-e3d1-4972-8d24-5e74838e9945).	2025-03-22 07:44:51 -04:00
Misakar	7750ad588b	community: ChatLiteLLM support output reasoning content (#30430 )	2025-03-22 07:43:33 -04:00
Adrián Panella	b75573e858	core: add tool_call exclusion in filter_message (#30289 ) Extend functionality to allow filtering pairs of tool calls (ai + tool). --------- Co-authored-by: vbarda <vadym@langchain.dev>	2025-03-21 23:05:29 +00:00
Vadym Barda	673ec00030	docs[patch]: add warning to token counter docstring (#30426 )	2025-03-21 18:59:40 -04:00
Adrián Panella	3933a4abc3	core(mermaid): allow greater customization (#29939 ) Adds greater style customization by allowing a custom frontmatter config. This allows to set a `theme` and `look` or to adjust theme by setting `themeVariables` Example: ```python node_colors = NodeStyles( default="fill:#e2e2e2,line-height:1.2,stroke:#616161", first="fill:#cfeab8,fill-opacity:0", last="fill:#eac3b8", ) frontmatter_config = { "config": { "theme": "neutral", "look": "handDrawn" } } graph.get_graph().draw_mermaid_png(node_colors=node_colors, frontmatter_config=frontmatter_config) ``` ![image](https://github.com/user-attachments/assets/11b56d30-3be2-482f-8432-3ce704a09552) --------- Co-authored-by: vbarda <vadym@langchain.dev>	2025-03-21 18:25:26 -04:00
Vadym Barda	07823cd41c	core[patch]: optimize trim_messages (#30327 ) Refactored w/ Claude. Up to 20x speedup! (with theoretical max improvement of `O(n / log n)`)	2025-03-21 17:08:26 -04:00
ccurme	b78ae7817e	openai[patch]: trace strict in structured_output_kwargs (#30425 )	2025-03-21 14:37:28 -04:00
axiangcoding	428de88398	docs: Update a note about how to track azure openai's token usage when streaming (#30409 ) - Description: Update a note about how to track azure openai's token usage when streaming - Issue: #30390 - Dependencies: None - Twitter handle: None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 14:18:50 -04:00
ccurme	1de7fa8f3a	Revert "deepseek: temporarily bypass tests" (#30424 ) Reverts langchain-ai/langchain#30423	2025-03-21 17:14:31 +00:00
ccurme	c74dfff836	deepseek: temporarily bypass tests (#30423 ) Deepseek infra is not stable enough to get through integration tests. Previous two attempts had two tests time out, they both pass locally.	2025-03-21 17:08:35 +00:00
ccurme	7147903724	deepseek: release 0.1.3 (#30422 )	2025-03-21 16:39:50 +00:00
Andras L Ferenczi	b5f49df86a	partner: ChatDeepSeek on openrouter not returning reasoning (#30240 ) Deepseek model does not return reasoning when hosted on openrouter (Issue [30067](https://github.com/langchain-ai/langchain/issues/30067)) the following code did not return reasoning: ```python llm = ChatDeepSeek( model = 'deepseek/deepseek-r1:nitro', api_base="https://openrouter.ai/api/v1", api_key=os.getenv("OPENROUTER_API_KEY")) messages = [ {"role": "system", "content": "You are an assistant."}, {"role": "user", "content": "9.11 and 9.8, which is greater? Explain the reasoning behind this decision."} ] response = llm.invoke(messages, extra_body={"include_reasoning": True}) print(response.content) print(f"REASONING: {response.additional_kwargs.get('reasoning_content', '')}") print(response) ``` The fix is to extract reasoning from response.choices[0].message["model_extra"] and from choices[0].delta["reasoning"]. and place in response additional_kwargs. Change is really just the addition of a couple one-sentence if statements. --------- Co-authored-by: andrasfe <andrasf94@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 16:35:37 +00:00
Vadym Barda	4852ab8d0a	core[patch]: more tests for trim_messages (#30421 )	2025-03-21 16:19:52 +00:00
ccurme	e8e3b2bfae	ollama: release 0.3.0 (#30420 )	2025-03-21 15:50:08 +00:00
Jojo	8f300740ed	docs: fix several typos in docs/docs/how_to/split_html.ipynb (#30407 ) Fix several typos in docs/docs/how_to/split_html.ipynb * `structered` should be `structured` * `signifcant` should be `significant` * `seperator` should be `separator`	2025-03-21 11:46:26 -04:00
Jojo	c77ee99980	docs: fix typo in chat_history.ipynb (#30406 ) `peristence` should be `persistence`	2025-03-21 11:45:52 -04:00
Jojo	f657b19a24	docs: Fix typo in chat_history.ipynb (#30405 ) `repsonse` should be `response`	2025-03-21 11:45:31 -04:00
Bob Merkus	5700646cc5	ollama: add reasoning model support (e.g. deepseek) (#29689 ) # Description This PR adds reasoning model support for `langchain-ollama` by extracting reasoning token blocks, like those used in deepseek. It was inspired by [ollama-deep-researcher](https://github.com/langchain-ai/ollama-deep-researcher), specifically the parsing of [thinking blocks](`6d1aaf2139/src/assistant/graph.py (L91)`): ```python # TODO: This is a hack to remove the <think> tags w/ Deepseek models # It appears very challenging to prompt them out of the responses while "<think>" in running_summary and "</think>" in running_summary: start = running_summary.find("<think>") end = running_summary.find("</think>") + len("</think>") running_summary = running_summary[:start] + running_summary[end:] ``` This notes that it is very hard to remove the reasoning block from prompting, but we actually want the model to reason in order to increase model performance. This implementation extracts the thinking block, so the client can still expect a proper message to be returned by `ChatOllama` (and use the reasoning content separately when desired). This implementation takes the same approach as [ChatDeepseek](`5d581ba22c/libs/partners/deepseek/langchain_deepseek/chat_models.py (L215)`), which adds the reasoning content to chunk.additional_kwargs.reasoning_content; ```python if hasattr(response.choices[0].message, "reasoning_content"): # type: ignore rtn.generations[0].message.additional_kwargs["reasoning_content"] = ( response.choices[0].message.reasoning_content # type: ignore ) ``` This should probably be handled upstream in ollama + ollama-python, but this seems like a reasonably effective solution. 
This is a standalone example of what is happening: ```python async def deepseek_message_astream( llm: BaseChatModel, messages: list[BaseMessage], config: RunnableConfig | None = None, *, model_target: str = "deepseek-r1", **kwargs: Any, ) -> AsyncIterator[BaseMessageChunk]: """Stream responses from Deepseek models, filtering out <think> tags. Args: llm: The language model to stream from messages: The messages to send to the model Yields: Filtered chunks from the model response """ # check if the model is deepseek based if (llm.name and model_target not in llm.name) or (hasattr(llm, "model") and model_target not in llm.model): async for chunk in llm.astream(messages, config=config, **kwargs): yield chunk return # Yield with a buffer, upon completing the <think></think> tags, move them to the reasoning content and start over buffer = "" async for chunk in llm.astream(messages, config=config, **kwargs): # start or append if not buffer: buffer = chunk.content else: buffer += chunk.content if hasattr(chunk, "content") else chunk # Process buffer to remove <think> tags if "<think>" in buffer or "</think>" in buffer: if hasattr(chunk, "tool_calls") and chunk.tool_calls: raise NotImplementedError("tool calls during reasoning should be removed?") if "<think>" in chunk.content or "</think>" in chunk.content: continue chunk.additional_kwargs["reasoning_content"] = chunk.content chunk.content = "" # upon block completion, reset the buffer if "<think>" in buffer and "</think>" in buffer: buffer = "" yield chunk ``` # Issue Integrating reasoning models (e.g. deepseek-r1) into existing LangChain based workflows is hard due to the thinking blocks that are included in the message contents. To avoid this, we could match the `ChatOllama` integration with `ChatDeepseek` to return the reasoning content inside `message.additional_kwargs["reasoning_content"]` instead. # Dependencies None --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-21 15:44:54 +00:00
ccurme	d8145dda95	xai: release 0.2.2 (#30403 )	2025-03-20 20:25:16 +00:00
ccurme	e194902994	mistral: release 0.2.9 (#30402 )	2025-03-20 20:22:24 +00:00
ccurme	49466ec9ca	groq: release 0.3.1 (#30401 )	2025-03-20 20:19:49 +00:00
ccurme	db1e340387	fireworks: release 0.2.8 (#30400 )	2025-03-20 16:15:51 -04:00
ccurme	238f7fb345	docs: add links in Writer provider page (#30399 )	2025-03-20 16:13:48 -04:00
ccurme	785a8e7d45	tests: release 0.3.15 (#30397 )	2025-03-20 15:38:40 -04:00
ccurme	5588ca4cfb	core: release 0.3.47 (#30396 )	2025-03-20 18:52:53 +00:00
ccurme	de3960d285	multiple: enforce standards on tool_choice (#30372 ) - Test if models support forcing tool calls via `tool_choice`. If they do, they should support - `"any"` to specify any tool - the tool name as a string to force calling a particular tool - Add `tool_choice` to signature of `BaseChatModel.bind_tools` in core - Deprecate `tool_choice_value` in standard tests in favor of a boolean `has_tool_choice` Will follow up with PRs in external repos (tested in AWS and Google already).	2025-03-20 17:48:59 +00:00
ccurme	b86cd8270c	multiple: support `strict` and `method` in with_structured_output (#30385 )	2025-03-20 13:17:07 -04:00
Mohammad Mohtashim	1103bdfaf1	(Ollama) Fix String Value parsing in _parse_arguments_from_tool_call (#30154 ) - Description: Fix String Value parsing in _parse_arguments_from_tool_call - Issue: #30145 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-19 21:47:18 -04:00
Daniel Liden	c0ffc9aa29	Update MLflow integration docs with concise examples and external links (#30082 ) - Description: This PR updates the [MLflow integration](https://python.langchain.com/docs/integrations/providers/mlflow_tracking/) docs. This PR is based on feedback and suggestions from @efriis on #29612 . This proposed revision is much shorter, does not contain images, and links out to the MLflow docs rather than providing lengthy descriptions directly within these docs. Thank you for taking another look! - Issue: NA - Dependencies: NA --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-20 00:25:10 +00:00
Tim König	b5992695ae	community: add ZoteroRetriever (#30270 ) Description This contribution adds a retriever for the Zotero API. [Zotero](https://www.zotero.org/) is an open source reference management for bibliographic data and related research materials. A retriever will allow langchain applications to retrieve relevant documents from personal or shared group libraries, which I believe will be helpful for numerous applications, such as RAG systems, personal research assistants, etc. Tests and docs were added. The documentation provided assumes the retriever will be part of the langchain-community package, as this seemed customary. Please let me know if this is not the preferred way to do it. I also uploaded the implementation to PyPI. Dependencies The retriever requires the `pyzotero` package for API access. This dependency is stated in the docs, and the retriever will return an error if the package is not found. However, this dependency is not added to the langchain package itself. Twitter handle I'm no longer using Twitter, but I'd appreciate a shoutout on [Bluesky](https://bsky.app/profile/koenigt.bsky.social) or [LinkedIn](https://www.linkedin.com/in/dr-tim-k%C3%B6nig-534aa2324/)! Let me know if there are any issues, I'll gladly try and sort them out! --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-19 20:19:32 -04:00
ccurme	aa5ac9279a	docs: update tavily guides (#30387 ) AgentExecutor -> langgraph	2025-03-19 19:29:57 -04:00
pulvedu	4346aca5cf	Integration update (#30381 ) This pull request includes a change to the following - docs/docs/integrations/tools/tavily_search.ipynb - docs/docs/integrations/tools/tavily_extract.ipynb - added docs/docs/integrations/providers/tavily.mdx --------- Co-authored-by: pulvedu <dustin@tavily.com>	2025-03-19 17:58:25 -04:00
Daniel Rauber	9b687d7fbd	community[minor]: PlaywrightURLLoader can take stored session file (#30152 ) Description: Implements an additional `browser_session` parameter on PlaywrightURLLoader which can be used to initialize the browser context by providing a stored playwright context.	2025-03-19 16:29:07 -04:00
Ikko Eltociear Ashimine	bffa530816	docs: update contextual.ipynb (#30384 ) intialize -> initialize	2025-03-19 15:48:58 -04:00
Yeonseolee	65b16d3200	Docs: Fix deprecated initialize agent in ainetwork (#30355 ) ## Description - Replaced `initialize_agent`, `AgentType` usage in ainetwork integration - Updated usage example to `create_react_agent` in langgraph ## Issue - #29277 ## Dependencies - N/A ## Twitter handler - I don't use Twitter	2025-03-19 15:20:21 -04:00
Vadym Barda	73c04f4707	core[patch]: release 0.3.46 (#30383 )	2025-03-19 15:09:08 -04:00
William FH	ce84f8ba7e	Dereference run tree (#30377 )	2025-03-19 19:05:06 +00:00
William FH	8265be4d3e	Unset context to None in var (#30380 )	2025-03-19 18:53:17 +00:00
William FH	4130e6476b	Unset context after step (#30378 ) While we are already careful to copy before setting the config, if other objects hold a reference to the config or context, it wouldn't be cleared.	2025-03-19 11:46:23 -07:00
Vadym Barda	37190881d3	core[patch]: add util for approximate token counting (#30373 )	2025-03-19 17:48:38 +00:00
Brandon Luu	5ede4248ef	docs: Update Vector Store docs formatting (#30359 ) Description: Fix formatting in Vector Stores docs. - astradb: fix API ref spacing - milvus, pgvector, pinecone, qdrant: removed % in cmds for docs consistency - pgvector: removed redundant code and reorganized imports --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-19 15:54:18 +00:00
Matthew Farrellee	5f812f5968	langchain-tests: skip instead of passing image message tests (#30375 ) Description: use skip for image message tests	2025-03-19 15:35:32 +00:00
ccurme	aae8306d6c	groq: release 0.3.0 (#30374 )	2025-03-19 15:23:30 +00:00
Ashwin	83cfb9691f	Fix typo: change 'ben' to 'be' in comment (#30358 ) Description: This PR fixes a minor typo in the comments within `libs/partners/openai/langchain_openai/chat_models/base.py`. The word "ben" has been corrected to "be" for clarity and professionalism. Issue: N/A Dependencies: None	2025-03-19 10:35:35 -04:00
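
Note on 5700646cc5 (ollama: add reasoning model support): the commit message describes moving `<think>...</think>` blocks out of the message content and into a separate reasoning field. For a complete, non-streaming string, that idea can be sketched roughly as follows. This is only an illustration using the standard library; `split_reasoning` is a hypothetical helper name, not part of `langchain-ollama`, and the actual integration works on streamed chunks.

```python
import re

# Matches one <think>...</think> block, including newlines inside it.
THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)


def split_reasoning(text: str) -> tuple[str, str]:
    """Return (content, reasoning) for a complete model response.

    Hypothetical helper for illustration only: all <think> blocks are
    joined into the reasoning string and removed from the content.
    """
    reasoning = "\n".join(block.strip() for block in THINK_RE.findall(text))
    content = THINK_RE.sub("", text).strip()
    return content, reasoning


raw = "<think>compare 9.11 and 9.8</think>9.8 is greater."
content, reasoning = split_reasoning(raw)
print(content)    # the visible answer, without the thinking block
print(reasoning)  # the text that would go to a separate reasoning field
```

A response with no `<think>` tags passes through unchanged with an empty reasoning string, which matches the commit's goal that clients still receive a proper message.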


12991 Commits
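
Note on 699475a01d (community: uuidv1 is unsafe): the reasoning in that commit message can be checked with Python's standard `uuid` module. A version 1 UUID carries a timestamp and a node value (typically derived from the MAC address) in its fields, while a version 4 UUID is generated from random bits. The sketch below is illustrative only; `make_row_id` is a hypothetical name and is not the function changed in the commit.

```python
import uuid


def make_row_id() -> str:
    """Hypothetical helper: a row id based on a random (version 4) UUID."""
    return str(uuid.uuid4())


# A version 1 UUID exposes its timestamp and node as readable fields,
# which is what makes it predictable if those inputs are known.
u1 = uuid.uuid1()
print("v1 version:", u1.version)
print("v1 node:", hex(u1.node))
print("v1 timestamp (100 ns intervals):", u1.time)

# A version 4 UUID has no such structure beyond the version/variant bits.
row_id = make_row_id()
print("v4 row id:", row_id, "version:", uuid.UUID(row_id).version)
```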