Compare commits


47 Commits

Author SHA1 Message Date
Erick Friis
e18fc8bc16 x 2024-10-11 12:34:55 -04:00
Sir Qasim
2a1029c53c Update chatbot.ipynb (#27243)
Async invocation:
removed the `:` from the end of line 441, because there is no structured block after it.

2024-10-10 18:03:10 +00:00
Eugene Yurtsev
5b9b8fe80f core[patch]: Ignore ASYNC110 to upgrade to newest ruff version (#27229)
Ignoring ASYNC110 with explanation
2024-10-09 11:25:58 -04:00
Vittorio Rigamonti
7da2efd9d3 community[minor]: VectorStore Infinispan. Adding TLS and authentication (#23522)
**Description**:
this PR enables VectorStore TLS and authentication (digest, basic) with
HTTP/2 for the Infinispan server. It is based on httpx.

Added docker-compose facilities for testing
Added documentation

**Dependencies:**
requires `pip install httpx[http2]` if HTTP2 is needed
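
The store's HTTP/2, TLS, and digest/basic authentication support rides on `httpx`. Below is a minimal sketch of those underlying client options; the endpoint URL, credentials, and certificate path are placeholders, and the exact `InfinispanVS` constructor arguments are not shown here:

```python
# Hedged sketch: the httpx features the Infinispan integration builds on.
# Requires `pip install "httpx[http2]"` for HTTP/2 support.
import httpx

client = httpx.Client(
    http2=True,                                 # negotiate HTTP/2 when the server supports it
    auth=httpx.DigestAuth("user", "changeme"),  # or httpx.BasicAuth(...)
    verify="/path/to/infinispan-ca.pem",        # TLS certificate verification
)
response = client.get("https://localhost:11222/rest/v2/caches")  # placeholder endpoint
print(response.status_code)
```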

**Twitter handle:**
https://twitter.com/infinispan
2024-10-09 10:51:39 -04:00
Luke Jang
ff925d2ddc docs: fixed broken API reference link for StructuredTool.from_function (#27181)
Fix broken API reference link for StructuredTool.from_function
2024-10-09 10:05:22 -04:00
Diao Zihao
4553573acb core[patch],langchain[patch],community[patch]: Bump version dependency of tenacity to >=8.1.0,!=8.4.0,<10 (#27201)
This should fix the compatibility issue with graphrag, as discussed in

- https://github.com/langchain-ai/langchain/discussions/25595

Here are the release notes for tenacity 9
(https://github.com/jd/tenacity/releases/tag/9.0.0)

---------

Signed-off-by: Zihao Diao <hi@ericdiao.com>
Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
2024-10-09 14:00:45 +00:00
Stefano Lottini
d05fdd97dd community: Cassandra Vector Store: extend metadata-related methods (#27078)
**Description:** this PR adds a set of methods to deal with metadata
associated with the vector store entries. These, while essential to the
Graph-related extension of the `Cassandra` vector store, are also useful
in themselves. These are (all come in their sync+async versions):

- `[a]delete_by_metadata_filter`
- `[a]replace_metadata`
- `[a]get_by_document_id`
- `[a]metadata_search`

Additionally, a `[a]similarity_search_with_embedding_id_by_vector`
method is introduced to better serve the store's internal workings (esp.
related to reranking logic).
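
A hedged sketch of how these methods might be used on an already-initialized `Cassandra` store; the argument names are illustrative guesses, not the reviewed signatures:

```python
# Hedged sketch: `vstore` is assumed to be a configured
# langchain_community.vectorstores.Cassandra instance; argument names are illustrative.

# Fetch entries whose metadata matches a filter.
docs = vstore.metadata_search(filter={"source": "handbook"}, n=5)

# Look a single document up by its id.
doc = vstore.get_by_document_id(document_id=docs[0].id)

# Overwrite the metadata of selected documents.
vstore.replace_metadata({doc.id: {"source": "handbook", "reviewed": True}})

# Remove every entry matching a metadata filter.
vstore.delete_by_metadata_filter(filter={"source": "obsolete"})
```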

**Issue:** no issue number, but now all Documents returned bear their
`.id` consistently (as a consequence of a slight refactoring in how the
raw entries read from DB are made back into `Document` instances).

**Dependencies:** (no new deps: packaging comes through langchain-core
already; `cassio` is now required to be version 0.1.10+)


**Add tests and docs**
Added integration tests for the relevant newly-introduced methods.
(Docs will be updated in a separate PR).

**Lint and test** Lint and (updated) test all pass.

---------

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-09 06:41:34 +00:00
Erick Friis
84c05b031d community: release 0.3.2 (#27214) 2024-10-08 23:33:55 -07:00
Serena Ruan
a7c1ce2b3f [community] Add timeout control and retry for UC tool execution (#26645)
Add a client-side timeout and retry logic for UCFunctionToolkit.
Users can set the environment variable
`UC_TOOL_CLIENT_EXECUTION_TIMEOUT` to increase how long the client keeps
retrying for the execution response while the status is pending. The default
timeout is 120s.
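
A minimal sketch of raising that timeout via the environment variable; the toolkit import path in the comment is an assumption:

```python
import os

# Hedged sketch: raise the client-side wait for pending UC tool executions
# from the default 120s to 300s. Set this before the toolkit runs a function.
os.environ["UC_TOOL_CLIENT_EXECUTION_TIMEOUT"] = "300"

# Assumed import path for the community integration:
# from langchain_community.tools.databricks import UCFunctionToolkit
```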



Tested in Databricks:
<img width="1200" alt="image"
src="https://github.com/user-attachments/assets/54ab5dfc-5e57-4941-b7d9-bfe3f8ad3f62">




---------

Signed-off-by: serena-ruan <serena.rxy@gmail.com>
Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-09 06:31:48 +00:00
Tomaz Bratanic
481bd25d29 community: Fix database connections for neo4j (#27190)
Fixes https://github.com/langchain-ai/langchain/issues/27185

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 23:47:55 +00:00
Erick Friis
cedf4d9462 langchain: release 0.3.3 (#27213) 2024-10-08 16:39:42 -07:00
Jorge Piedrahita Ortiz
6c33124c72 docs: minor fix sambastudio chat model docs (#27212)
- **Description:**  minor fix sambastudio chat model docs

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 23:34:29 +00:00
Erick Friis
7264fb254c core: release 0.3.10 (#27209) 2024-10-08 16:21:42 -07:00
Bagatur
ce33c4fa40 openai[patch]: default temp=1 for o1 (#27206) 2024-10-08 15:45:21 -07:00
Mateusz Szewczyk
b298d0337e docs: Update IBM ChatWatsonx documentation (#27189) 2024-10-08 21:10:18 +00:00
RIdham Golakiya
73ad7f2e7a langchain_chroma[patch]: updated example for get documents with where clause (#26767)
Example updated for the ChromaDB vector store.

If you want to apply multiple filters, ChromaDB supports combining them; see the sketch below.
Reference: [ChromaDB filters](https://cookbook.chromadb.dev/core/filters/)
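
A hedged sketch of what such a combined filter can look like with `langchain_chroma`; the collection name, embeddings class, and metadata keys are made up for illustration:

```python
from langchain_chroma import Chroma
from langchain_openai import OpenAIEmbeddings  # any Embeddings implementation works

vectorstore = Chroma(
    collection_name="example_collection",
    embedding_function=OpenAIEmbeddings(),
)

# Chroma's `$and` operator combines several metadata conditions in one `where` clause.
results = vectorstore.get(
    where={
        "$and": [
            {"source": {"$eq": "tweet"}},
            {"year": {"$gte": 2023}},
        ]
    },
    limit=10,
)
print(results["documents"])
```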

Thank you.
2024-10-08 20:21:58 +00:00
Bagatur
e3e9ee8398 core[patch]: utils for adding/subtracting usage metadata (#27203) 2024-10-08 13:15:33 -07:00
ccurme
e3920f2320 community[patch]: fix structured_output in llamacpp integration (#27202)
Resolves https://github.com/langchain-ai/langchain/issues/25318.
2024-10-08 15:16:59 -04:00
Leonid Ganeline
c3cb56a9e8 docs: integrations updates 18 (#27054)
Added missing provider pages. Added descriptions and links. Fixed
inconsistency in text formatting.

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 19:05:07 +00:00
Leonid Ganeline
b716d808ba docs: integrations/providers/microsoft update (#27055)
Added reference to the AzureCognitiveServicesToolkit.
Fixed titles.

---------

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 19:04:40 +00:00
Mathias Colpaert
feb4be82aa docs: in chatbot tutorial, make docs consistent with code sample (#27042)
**Docs Chatbot Tutorial**

The docs state that you can omit the language parameter, but the code
sample meant to demonstrate this still contains it.

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 18:38:15 +00:00
Ikko Eltociear Ashimine
c10e1f70fe docs: update passio_nutrition_ai.ipynb (#27041)
initalize -> initialize



Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 18:35:48 +00:00
Erick Friis
b84e00283f standard-tests: test that only one chunk sets input_tokens (#27177) 2024-10-08 11:35:32 -07:00
Ajayeswar Reddy
9b7bdf1a26 Fixed typo in libs/community/langchain_community/storage/sql.py (#27029)
- [ ] **PR title**: docs: fix typo in SQLStore import path

- [ ] **PR message**: 
- **Description:** This PR corrects a typo in the docstrings for the
class `SQLStore(BaseStore[str, bytes])`. The import path in the docstring
currently reads `from langchain_rag.storage import SQLStore`; it should
be `from langchain_community.storage import SQLStore`. This typo is
also reflected in the official documentation.
    - **Issue:** N/A
    - **Dependencies:** None
    - **Twitter handle:** N/A

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 17:51:26 +00:00
Nihal Chaudhary
0b36ed09cf DOC:Changed /docs/integrations/tools/jira/ (#27023)
- [x] - **Description:** replaced `%pip install -qU langchain-community`
with `%pip install -qU langchain-community langchain_openai` in
`docs/docs/integrations/tools/jira.ipynb`
- [x] - **Issue:** the issue #27013 
- [x] Add docs

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 17:48:08 +00:00
Jacob Lee
0ec74fbc14 docs: 👥 Update LangChain people data (#27022)
👥 Update LangChain people data

---------

Co-authored-by: github-actions <github-actions@github.com>
Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 17:09:07 +00:00
Leonid Ganeline
ea9a59bcf5 docs: integrations updates 17 (#27015)
Added missing provider pages. Added missing descriptions and links.
I fixed the Ipex-LLM titles, so the ToC is now sorted properly for these
titles.

---------

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-08 17:03:18 +00:00
Vadym Barda
8d27325dbc core[patch]: support ValidationError from pydantic v1 in tools (#27194) 2024-10-08 10:19:04 -04:00
Christophe Bornet
16f5fdb38b core: Add various ruff rules (#26836)
Adds
- ASYNC
- COM
- DJ
- EXE
- FLY
- FURB
- ICN
- INT
- LOG
- NPY
- PD
- Q
- RSE
- SLOT
- T10
- TID
- YTT

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-07 22:30:27 +00:00
Erick Friis
5c826faece core: update make format to fix all autofixable things (#27174) 2024-10-07 15:20:47 -07:00
Christophe Bornet
d31ec8810a core: Add ruff rules for error messages (EM) (#26965)
All auto-fixes

Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-07 22:12:28 +00:00
Oleksii Pokotylo
37ca468d03 community: AzureSearch: fix reranking for empty lists (#27104)
**Description:** 
  Fix reranking for empty lists 

**Issue:** 
```
  File "langchain_community/vectorstores/azuresearch.py", line 1680, in _reorder_results_with_maximal_marginal_relevance
    documents, scores, vectors = map(list, zip(*docs))
ValueError: not enough values to unpack (expected 3, got 0)
```
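
A hedged sketch of the kind of guard that avoids this unpack error when no documents come back; illustrative only, not the exact patch that was merged:

```python
def reorder_with_mmr(docs):
    """Hedged sketch: guard the unpack that failed on empty result lists."""
    if not docs:
        # Previously zip(*docs) on an empty list raised:
        # ValueError: not enough values to unpack (expected 3, got 0)
        return []
    documents, scores, vectors = map(list, zip(*docs))
    # ... maximal-marginal-relevance re-ranking over `vectors` would follow here ...
    return documents
```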

Co-authored-by: Oleksii Pokotylo <oleksii.pokotylo@pwc.com>
2024-10-07 15:27:09 -04:00
Bhadresh Savani
8454a742d7 Update README.md for Tutorial to Usecase url (#27099)
Fixed the tutorial URL: it previously pointed to the use-case page, which
has no detail; it now points to the correct tutorials page.
2024-10-07 15:24:33 -04:00
Christophe Bornet
c4ebccfec2 core[minor]: Improve support for id in VectorStore (#26660)
Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
2024-10-07 15:01:08 -04:00
Bharat Ramanathan
931ce8d026 core[patch]: Update AsyncCallbackManager to honor run_inline attribute and prevent context loss (#26885)
## Description

This PR fixes the context loss issue in `AsyncCallbackManager`,
specifically in `on_llm_start` and `on_chat_model_start` methods. It
properly honors the `run_inline` attribute of callback handlers,
preventing race conditions and ordering issues.

Key changes:
1. Separate handlers into inline and non-inline groups.
2. Execute inline handlers sequentially for each prompt.
3. Execute non-inline handlers concurrently across all prompts.
4. Preserve context for stateful handlers.
5. Maintain performance benefits for non-inline handlers.

**These changes are implemented in `AsyncCallbackManager` rather than
`ahandle_event` because the issue occurs at the prompt and message_list
levels, not within individual events.**
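
A hedged, stripped-down sketch of the dispatch pattern described above — sequential awaits for `run_inline` handlers, a concurrent `asyncio.gather` for the rest; illustrative only, not the code merged into `AsyncCallbackManager`:

```python
import asyncio

async def dispatch_event(handlers, event_name, *args, **kwargs):
    """Hedged sketch of honoring `run_inline` when fanning out callback events."""
    inline = [h for h in handlers if getattr(h, "run_inline", False)]
    concurrent = [h for h in handlers if not getattr(h, "run_inline", False)]

    # Inline handlers run one after another, preserving ordering and context.
    for handler in inline:
        await getattr(handler, event_name)(*args, **kwargs)

    # Everything else keeps the old concurrent fan-out.
    await asyncio.gather(
        *(getattr(h, event_name)(*args, **kwargs) for h in concurrent)
    )
```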

## Testing

- Test case implemented in #26857 now passes, verifying execution order
for inline handlers.

## Related Issues

- Fixes issue discussed in #23909 

## Dependencies

No new dependencies are required.

---

@eyurtsev: This PR implements the discussed changes to respect
`run_inline` in `AsyncCallbackManager`. Please review and advise on any
needed changes.

Twitter handle: @parambharat

---------

Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
2024-10-07 14:59:29 -04:00
Aleksandar Petrov
c61b9daef5 docs: Grammar fix in concepts.mdx (#27149)
Missing "is" in a sentence about the Tool usage.
2024-10-07 18:55:25 +00:00
Eugene Yurtsev
8f8392137a Update MIGRATE.md (#27169)
Update the content of MIGRATE.md so it's in sync with the current migration guides.
2024-10-07 14:53:40 -04:00
João Carlos Ferra de Almeida
780ce00dea core[minor]: add **kwargs to index and aindex functions for custom vector_field support (#26998)
Added `**kwargs` parameters to the `index` and `aindex` functions in
`libs/core/langchain_core/indexing/api.py`. This allows users to pass
additional arguments to the `add_documents` and `aadd_documents`
methods, enabling the specification of a custom `vector_field`. For
example, users can now use `vector_field="embedding"` when indexing
documents in `OpenSearchVectorStore`.
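
A hedged sketch of forwarding such a keyword through `index()`; the record manager, vector store, and document list are assumed to exist, and the `vector_field` value is illustrative:

```python
from langchain_core.indexing import index  # aindex is the async counterpart

# Assumes `docs`, `record_manager`, and `vector_store` (e.g. an OpenSearch-backed
# store) are already configured elsewhere.
result = index(
    docs,
    record_manager,
    vector_store,
    cleanup="incremental",
    source_id_key="source",
    vector_field="embedding",  # forwarded via **kwargs to add_documents / aadd_documents
)
print(result)  # counts of added / updated / skipped / deleted documents
```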

---------

Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
2024-10-07 14:52:50 -04:00
Jorge Piedrahita Ortiz
14de81b140 community: sambastudio chat model (#27056)
**Description:** SambaStudio chat model integration added; previously only
the LLM integration existed. Docs and tests are included.

---------

Co-authored-by: luisfucros <luisfucros@gmail.com>
Co-authored-by: Chester Curme <chester.curme@gmail.com>
2024-10-07 14:31:39 -04:00
Aditya Anand
f70650f67d core[patch]: correct typo doc-string for astream_events method (#27108)
This commit addresses a typographical error in the documentation for the
async astream_events method. The word 'evens' was incorrectly used in
the introductory sentence for the reference table, which could lead to
confusion for users.

### Changes Made:
- Corrected 'Below is a table that illustrates some evens that might be emitted by various chains.' to 'Below is a table that illustrates some events that might be emitted by various chains.'

This enhancement improves the clarity of the documentation and ensures
accurate terminology is used throughout the reference material.

Issue Reference: #27107
2024-10-07 14:12:42 -04:00
Bagatur
38099800cc docs: fix anthropic max_tokens docstring (#27166) 2024-10-07 16:51:42 +00:00
ogawa
07dd8dd3d7 community[patch]: update gpt-4o cost (#27038)
updated OpenAI cost definition according to the following:
https://openai.com/api/pricing/
2024-10-07 09:06:30 -04:00
Averi Kitsch
7a07196df6 docs: update Google Spanner Vector Store documentation (#27124)
    - **Description:** Update Spanner VS integration doc
    - **Issue:** None
    - **Dependencies:** None
    - **Twitter handle:** NA



Co-authored-by: Erick Friis <erick@langchain.dev>
2024-10-04 23:59:10 +00:00
Bagatur
06ce5d1d5c anthropic[patch]: Release 0.2.3 (#27126) 2024-10-04 22:38:03 +00:00
Bagatur
0b8416bd2e anthropic[patch]: fix input_tokens when cached (#27125) 2024-10-04 22:35:51 +00:00
Erick Friis
64a16f2cf0 infra: add nvidia and astradb back to api build (#27115)
test build
https://github.com/langchain-ai/langchain/actions/runs/11185115845
2024-10-04 14:41:41 -07:00
Bagatur
bd5b335cb4 standard-tests[patch]: fix oai usage metadata test (#27122) 2024-10-04 20:00:48 +00:00
176 changed files with 6089 additions and 2580 deletions

View File

@@ -101,8 +101,8 @@ jobs:
mv langchain-google/libs/genai langchain/libs/partners/google-genai
mv langchain-google/libs/vertexai langchain/libs/partners/google-vertexai
mv langchain-google/libs/community langchain/libs/partners/google-community
# mv langchain-datastax/libs/astradb langchain/libs/partners/astradb
# mv langchain-nvidia/libs/ai-endpoints langchain/libs/partners/nvidia-ai-endpoints
mv langchain-datastax/libs/astradb langchain/libs/partners/astradb
mv langchain-nvidia/libs/ai-endpoints langchain/libs/partners/nvidia-ai-endpoints
mv langchain-cohere/libs/cohere langchain/libs/partners/cohere
mv langchain-elastic/libs/elasticsearch langchain/libs/partners/elasticsearch
mv langchain-postgres langchain/libs/partners/postgres

View File

@@ -1,70 +1,11 @@
# Migrating
## 🚨Breaking Changes for select chains (SQLDatabase) on 7/28/23
Please see the following guides for migrating LangChain code:
In an effort to make `langchain` leaner and safer, we are moving select chains to `langchain_experimental`.
This migration has already started, but we are remaining backwards compatible until 7/28.
On that date, we will remove functionality from `langchain`.
Read more about the motivation and the progress [here](https://github.com/langchain-ai/langchain/discussions/8043).
* Migrate to [LangChain v0.3](https://python.langchain.com/docs/versions/v0_3/)
* Migrate to [LangChain v0.2](https://python.langchain.com/docs/versions/v0_2/)
* Migrating from [LangChain 0.0.x Chains](https://python.langchain.com/docs/versions/migrating_chains/)
* Upgrade to [LangGraph Memory](https://python.langchain.com/docs/versions/migrating_memory/)
### Migrating to `langchain_experimental`
We are moving any experimental components of LangChain, or components with vulnerability issues, into `langchain_experimental`.
This guide covers how to migrate.
### Installation
Previously:
`pip install -U langchain`
Now (only if you want to access things in experimental):
`pip install -U langchain langchain_experimental`
### Things in `langchain.experimental`
Previously:
`from langchain.experimental import ...`
Now:
`from langchain_experimental import ...`
### PALChain
Previously:
`from langchain.chains import PALChain`
Now:
`from langchain_experimental.pal_chain import PALChain`
### SQLDatabaseChain
Previously:
`from langchain.chains import SQLDatabaseChain`
Now:
`from langchain_experimental.sql import SQLDatabaseChain`
Alternatively, if you are just interested in using the query generation part of the SQL chain, you can check out this [`SQL question-answering tutorial`](https://python.langchain.com/v0.2/docs/tutorials/sql_qa/#convert-question-to-sql-query)
`from langchain.chains import create_sql_query_chain`
### `load_prompt` for Python files
Note: this only applies if you want to load Python files as prompts.
If you want to load json/yaml files, no change is needed.
Previously:
`from langchain.prompts import load_prompt`
Now:
`from langchain_experimental.prompts import load_prompt`
The [LangChain CLI](https://python.langchain.com/docs/versions/v0_3/#migrate-using-langchain-cli) can help automatically upgrade your code to use non deprecated imports.
This will be especially helpful if you're still on either version 0.0.x or 0.1.x of LangChain.

View File

@@ -119,7 +119,7 @@ Agents allow an LLM autonomy over how a task is accomplished. Agents make decisi
Please see [here](https://python.langchain.com) for full documentation, which includes:
- [Introduction](https://python.langchain.com/docs/introduction/): Overview of the framework and the structure of the docs.
- [Tutorials](https://python.langchain.com/docs/use_cases/): If you're looking to build something specific or are more of a hands-on learner, check out our tutorials. This is the best place to get started.
- [Tutorials](https://python.langchain.com/docs/tutorials/): If you're looking to build something specific or are more of a hands-on learner, check out our tutorials. This is the best place to get started.
- [How-to guides](https://python.langchain.com/docs/how_to/): Answers to “How do I….?” type questions. These guides are goal-oriented and concrete; they're meant to help you complete a specific task.
- [Conceptual guide](https://python.langchain.com/docs/concepts/): Conceptual explanations of the key parts of the framework.
- [API Reference](https://api.python.langchain.com): Thorough documentation of every class and method.

File diff suppressed because it is too large.

View File

@@ -611,7 +611,7 @@ Read more about [defining tools that return artifacts here](/docs/how_to/tool_ar
When designing tools to be used by a model, it is important to keep in mind that:
- Chat models that have explicit [tool-calling APIs](/docs/concepts/#functiontool-calling) will be better at tool calling than non-fine-tuned models.
- Models will perform better if the tools have well-chosen names, descriptions, and JSON schemas. This another form of prompt engineering.
- Models will perform better if the tools have well-chosen names, descriptions, and JSON schemas. This is another form of prompt engineering.
- Simple, narrowly scoped tools are easier for models to use than complex tools.
#### Related

View File

@@ -22,7 +22,7 @@
"2. LangChain [Runnables](/docs/concepts#runnable-interface);\n",
"3. By sub-classing from [BaseTool](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.BaseTool.html) -- This is the most flexible method, it provides the largest degree of control, at the expense of more effort and code.\n",
"\n",
"Creating tools from functions may be sufficient for most use cases, and can be done via a simple [@tool decorator](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.tool.html#langchain_core.tools.tool). If more configuration is needed-- e.g., specification of both sync and async implementations-- one can also use the [StructuredTool.from_function](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.StructuredTool.html#langchain_core.tools.StructuredTool.from_function) class method.\n",
"Creating tools from functions may be sufficient for most use cases, and can be done via a simple [@tool decorator](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.tool.html#langchain_core.tools.tool). If more configuration is needed-- e.g., specification of both sync and async implementations-- one can also use the [StructuredTool.from_function](https://python.langchain.com/api_reference/core/tools/langchain_core.tools.structured.StructuredTool.html#langchain_core.tools.structured.StructuredTool.from_function) class method.\n",
"\n",
"In this guide we provide an overview of these methods.\n",
"\n",

View File

@@ -36,7 +36,7 @@
"### Integration details\n",
"| Class | Package | Local | Serializable | [JS support](https://js.langchain.com/docs/integrations/chat/openai) | Package downloads | Package latest |\n",
"| :--- | :--- | :---: | :---: | :---: | :---: | :---: |\n",
"| ChatWatsonx | ❌ | ❌ | ❌ | ![PyPI - Downloads](https://img.shields.io/pypi/dm/langchain-ibm?style=flat-square&label=%20) | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-ibm?style=flat-square&label=%20) |\n",
"| ChatWatsonx | ❌ | ❌ | ❌ | ❌ | ![PyPI - Downloads](https://img.shields.io/pypi/dm/langchain-ibm?style=flat-square&label=%20) | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-ibm?style=flat-square&label=%20) |\n",
"\n",
"### Model features\n",
"| [Tool calling](/docs/how_to/tool_calling/) | [Structured output](/docs/how_to/structured_output/) | JSON mode | Image input | Audio input | Video input | [Token-level streaming](/docs/how_to/chat_streaming/) | Native async | [Token usage](/docs/how_to/chat_token_usage_tracking/) | [Logprobs](/docs/how_to/logprobs/) |\n",
@@ -126,21 +126,19 @@
"source": [
"## Instantiation\n",
"\n",
"You might need to adjust model `parameters` for different models or tasks. For details, refer to [Available MetaNames](https://ibm.github.io/watsonx-ai-python-sdk/fm_model.html#metanames.GenTextParamsMetaNames)."
"You might need to adjust model `parameters` for different models or tasks. For details, refer to [Available TextChatParameters](https://ibm.github.io/watsonx-ai-python-sdk/fm_schema.html#ibm_watsonx_ai.foundation_models.schema.TextChatParameters)."
]
},
{
"cell_type": "code",
"execution_count": 1,
"execution_count": 5,
"id": "407cd500",
"metadata": {},
"outputs": [],
"source": [
"parameters = {\n",
" \"decoding_method\": \"sample\",\n",
" \"max_new_tokens\": 100,\n",
" \"min_new_tokens\": 1,\n",
" \"stop_sequences\": [\".\"],\n",
" \"temperature\": 0.9,\n",
" \"max_tokens\": 200,\n",
"}"
]
},
@@ -160,20 +158,20 @@
"In this example, well use the `project_id` and Dallas URL.\n",
"\n",
"\n",
"You need to specify the `model_id` that will be used for inferencing. You can find the list of all the available models in [Supported foundation models](https://ibm.github.io/watsonx-ai-python-sdk/fm_model.html#ibm_watsonx_ai.foundation_models.utils.enums.ModelTypes)."
"You need to specify the `model_id` that will be used for inferencing. You can find the list of all the available models in [Supported chat models](https://ibm.github.io/watsonx-ai-python-sdk/fm_helpers.html#ibm_watsonx_ai.foundation_models_manager.FoundationModelsManager.get_chat_model_specs)."
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "98371396",
"id": "e3568e91",
"metadata": {},
"outputs": [],
"source": [
"from langchain_ibm import ChatWatsonx\n",
"\n",
"chat = ChatWatsonx(\n",
" model_id=\"ibm/granite-13b-chat-v2\",\n",
" model_id=\"ibm/granite-34b-code-instruct\",\n",
" url=\"https://us-south.ml.cloud.ibm.com\",\n",
" project_id=\"PASTE YOUR PROJECT_ID HERE\",\n",
" params=parameters,\n",
@@ -196,7 +194,7 @@
"outputs": [],
"source": [
"chat = ChatWatsonx(\n",
" model_id=\"ibm/granite-13b-chat-v2\",\n",
" model_id=\"ibm/granite-34b-code-instruct\",\n",
" url=\"PASTE YOUR URL HERE\",\n",
" username=\"PASTE YOUR USERNAME HERE\",\n",
" password=\"PASTE YOUR PASSWORD HERE\",\n",
@@ -242,17 +240,17 @@
},
{
"cell_type": "code",
"execution_count": 22,
"execution_count": 8,
"id": "beea2b5b",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=\"Je t'aime pour écouter la Rock.\", response_metadata={'token_usage': {'generated_token_count': 12, 'input_token_count': 28}, 'model_name': 'ibm/granite-13b-chat-v2', 'system_fingerprint': '', 'finish_reason': 'stop_sequence'}, id='run-05b305ce-5401-4a10-b557-41a4b15c7f6f-0')"
"AIMessage(content=\"J'adore que tu escois de écouter de la rock ! \", additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 19, 'prompt_tokens': 34, 'total_tokens': 53}, 'model_name': 'ibm/granite-34b-code-instruct', 'system_fingerprint': '', 'finish_reason': 'stop'}, id='chat-ef888fc41f0d4b37903b622250ff7528', usage_metadata={'input_tokens': 34, 'output_tokens': 19, 'total_tokens': 53})"
]
},
"execution_count": 22,
"execution_count": 8,
"metadata": {},
"output_type": "execute_result"
}
@@ -273,17 +271,17 @@
},
{
"cell_type": "code",
"execution_count": 41,
"execution_count": 9,
"id": "8ab1a25a",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='Sure, I can help you with that! Horses are large, powerful mammals that belong to the family Equidae.', response_metadata={'token_usage': {'generated_token_count': 24, 'input_token_count': 24}, 'model_name': 'ibm/granite-13b-chat-v2', 'system_fingerprint': '', 'finish_reason': 'stop_sequence'}, id='run-391776ff-3b38-4768-91e8-ff64177149e5-0')"
"AIMessage(content='horses are quadrupedal mammals that are members of the family Equidae. They are typically farm animals, competing in horse racing and other forms of equine competition. With over 200 breeds, horses are diverse in their physical appearance and behavior. They are intelligent, social animals that are often used for transportation, food, and entertainment.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 89, 'prompt_tokens': 29, 'total_tokens': 118}, 'model_name': 'ibm/granite-34b-code-instruct', 'system_fingerprint': '', 'finish_reason': 'stop'}, id='chat-9a6e28abb3d448aaa4f83b677a9fd653', usage_metadata={'input_tokens': 29, 'output_tokens': 89, 'total_tokens': 118})"
]
},
"execution_count": 41,
"execution_count": 9,
"metadata": {},
"output_type": "execute_result"
}
@@ -314,7 +312,7 @@
},
{
"cell_type": "code",
"execution_count": 17,
"execution_count": 10,
"id": "dd919925",
"metadata": {},
"outputs": [],
@@ -338,17 +336,17 @@
},
{
"cell_type": "code",
"execution_count": 18,
"execution_count": 11,
"id": "68160377",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='Ich liebe Python.', response_metadata={'token_usage': {'generated_token_count': 5, 'input_token_count': 23}, 'model_name': 'ibm/granite-13b-chat-v2', 'system_fingerprint': '', 'finish_reason': 'stop_sequence'}, id='run-1b1ccf5d-0e33-46f2-a087-e2a136ba1fb7-0')"
"AIMessage(content='Ich liebe Python.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 7, 'prompt_tokens': 28, 'total_tokens': 35}, 'model_name': 'ibm/granite-34b-code-instruct', 'system_fingerprint': '', 'finish_reason': 'stop'}, id='chat-fef871190b6047a7a3e68c58b3810c33', usage_metadata={'input_tokens': 28, 'output_tokens': 7, 'total_tokens': 35})"
]
},
"execution_count": 18,
"execution_count": 11,
"metadata": {},
"output_type": "execute_result"
}
@@ -376,7 +374,7 @@
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 12,
"id": "3f63166a",
"metadata": {},
"outputs": [
@@ -384,7 +382,7 @@
"name": "stdout",
"output_type": "stream",
"text": [
"The moon is a natural satellite of the Earth, and it has been a source of fascination for humans for centuries."
"The Moon is the fifth largest moon in the solar system and the largest relative to its host planet. It is the fifth brightest object in Earth's night sky after the Sun, the stars, the Milky Way, and the Moon itself. It orbits around the Earth at an average distance of 238,855 miles (384,400 kilometers). The Moon's gravity is about one-sixthth of Earth's and thus allows for the formation of tides on Earth. The Moon is thought to have formed around 4.5 billion years ago from debris from a collision between Earth and a Mars-sized body named Theia. The Moon is effectively immutable, with its current characteristics remaining from formation. Aside from Earth, the Moon is the only other natural satellite of Earth. The most widely accepted theory is that it formed from the debris of a collision"
]
}
],
@@ -410,18 +408,18 @@
},
{
"cell_type": "code",
"execution_count": 32,
"execution_count": 13,
"id": "9e948729",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[AIMessage(content='Cats are domestic animals that belong to the Felidae family.', response_metadata={'token_usage': {'generated_token_count': 13, 'input_token_count': 24}, 'model_name': 'ibm/granite-13b-chat-v2', 'system_fingerprint': '', 'finish_reason': 'stop_sequence'}, id='run-71a8bd7a-a1aa-497b-9bdd-a4d6fe1d471a-0'),\n",
" AIMessage(content='Dogs are domesticated mammals of the family Canidae, characterized by their adaptability to various environments and social structures.', response_metadata={'token_usage': {'generated_token_count': 24, 'input_token_count': 24}, 'model_name': 'ibm/granite-13b-chat-v2', 'system_fingerprint': '', 'finish_reason': 'stop_sequence'}, id='run-22b7a0cb-e44a-4b68-9921-872f82dcd82b-0')]"
"[AIMessage(content='The cat is a popular domesticated carnivorous mammal that belongs to the family Felidae. Cats arefriendly, intelligent, and independent animals that are well-known for their playful behavior, agility, and ability to hunt prey. cats come in a wide range of breeds, each with their own unique physical and behavioral characteristics. They are kept as pets worldwide due to their affectionate nature and companionship. Cats are important members of the household and are often involved in everything from childcare to entertainment.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 127, 'prompt_tokens': 28, 'total_tokens': 155}, 'model_name': 'ibm/granite-34b-code-instruct', 'system_fingerprint': '', 'finish_reason': 'stop'}, id='chat-fa452af0a0fa4a668b6a704aecd7d718', usage_metadata={'input_tokens': 28, 'output_tokens': 127, 'total_tokens': 155}),\n",
" AIMessage(content='Dogs are domesticated animals that belong to the Canidae family, also known as wolves. They are one of the most popular pets worldwide, known for their loyalty and affection towards their owners. Dogs come in various breeds, each with unique characteristics, and are trained for different purposes such as hunting, herding, or guarding. They require a lot of exercise and mental stimulation to stay healthy and happy, and they need proper training and socialization to be well-behaved. Dogs are also known for their playful and energetic nature, making them great companions for people of all ages.', additional_kwargs={}, response_metadata={'token_usage': {'completion_tokens': 144, 'prompt_tokens': 28, 'total_tokens': 172}, 'model_name': 'ibm/granite-34b-code-instruct', 'system_fingerprint': '', 'finish_reason': 'stop'}, id='chat-cae7663c50cf4f3499726821cc2f0ec7', usage_metadata={'input_tokens': 28, 'output_tokens': 144, 'total_tokens': 172})]"
]
},
"execution_count": 32,
"execution_count": 13,
"metadata": {},
"output_type": "execute_result"
}
@@ -452,9 +450,7 @@
"\n",
"### ChatWatsonx.bind_tools()\n",
"\n",
"Please note that `ChatWatsonx.bind_tools` is on beta state, so right now we only support `mistralai/mixtral-8x7b-instruct-v01` model.\n",
"\n",
"You should also redefine `max_new_tokens` parameter to get the entire model response. By default `max_new_tokens` is set to 20."
"Please note that `ChatWatsonx.bind_tools` is on beta state, so we recommend using `mistralai/mistral-large` model."
]
},
{
@@ -466,10 +462,8 @@
"source": [
"from langchain_ibm import ChatWatsonx\n",
"\n",
"parameters = {\"max_new_tokens\": 200}\n",
"\n",
"chat = ChatWatsonx(\n",
" model_id=\"mistralai/mixtral-8x7b-instruct-v01\",\n",
" model_id=\"mistralai/mistral-large\",\n",
" url=\"https://us-south.ml.cloud.ibm.com\",\n",
" project_id=\"PASTE YOUR PROJECT_ID HERE\",\n",
" params=parameters,\n",
@@ -478,7 +472,7 @@
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": 2,
"id": "e1633a73",
"metadata": {},
"outputs": [],
@@ -497,17 +491,17 @@
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 3,
"id": "3bf9b8ab",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='', additional_kwargs={'function_call': {'type': 'function'}, 'tool_calls': [{'type': 'function', 'function': {'name': 'GetWeather', 'arguments': '{\"location\": \"Los Angeles\"}'}, 'id': None}, {'type': 'function', 'function': {'name': 'GetWeather', 'arguments': '{\"location\": \"New York\"}'}, 'id': None}]}, response_metadata={'token_usage': {'generated_token_count': 99, 'input_token_count': 320}, 'model_name': 'mistralai/mixtral-8x7b-instruct-v01', 'system_fingerprint': '', 'finish_reason': 'eos_token'}, id='run-38627104-f2ac-4edb-8390-d5425fb65979-0', tool_calls=[{'name': 'GetWeather', 'args': {'location': 'Los Angeles'}, 'id': None}, {'name': 'GetWeather', 'args': {'location': 'New York'}, 'id': None}])"
"AIMessage(content='', additional_kwargs={'tool_calls': [{'id': 'chatcmpl-tool-6c06a19bbe824d78a322eb193dbde12d', 'type': 'function', 'function': {'name': 'GetWeather', 'arguments': '{\"location\": \"Los Angeles, CA\"}'}}, {'id': 'chatcmpl-tool-493542e46f1141bfbfeb5deae6c9e086', 'type': 'function', 'function': {'name': 'GetWeather', 'arguments': '{\"location\": \"New York, NY\"}'}}]}, response_metadata={'token_usage': {'completion_tokens': 46, 'prompt_tokens': 95, 'total_tokens': 141}, 'model_name': 'mistralai/mistral-large', 'system_fingerprint': '', 'finish_reason': 'tool_calls'}, id='chat-027f2bdb217e4238909cb26d3e8a8fbf', tool_calls=[{'name': 'GetWeather', 'args': {'location': 'Los Angeles, CA'}, 'id': 'chatcmpl-tool-6c06a19bbe824d78a322eb193dbde12d', 'type': 'tool_call'}, {'name': 'GetWeather', 'args': {'location': 'New York, NY'}, 'id': 'chatcmpl-tool-493542e46f1141bfbfeb5deae6c9e086', 'type': 'tool_call'}], usage_metadata={'input_tokens': 95, 'output_tokens': 46, 'total_tokens': 141})"
]
},
"execution_count": 4,
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
@@ -530,18 +524,24 @@
},
{
"cell_type": "code",
"execution_count": 5,
"execution_count": 4,
"id": "38f10ba7",
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"[{'name': 'GetWeather', 'args': {'location': 'Los Angeles'}, 'id': None},\n",
" {'name': 'GetWeather', 'args': {'location': 'New York'}, 'id': None}]"
"[{'name': 'GetWeather',\n",
" 'args': {'location': 'Los Angeles, CA'},\n",
" 'id': 'chatcmpl-tool-6c06a19bbe824d78a322eb193dbde12d',\n",
" 'type': 'tool_call'},\n",
" {'name': 'GetWeather',\n",
" 'args': {'location': 'New York, NY'},\n",
" 'id': 'chatcmpl-tool-493542e46f1141bfbfeb5deae6c9e086',\n",
" 'type': 'tool_call'}]"
]
},
"execution_count": 5,
"execution_count": 4,
"metadata": {},
"output_type": "execute_result"
}
@@ -567,7 +567,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.9"
"version": "3.10.14"
}
},
"nbformat": 4,

View File

@@ -0,0 +1,383 @@
{
"cells": [
{
"cell_type": "raw",
"metadata": {
"vscode": {
"languageId": "raw"
}
},
"source": [
"---\n",
"sidebar_label: SambaStudio\n",
"---"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"# ChatSambaStudio\n",
"\n",
"This will help you getting started with SambaStudio [chat models](/docs/concepts/#chat-models). For detailed documentation of all ChatStudio features and configurations head to the [API reference](https://api.python.langchain.com/en/latest/chat_models/langchain_community.chat_models.sambanova.ChatSambaStudio.html).\n",
"\n",
"**[SambaNova](https://sambanova.ai/)'s** [SambaStudio](https://docs.sambanova.ai/sambastudio/latest/sambastudio-intro.html) SambaStudio is a rich, GUI-based platform that provides the functionality to train, deploy, and manage models in SambaNova [DataScale](https://sambanova.ai/products/datascale) systems.\n",
"\n",
"## Overview\n",
"### Integration details\n",
"\n",
"| Class | Package | Local | Serializable | JS support | Package downloads | Package latest |\n",
"| :--- | :--- | :---: | :---: | :---: | :---: | :---: |\n",
"| [ChatSambaStudio](https://api.python.langchain.com/en/latest/chat_models/langchain_community.chat_models.sambanova.ChatSambaStudio.html) | [langchain-community](https://python.langchain.com/api_reference/community/index.html) | ❌ | ❌ | ❌ | ![PyPI - Downloads](https://img.shields.io/pypi/dm/langchain_community?style=flat-square&label=%20) | ![PyPI - Version](https://img.shields.io/pypi/v/langchain_community?style=flat-square&label=%20) |\n",
"\n",
"### Model features\n",
"\n",
"| [Tool calling](/docs/how_to/tool_calling) | [Structured output](/docs/how_to/structured_output/) | JSON mode | [Image input](/docs/how_to/multimodal_inputs/) | Audio input | Video input | [Token-level streaming](/docs/how_to/chat_streaming/) | Native async | [Token usage](/docs/how_to/chat_token_usage_tracking/) | [Logprobs](/docs/how_to/logprobs/) |\n",
"| :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: | :---: |\n",
"| ❌ | ❌ | ❌ | ❌ | ❌ | ❌ | ✅ | ✅ | ✅ | ❌ | \n",
"\n",
"## Setup\n",
"\n",
"To access ChatSambaStudio models you will need to [deploy an endpoint](https://docs.sambanova.ai/sambastudio/latest/language-models.html) in your SambaStudio platform, install the `langchain_community` integration package, and install the `SSEClient` Package.\n",
"\n",
"```bash\n",
"pip install langchain-community\n",
"pip install sseclient-py\n",
"```\n",
"\n",
"### Credentials\n",
"\n",
"Get the URL and API Key from your SambaStudio deployed endpoint and add them to your environment variables:\n",
"\n",
"``` bash\n",
"export SAMBASTUDIO_URL=\"your-api-key-here\"\n",
"export SAMBASTUDIO_API_KEY=\"your-api-key-here\"\n",
"```"
]
},
{
"cell_type": "code",
"execution_count": 1,
"metadata": {},
"outputs": [],
"source": [
"import getpass\n",
"import os\n",
"\n",
"if not os.getenv(\"SAMBASTUDIO_URL\"):\n",
" os.environ[\"SAMBASTUDIO_URL\"] = getpass.getpass(\"Enter your SambaStudio URL: \")\n",
"if not os.getenv(\"SAMBASTUDIO_API_KEY\"):\n",
" os.environ[\"SAMBASTUDIO_API_KEY\"] = getpass.getpass(\n",
" \"Enter your SambaStudio API key: \"\n",
" )"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"If you want to get automated tracing of your model calls you can also set your [LangSmith](https://docs.smith.langchain.com/) API key by uncommenting below:"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"# os.environ[\"LANGCHAIN_TRACING_V2\"] = \"true\"\n",
"# os.environ[\"LANGCHAIN_API_KEY\"] = getpass.getpass(\"Enter your LangSmith API key: \")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"### Installation\n",
"\n",
"The LangChain __SambaStudio__ integration lives in the `langchain_community` package:"
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"%pip install -qU langchain-community\n",
"%pip install -qu sseclient-py"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Instantiation\n",
"\n",
"Now we can instantiate our model object and generate chat completions:"
]
},
{
"cell_type": "code",
"execution_count": 2,
"metadata": {},
"outputs": [],
"source": [
"from langchain_community.chat_models.sambanova import ChatSambaStudio\n",
"\n",
"llm = ChatSambaStudio(\n",
" model=\"Meta-Llama-3-70B-Instruct-4096\", # set if using a CoE endpoint\n",
" max_tokens=1024,\n",
" temperature=0.7,\n",
" top_k=1,\n",
" top_p=0.01,\n",
" do_sample=True,\n",
" process_prompt=\"True\", # set if using a CoE endpoint\n",
")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Invocation"
]
},
{
"cell_type": "code",
"execution_count": 3,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content=\"J'adore la programmation.\", response_metadata={'id': 'item0', 'partial': False, 'value': {'completion': \"J'adore la programmation.\", 'logprobs': {'text_offset': [], 'top_logprobs': []}, 'prompt': '<|start_header_id|>system<|end_header_id|>\\n\\nYou are a helpful assistant that translates English to French. Translate the user sentence.<|eot_id|><|start_header_id|>user<|end_header_id|>\\n\\nI love programming.<|eot_id|><|start_header_id|>assistant<|end_header_id|>\\n\\n', 'stop_reason': 'end_of_text', 'tokens': ['J', \"'\", 'ad', 'ore', ' la', ' programm', 'ation', '.'], 'total_tokens_count': 43}, 'params': {}, 'status': None}, id='item0')"
]
},
"execution_count": 3,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"messages = [\n",
" (\n",
" \"system\",\n",
" \"You are a helpful assistant that translates English to French. Translate the user sentence.\",\n",
" ),\n",
" (\"human\", \"I love programming.\"),\n",
"]\n",
"ai_msg = llm.invoke(messages)\n",
"ai_msg"
]
},
{
"cell_type": "code",
"execution_count": 4,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"J'adore la programmation.\n"
]
}
],
"source": [
"print(ai_msg.content)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Chaining\n",
"\n",
"We can [chain](/docs/how_to/sequence/) our model with a prompt template like so:"
]
},
{
"cell_type": "code",
"execution_count": 5,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='Ich liebe das Programmieren.', response_metadata={'id': 'item0', 'partial': False, 'value': {'completion': 'Ich liebe das Programmieren.', 'logprobs': {'text_offset': [], 'top_logprobs': []}, 'prompt': '<|start_header_id|>system<|end_header_id|>\\n\\nYou are a helpful assistant that translates English to German.<|eot_id|><|start_header_id|>user<|end_header_id|>\\n\\nI love programming.<|eot_id|><|start_header_id|>assistant<|end_header_id|>\\n\\n', 'stop_reason': 'end_of_text', 'tokens': ['Ich', ' liebe', ' das', ' Programm', 'ieren', '.'], 'total_tokens_count': 36}, 'params': {}, 'status': None}, id='item0')"
]
},
"execution_count": 5,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"from langchain_core.prompts import ChatPromptTemplate\n",
"\n",
"prompt = ChatPromptTemplate(\n",
" [\n",
" (\n",
" \"system\",\n",
" \"You are a helpful assistant that translates {input_language} to {output_language}.\",\n",
" ),\n",
" (\"human\", \"{input}\"),\n",
" ]\n",
")\n",
"\n",
"chain = prompt | llm\n",
"chain.invoke(\n",
" {\n",
" \"input_language\": \"English\",\n",
" \"output_language\": \"German\",\n",
" \"input\": \"I love programming.\",\n",
" }\n",
")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Streaming"
]
},
{
"cell_type": "code",
"execution_count": 6,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Arrr, ye landlubber! Ye be wantin' to learn about owls, eh? Well, matey, settle yerself down with a pint o' grog and listen close, for I be tellin' ye about these fascinatin' creatures o' the night!\n",
"\n",
"Owls be birds, but not just any birds, me hearty! They be nocturnal, meanin' they do their huntin' at night, when the rest o' the world be sleepin'. And they be experts at it, too! Their big, round eyes be designed for seein' in the dark, with a special reflective layer called the tapetum lucidum that helps 'em spot prey in the shadows. It's like havin' a built-in lantern, savvy?\n",
"\n",
"But that be not all, me matey! Owls also have acute hearin', which helps 'em pinpoint the slightest sounds in the dark. And their ears be asymmetrical, meanin' one ear be higher than the other, which gives 'em better depth perception. It's like havin' a built-in sonar system, arrr!\n",
"\n",
"Now, ye might be wonderin' how owls fly so silently, like ghosts in the night. Well, it be because o' their special feathers, me hearty! They have soft, fringed feathers on their wings that help reduce noise and turbulence, makin' 'em the sneakiest flyers on the seven seas... er, skies!\n",
"\n",
"Owls come in all shapes and sizes, from the tiny elf owl to the great grey owl, which be one o' the largest owl species in the world. And they be found on every continent, except Antarctica, o' course. They be solitary creatures, but some species be known to form long-term monogamous relationships, like the barn owl and its mate.\n",
"\n",
"So, there ye have it, me hearty! Owls be amazin' creatures, with their clever adaptations and stealthy ways. Now, go forth and spread the word about these magnificent birds o' the night! And remember, if ye ever encounter an owl in the wild, be sure to show respect and keep a weather eye open, or ye might just find yerself on the receivin' end o' a silent, flyin' tackle! Arrr!"
]
}
],
"source": [
"system = \"You are a helpful assistant with pirate accent.\"\n",
"human = \"I want to learn more about this animal: {animal}\"\n",
"prompt = ChatPromptTemplate.from_messages([(\"system\", system), (\"human\", human)])\n",
"\n",
"chain = prompt | llm\n",
"\n",
"for chunk in chain.stream({\"animal\": \"owl\"}):\n",
" print(chunk.content, end=\"\", flush=True)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Async"
]
},
{
"cell_type": "code",
"execution_count": 7,
"metadata": {},
"outputs": [
{
"data": {
"text/plain": [
"AIMessage(content='The capital of France is Paris.', response_metadata={'id': 'item0', 'partial': False, 'value': {'completion': 'The capital of France is Paris.', 'logprobs': {'text_offset': [], 'top_logprobs': []}, 'prompt': '<|start_header_id|>user<|end_header_id|>\\n\\nwhat is the capital of France?<|eot_id|><|start_header_id|>assistant<|end_header_id|>\\n\\n', 'stop_reason': 'end_of_text', 'tokens': ['The', ' capital', ' of', ' France', ' is', ' Paris', '.'], 'total_tokens_count': 24}, 'params': {}, 'status': None}, id='item0')"
]
},
"execution_count": 7,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"prompt = ChatPromptTemplate.from_messages(\n",
" [\n",
" (\n",
" \"human\",\n",
" \"what is the capital of {country}?\",\n",
" )\n",
" ]\n",
")\n",
"\n",
"chain = prompt | llm\n",
"await chain.ainvoke({\"country\": \"France\"})"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## Async Streaming"
]
},
{
"cell_type": "code",
"execution_count": 8,
"metadata": {},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Quantum computers use quantum bits (qubits) to process multiple possibilities simultaneously, exponentially faster than classical computers, enabling breakthroughs in fields like cryptography, optimization, and simulation."
]
}
],
"source": [
"prompt = ChatPromptTemplate.from_messages(\n",
" [\n",
" (\n",
" \"human\",\n",
" \"in less than {num_words} words explain me {topic} \",\n",
" )\n",
" ]\n",
")\n",
"chain = prompt | llm\n",
"\n",
"async for chunk in chain.astream({\"num_words\": 30, \"topic\": \"quantum computers\"}):\n",
" print(chunk.content, end=\"\", flush=True)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"## API reference\n",
"\n",
"For detailed documentation of all ChatSambaStudio features and configurations head to the API reference: https://api.python.langchain.com/en/latest/chat_models/langchain_community.chat_models.sambanova.ChatSambaStudio.html"
]
}
],
"metadata": {
"kernelspec": {
"display_name": "langchain",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.19"
}
},
"nbformat": 4,
"nbformat_minor": 2
}

View File

@@ -264,22 +264,20 @@ See a [usage example](/docs/integrations/document_loaders/url/#playwright-url-lo
from langchain_community.document_loaders.onenote import OneNoteLoader
```
## AI Agent Memory System
[AI agent](https://learn.microsoft.com/en-us/azure/cosmos-db/ai-agents) needs robust memory systems that support multi-modality, offer strong operational performance, and enable agent memory sharing as well as separation.
## Vector Stores
### Azure Cosmos DB
AI agents can rely on Azure Cosmos DB as a unified [memory system](https://learn.microsoft.com/en-us/azure/cosmos-db/ai-agents#memory-can-make-or-break-agents) solution, enjoying speed, scale, and simplicity. This service successfully [enabled OpenAI's ChatGPT service](https://www.youtube.com/watch?v=6IIUtEFKJec&t) to scale dynamically with high reliability and low maintenance. Powered by an atom-record-sequence engine, it is the world's first globally distributed [NoSQL](https://learn.microsoft.com/en-us/azure/cosmos-db/distributed-nosql), [relational](https://learn.microsoft.com/en-us/azure/cosmos-db/distributed-relational), and [vector database](https://learn.microsoft.com/en-us/azure/cosmos-db/vector-database) service that offers a serverless mode.
Below are two available Azure Cosmos DB APIs that can provide vector store functionalities.
### Azure Cosmos DB for MongoDB (vCore)
#### Azure Cosmos DB for MongoDB (vCore)
>[Azure Cosmos DB for MongoDB vCore](https://learn.microsoft.com/en-us/azure/cosmos-db/mongodb/vcore/) makes it easy to create a database with full native MongoDB support.
> You can apply your MongoDB experience and continue to use your favorite MongoDB drivers, SDKs, and tools by pointing your application to the API for MongoDB vCore account's connection string.
> Use vector search in Azure Cosmos DB for MongoDB vCore to seamlessly integrate your AI-based applications with your data that's stored in Azure Cosmos DB.
#### Installation and Setup
##### Installation and Setup
See [detail configuration instructions](/docs/integrations/vectorstores/azure_cosmos_db).
@@ -289,7 +287,7 @@ We need to install `pymongo` python package.
pip install pymongo
```
#### Deploy Azure Cosmos DB on Microsoft Azure
##### Deploy Azure Cosmos DB on Microsoft Azure
Azure Cosmos DB for MongoDB vCore provides developers with a fully managed MongoDB-compatible database service for building modern applications with a familiar architecture.
@@ -303,7 +301,7 @@ See a [usage example](/docs/integrations/vectorstores/azure_cosmos_db).
from langchain_community.vectorstores import AzureCosmosDBVectorSearch
```
### Azure Cosmos DB NoSQL
#### Azure Cosmos DB NoSQL
>[Azure Cosmos DB for NoSQL](https://learn.microsoft.com/en-us/azure/cosmos-db/nosql/vector-search) now offers vector indexing and search in preview.
This feature is designed to handle high-dimensional vectors, enabling efficient and accurate vector search at any scale. You can now store vectors
@@ -312,7 +310,7 @@ but also high-dimensional vectors as other properties of the documents. This col
as the vectors are stored in the same logical unit as the data they represent. This simplifies data management, AI application architectures, and the
efficiency of vector-based operations.
#### Installation and Setup
##### Installation and Setup
See [detail configuration instructions](/docs/integrations/vectorstores/azure_cosmos_db_no_sql).
@@ -322,7 +320,7 @@ We need to install `azure-cosmos` python package.
pip install azure-cosmos
```
#### Deploy Azure Cosmos DB on Microsoft Azure
##### Deploy Azure Cosmos DB on Microsoft Azure
Azure Cosmos DB offers a solution for modern apps and intelligent workloads by being very responsive with dynamic and elastic autoscale. It is available
in every Azure region and can automatically replicate data closer to users. It has SLA guaranteed low-latency and high availability.
@@ -336,6 +334,7 @@ from langchain_community.vectorstores import AzureCosmosDBNoSQLVectorSearch
```
### Azure Database for PostgreSQL
>[Azure Database for PostgreSQL - Flexible Server](https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/service-overview) is a relational database service based on the open-source Postgres database engine. It's a fully managed database-as-a-service that can handle mission-critical workloads with predictable performance, security, high availability, and dynamic scalability.
See [set up instructions](https://learn.microsoft.com/en-us/azure/postgresql/flexible-server/quickstart-create-server-portal) for Azure Database for PostgreSQL.
@@ -446,6 +445,38 @@ The `azure_ai_services` toolkit includes the following tools:
- Text to Speech: [AzureAiServicesTextToSpeechTool](https://python.langchain.com/api_reference/community/tools/langchain_community.tools.azure_ai_services.text_to_speech.AzureAiServicesTextToSpeechTool.html)
- Text Analytics for Health: [AzureAiServicesTextAnalyticsForHealthTool](https://python.langchain.com/api_reference/community/tools/langchain_community.tools.azure_ai_services.text_analytics_for_health.AzureAiServicesTextAnalyticsForHealthTool.html)
### Azure Cognitive Services
We need to install several python packages.
```bash
pip install azure-ai-formrecognizer azure-cognitiveservices-speech azure-ai-vision-imageanalysis
```
See a [usage example](/docs/integrations/tools/azure_cognitive_services).
```python
from langchain_community.agent_toolkits import AzureCognitiveServicesToolkit
```
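As a quick sketch, the toolkit exposes its tools through `get_tools()`; this assumes the required Azure credentials (e.g. `AZURE_COGS_KEY`, `AZURE_COGS_ENDPOINT`, and `AZURE_COGS_REGION`) are already set in the environment:

```python
from langchain_community.agent_toolkits import AzureCognitiveServicesToolkit

# Assumption: the Azure Cognitive Services credentials are set as environment variables
toolkit = AzureCognitiveServicesToolkit()
for tool in toolkit.get_tools():
    print(tool.name)
```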
#### Azure AI Services individual tools
The `azure_ai_services` toolkit includes the tools that query the `Azure Cognitive Services`:
- `AzureCogsFormRecognizerTool`: Form Recognizer API
- `AzureCogsImageAnalysisTool`: Image Analysis API
- `AzureCogsSpeech2TextTool`: Speech2Text API
- `AzureCogsText2SpeechTool`: Text2Speech API
- `AzureCogsTextAnalyticsHealthTool`: Text Analytics for Health API
```python
from langchain_community.tools.azure_cognitive_services import (
AzureCogsFormRecognizerTool,
AzureCogsImageAnalysisTool,
AzureCogsSpeech2TextTool,
AzureCogsText2SpeechTool,
AzureCogsTextAnalyticsHealthTool,
)
```
### Microsoft Office 365 email and calendar
@@ -465,11 +496,11 @@ from langchain_community.agent_toolkits import O365Toolkit
#### Office 365 individual tools
You can use individual tools from the Office 365 Toolkit:
- `O365CreateDraftMessage`: tool for creating a draft email in Office 365
- `O365SearchEmails`: tool for searching email messages in Office 365
- `O365SearchEvents`: tool for searching calendar events in Office 365
- `O365SendEvent`: tool for sending calendar events in Office 365
- `O365SendMessage`: tool for sending an email in Office 365
- `O365CreateDraftMessage`: creating a draft email in Office 365
- `O365SearchEmails`: searching email messages in Office 365
- `O365SearchEvents`: searching calendar events in Office 365
- `O365SendEvent`: sending calendar events in Office 365
- `O365SendMessage`: sending an email in Office 365
```python
from langchain_community.tools.office365 import O365CreateDraftMessage
@@ -497,9 +528,9 @@ from langchain_community.utilities.powerbi import PowerBIDataset
#### PowerBI individual tools
You can use individual tools from the Azure PowerBI Toolkit:
- `InfoPowerBITool`: tool for getting metadata about a PowerBI Dataset
- `ListPowerBITool`: tool for getting tables names
- `QueryPowerBITool`: tool for querying a PowerBI Dataset
- `InfoPowerBITool`: getting metadata about a PowerBI Dataset
- `ListPowerBITool`: getting table names
- `QueryPowerBITool`: querying a PowerBI Dataset
```python
from langchain_community.tools.powerbi.tool import InfoPowerBITool


@@ -0,0 +1,44 @@
# BAAI
>[Beijing Academy of Artificial Intelligence (BAAI) (Wikipedia)](https://en.wikipedia.org/wiki/Beijing_Academy_of_Artificial_Intelligence),
> also known as `Zhiyuan Institute`, is a Chinese non-profit artificial
> intelligence (AI) research laboratory. `BAAI` conducts AI research
> and is dedicated to promoting collaboration among academia and industry,
> as well as fostering top talent and a focus on long-term research on
> the fundamentals of AI technology. As a collaborative hub, BAAI's founding
> members include leading AI companies, universities, and research institutes.
## Embedding Models
### HuggingFaceBgeEmbeddings
>[BGE models on HuggingFace](https://huggingface.co/BAAI/bge-large-en-v1.5)
> are one of [the best open-source embedding models](https://huggingface.co/spaces/mteb/leaderboard).
See a [usage example](/docs/integrations/text_embedding/bge_huggingface).
```python
from langchain_community.embeddings import HuggingFaceBgeEmbeddings
```
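A minimal local sketch (the model name and the normalization flag are illustrative choices based on the BGE model card; `sentence-transformers` must be installed):

```python
from langchain_community.embeddings import HuggingFaceBgeEmbeddings

embeddings = HuggingFaceBgeEmbeddings(
    model_name="BAAI/bge-large-en-v1.5",
    model_kwargs={"device": "cpu"},
    encode_kwargs={"normalize_embeddings": True},  # BGE vectors are usually normalized
)

vector = embeddings.embed_query("What is BAAI?")
print(len(vector))
```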
### IpexLLMBgeEmbeddings
>[IPEX-LLM](https://github.com/intel-analytics/ipex-llm) is a PyTorch
> library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU,
> discrete GPU such as Arc, Flex and Max) with very low latency.
See a [usage example running model on Intel CPU](/docs/integrations/text_embedding/ipex_llm).
See a [usage example running model on Intel GPU](/docs/integrations/text_embedding/ipex_llm_gpu).
```python
from langchain_community.embeddings import IpexLLMBgeEmbeddings
```
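A rough sketch along the same lines (it assumes `ipex-llm` and `sentence-transformers` are installed and that the wrapper accepts the usual `model_name`/`model_kwargs`/`encode_kwargs` arguments; see the linked notebooks for the authoritative setup):

```python
from langchain_community.embeddings import IpexLLMBgeEmbeddings

# Assumption: parameters mirror the other BGE wrappers; this targets the Intel CPU case
embeddings = IpexLLMBgeEmbeddings(
    model_name="BAAI/bge-large-en-v1.5",
    model_kwargs={},
    encode_kwargs={"normalize_embeddings": True},
)

vector = embeddings.embed_query("IPEX-LLM runs embeddings on Intel hardware")
```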
### QuantizedBgeEmbeddings
See a [usage example](/docs/integrations/text_embedding/itrex).
```python
from langchain_community.embeddings import QuantizedBgeEmbeddings
```


@@ -1,20 +1,35 @@
# Jina
# Jina AI
This page covers how to use the Jina Embeddings within LangChain.
It is broken into two parts: installation and setup, and then references to specific Jina wrappers.
>[Jina AI](https://jina.ai/about-us) is a search AI company. `Jina` helps businesses and developers unlock multimodal data with a better search.
## Installation and Setup
- Get a Jina AI API token from [here](https://jina.ai/embeddings/) and set it as an environment variable (`JINA_API_TOKEN`)
There exists a Jina Embeddings wrapper, which you can access with
## Chat Models
```python
from langchain_community.embeddings import JinaEmbeddings
# you can pass jina_api_key; if none is passed it will be taken from the `JINA_API_TOKEN` environment variable
embeddings = JinaEmbeddings(jina_api_key='jina_**', model_name='jina-embeddings-v2-base-en')
from langchain_community.chat_models import JinaChat
```
See a [usage example](/docs/integrations/chat/jinachat).
## Embedding Models
You can check the list of available models from [here](https://jina.ai/embeddings/)
For a more detailed walkthrough of this, see [this notebook](/docs/integrations/text_embedding/jina)
```python
from langchain_community.embeddings import JinaEmbeddings
```
See a [usage example](/docs/integrations/text_embedding/jina).
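A minimal sketch (the API key can be passed explicitly or picked up from the `JINA_API_TOKEN` environment variable, as noted above; the model name is one of the options listed on the Jina embeddings page):

```python
from langchain_community.embeddings import JinaEmbeddings

# If jina_api_key is not passed, it is read from the JINA_API_TOKEN environment variable
embeddings = JinaEmbeddings(model_name="jina-embeddings-v2-base-en")

vector = embeddings.embed_query("multimodal search")
```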
## Document Transformers
### Jina Rerank
```python
from langchain_community.document_compressors import JinaRerank
```
See a [usage example](/docs/integrations/document_transformers/jina_rerank).
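A rough sketch of reranking a few documents with the compressor interface (credential setup is omitted here and the default reranker model is assumed; see the linked notebook for the full configuration):

```python
from langchain_community.document_compressors import JinaRerank
from langchain_core.documents import Document

docs = [
    Document(page_content="Jina AI builds multimodal search tooling."),
    Document(page_content="An unrelated note about cooking pasta."),
]

reranker = JinaRerank()  # assumes the API key is configured as described in the notebook
reranked = reranker.compress_documents(documents=docs, query="What does Jina AI do?")
for doc in reranked:
    print(doc.page_content)
```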


@@ -0,0 +1,20 @@
# KoboldAI
>[KoboldAI](https://koboldai.com/) is a free, open-source project that allows users to run AI models locally
> on their own computer.
> It's a browser-based front-end that can be used for writing or role playing with an AI.
>[KoboldAI](https://github.com/KoboldAI/KoboldAI-Client) is "a browser-based front-end for
> AI-assisted writing with multiple local & remote AI models...".
> It has a public and local API that can be used in LangChain.
## Installation and Setup
Check out the [installation guide](https://github.com/KoboldAI/KoboldAI-Client).
## LLMs
See a [usage example](/docs/integrations/llms/koboldai).
```python
from langchain_community.llms import KoboldApiLLM
```
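A minimal sketch (the endpoint URL and generation parameters are illustrative and assume a local KoboldAI instance with its API enabled):

```python
from langchain_community.llms import KoboldApiLLM

# Assumption: a KoboldAI server is running locally with the API exposed at this URL
llm = KoboldApiLLM(endpoint="http://localhost:5001/api", max_length=80)

print(llm.invoke("Write one sentence about a lighthouse keeper."))
```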


@@ -4,7 +4,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"# Local BGE Embeddings with IPEX-LLM on Intel CPU\n",
"# IPEX-LLM: Local BGE Embeddings on Intel CPU\n",
"\n",
"> [IPEX-LLM](https://github.com/intel-analytics/ipex-llm) is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency.\n",
"\n",
@@ -92,10 +92,24 @@
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"name": "python"
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 2
"nbformat_minor": 4
}


@@ -4,7 +4,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"# Local BGE Embeddings with IPEX-LLM on Intel GPU\n",
"# IPEX-LLM: Local BGE Embeddings on Intel GPU\n",
"\n",
"> [IPEX-LLM](https://github.com/intel-analytics/ipex-llm) is a PyTorch library for running LLM on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and Max) with very low latency.\n",
"\n",
@@ -155,10 +155,24 @@
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"name": "python"
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.12"
}
},
"nbformat": 4,
"nbformat_minor": 2
"nbformat_minor": 4
}


@@ -5,7 +5,11 @@
"id": "1c0cf975",
"metadata": {},
"source": [
"# Jina"
"# Jina\n",
"\n",
"You can check the list of available models from [here](https://jina.ai/embeddings/).\n",
"\n",
"## Installation and setup"
]
},
{
@@ -231,7 +235,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.9.13"
"version": "3.10.12"
}
},
"nbformat": 4,


@@ -74,6 +74,24 @@
")"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"(Optional) To increase the retry time for getting a function execution response, set environment variable UC_TOOL_CLIENT_EXECUTION_TIMEOUT. Default retry time value is 120s."
]
},
{
"cell_type": "code",
"execution_count": null,
"metadata": {},
"outputs": [],
"source": [
"import os\n",
"\n",
"os.environ[\"UC_TOOL_CLIENT_EXECUTION_TIMEOUT\"] = \"200\""
]
},
{
"cell_type": "code",
"execution_count": 4,


@@ -11,6 +11,8 @@
"\n",
"The `Jira` toolkit allows agents to interact with a given Jira instance, performing actions such as searching for issues and creating issues, the tool wraps the atlassian-python-api library, for more see: https://atlassian-python-api.readthedocs.io/jira.html\n",
"\n",
"## Installation and setup\n",
"\n",
"To use this tool, you must first set as environment variables:\n",
" JIRA_API_TOKEN\n",
" JIRA_USERNAME\n",
@@ -47,7 +49,7 @@
},
"outputs": [],
"source": [
"%pip install -qU langchain-community"
"%pip install -qU langchain-community langchain_openai"
]
},
{
@@ -58,6 +60,13 @@
"ExecuteTime": {
"end_time": "2023-04-17T10:21:23.730922Z",
"start_time": "2023-04-17T10:21:22.911233Z"
},
"execution": {
"iopub.execute_input": "2024-10-02T17:40:07.356954Z",
"iopub.status.busy": "2024-10-02T17:40:07.356792Z",
"iopub.status.idle": "2024-10-02T17:40:07.359943Z",
"shell.execute_reply": "2024-10-02T17:40:07.359476Z",
"shell.execute_reply.started": "2024-10-02T17:40:07.356942Z"
}
},
"outputs": [],
@@ -72,7 +81,7 @@
},
{
"cell_type": "code",
"execution_count": 4,
"execution_count": 3,
"id": "b3050b55",
"metadata": {
"ExecuteTime": {
@@ -80,6 +89,13 @@
"start_time": "2023-04-17T10:22:42.499447Z"
},
"collapsed": false,
"execution": {
"iopub.execute_input": "2024-10-02T17:40:16.201684Z",
"iopub.status.busy": "2024-10-02T17:40:16.200922Z",
"iopub.status.idle": "2024-10-02T17:40:16.208035Z",
"shell.execute_reply": "2024-10-02T17:40:16.207564Z",
"shell.execute_reply.started": "2024-10-02T17:40:16.201634Z"
},
"jupyter": {
"outputs_hidden": false
}
@@ -93,6 +109,74 @@
"os.environ[\"JIRA_CLOUD\"] = \"True\""
]
},
{
"cell_type": "code",
"execution_count": 4,
"id": "c0768000-227b-4aa1-a838-4befbdefadb1",
"metadata": {
"execution": {
"iopub.execute_input": "2024-10-02T17:42:00.792867Z",
"iopub.status.busy": "2024-10-02T17:42:00.792365Z",
"iopub.status.idle": "2024-10-02T17:42:00.816979Z",
"shell.execute_reply": "2024-10-02T17:42:00.816419Z",
"shell.execute_reply.started": "2024-10-02T17:42:00.792827Z"
}
},
"outputs": [],
"source": [
"llm = OpenAI(temperature=0)\n",
"jira = JiraAPIWrapper()\n",
"toolkit = JiraToolkit.from_jira_api_wrapper(jira)"
]
},
{
"cell_type": "markdown",
"id": "961b3187-daf0-4907-9cc0-a69796fba4aa",
"metadata": {},
"source": [
"## Tool usage\n",
"\n",
"Let's see what individual tools are in the Jira toolkit:"
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "eb5cf521-9a91-44bc-b68e-bc4067d05a76",
"metadata": {
"execution": {
"iopub.execute_input": "2024-10-02T17:42:27.232022Z",
"iopub.status.busy": "2024-10-02T17:42:27.231140Z",
"iopub.status.idle": "2024-10-02T17:42:27.240169Z",
"shell.execute_reply": "2024-10-02T17:42:27.239693Z",
"shell.execute_reply.started": "2024-10-02T17:42:27.231949Z"
}
},
"outputs": [
{
"data": {
"text/plain": [
"[('JQL Query',\n",
" '\\n This tool is a wrapper around atlassian-python-api\\'s Jira jql API, useful when you need to search for Jira issues.\\n The input to this tool is a JQL query string, and will be passed into atlassian-python-api\\'s Jira `jql` function,\\n For example, to find all the issues in project \"Test\" assigned to the me, you would pass in the following string:\\n project = Test AND assignee = currentUser()\\n or to find issues with summaries that contain the word \"test\", you would pass in the following string:\\n summary ~ \\'test\\'\\n '),\n",
" ('Get Projects',\n",
" \"\\n This tool is a wrapper around atlassian-python-api's Jira project API, \\n useful when you need to fetch all the projects the user has access to, find out how many projects there are, or as an intermediary step that involv searching by projects. \\n there is no input to this tool.\\n \"),\n",
" ('Create Issue',\n",
" '\\n This tool is a wrapper around atlassian-python-api\\'s Jira issue_create API, useful when you need to create a Jira issue. \\n The input to this tool is a dictionary specifying the fields of the Jira issue, and will be passed into atlassian-python-api\\'s Jira `issue_create` function.\\n For example, to create a low priority task called \"test issue\" with description \"test description\", you would pass in the following dictionary: \\n {{\"summary\": \"test issue\", \"description\": \"test description\", \"issuetype\": {{\"name\": \"Task\"}}, \"priority\": {{\"name\": \"Low\"}}}}\\n '),\n",
" ('Catch all Jira API call',\n",
" '\\n This tool is a wrapper around atlassian-python-api\\'s Jira API.\\n There are other dedicated tools for fetching all projects, and creating and searching for issues, \\n use this tool if you need to perform any other actions allowed by the atlassian-python-api Jira API.\\n The input to this tool is a dictionary specifying a function from atlassian-python-api\\'s Jira API, \\n as well as a list of arguments and dictionary of keyword arguments to pass into the function.\\n For example, to get all the users in a group, while increasing the max number of results to 100, you would\\n pass in the following dictionary: {{\"function\": \"get_all_users_from_group\", \"args\": [\"group\"], \"kwargs\": {{\"limit\":100}} }}\\n or to find out how many projects are in the Jira instance, you would pass in the following string:\\n {{\"function\": \"projects\"}}\\n For more information on the Jira API, refer to https://atlassian-python-api.readthedocs.io/jira.html\\n '),\n",
" ('Create confluence page',\n",
" 'This tool is a wrapper around atlassian-python-api\\'s Confluence \\natlassian-python-api API, useful when you need to create a Confluence page. The input to this tool is a dictionary \\nspecifying the fields of the Confluence page, and will be passed into atlassian-python-api\\'s Confluence `create_page` \\nfunction. For example, to create a page in the DEMO space titled \"This is the title\" with body \"This is the body. You can use \\n<strong>HTML tags</strong>!\", you would pass in the following dictionary: {{\"space\": \"DEMO\", \"title\":\"This is the \\ntitle\",\"body\":\"This is the body. You can use <strong>HTML tags</strong>!\"}} ')]"
]
},
"execution_count": 5,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"[(tool.name, tool.description) for tool in toolkit.get_tools()]"
]
},
{
"cell_type": "code",
"execution_count": 5,
@@ -105,9 +189,6 @@
},
"outputs": [],
"source": [
"llm = OpenAI(temperature=0)\n",
"jira = JiraAPIWrapper()\n",
"toolkit = JiraToolkit.from_jira_api_wrapper(jira)\n",
"agent = initialize_agent(\n",
" toolkit.get_tools(), llm, agent=AgentType.ZERO_SHOT_REACT_DESCRIPTION, verbose=True\n",
")"


@@ -35,9 +35,16 @@
},
{
"cell_type": "code",
"execution_count": 1,
"execution_count": 2,
"id": "ff988466-c389-4ec6-b6ac-14364a537fd5",
"metadata": {
"execution": {
"iopub.execute_input": "2024-10-02T17:52:40.107644Z",
"iopub.status.busy": "2024-10-02T17:52:40.107485Z",
"iopub.status.idle": "2024-10-02T17:52:40.110169Z",
"shell.execute_reply": "2024-10-02T17:52:40.109841Z",
"shell.execute_reply.started": "2024-10-02T17:52:40.107633Z"
},
"tags": []
},
"outputs": [],
@@ -50,16 +57,23 @@
},
{
"cell_type": "code",
"execution_count": 2,
"execution_count": 4,
"id": "9ecd1ba0-3937-4359-a41e-68605f0596a1",
"metadata": {
"execution": {
"iopub.execute_input": "2024-10-02T17:59:54.134295Z",
"iopub.status.busy": "2024-10-02T17:59:54.134138Z",
"iopub.status.idle": "2024-10-02T17:59:54.137250Z",
"shell.execute_reply": "2024-10-02T17:59:54.136636Z",
"shell.execute_reply.started": "2024-10-02T17:59:54.134283Z"
},
"tags": []
},
"outputs": [],
"source": [
"with open(\"openai_openapi.yml\") as f:\n",
" data = yaml.load(f, Loader=yaml.FullLoader)\n",
"json_spec = JsonSpec(dict_=data, max_value_length=4000)\n",
"json_spec = JsonSpec(dict_={}, max_value_length=4000)\n",
"json_toolkit = JsonToolkit(spec=json_spec)\n",
"\n",
"json_agent_executor = create_json_agent(\n",
@@ -67,6 +81,48 @@
")"
]
},
{
"cell_type": "markdown",
"id": "910eccbc-9d42-49b6-a4ca-1fbc418fcee7",
"metadata": {},
"source": [
"## Individual tools\n",
"\n",
"Let's see what individual tools are inside the Jira toolkit."
]
},
{
"cell_type": "code",
"execution_count": 5,
"id": "b16a3ee5-ca16-452e-993f-c27228b945ac",
"metadata": {
"execution": {
"iopub.execute_input": "2024-10-02T18:00:30.527665Z",
"iopub.status.busy": "2024-10-02T18:00:30.527053Z",
"iopub.status.idle": "2024-10-02T18:00:30.538483Z",
"shell.execute_reply": "2024-10-02T18:00:30.537672Z",
"shell.execute_reply.started": "2024-10-02T18:00:30.527626Z"
}
},
"outputs": [
{
"data": {
"text/plain": [
"[('json_spec_list_keys',\n",
" '\\n Can be used to list all keys at a given path. \\n Before calling this you should be SURE that the path to this exists.\\n The input is a text representation of the path to the dict in Python syntax (e.g. data[\"key1\"][0][\"key2\"]).\\n '),\n",
" ('json_spec_get_value',\n",
" '\\n Can be used to see value in string format at a given path.\\n Before calling this you should be SURE that the path to this exists.\\n The input is a text representation of the path to the dict in Python syntax (e.g. data[\"key1\"][0][\"key2\"]).\\n ')]"
]
},
"execution_count": 5,
"metadata": {},
"output_type": "execute_result"
}
],
"source": [
"[(el.name, el.description) for el in json_toolkit.get_tools()]"
]
},
{
"cell_type": "markdown",
"id": "05cfcb24-4389-4b8f-ad9e-466e3fca8db0",


@@ -176,7 +176,7 @@
"id": "f8014c9d",
"metadata": {},
"source": [
"Now, we can initalize the agent with the LLM, the prompt, and the tools. The agent is responsible for taking in input and deciding what actions to take. Crucially, the Agent does not execute those actions - that is done by the AgentExecutor (next step). For more information about how to think about these components, see our [conceptual guide](/docs/concepts#agents)"
"Now, we can initialize the agent with the LLM, the prompt, and the tools. The agent is responsible for taking in input and deciding what actions to take. Crucially, the Agent does not execute those actions - that is done by the AgentExecutor (next step). For more information about how to think about these components, see our [conceptual guide](/docs/concepts#agents)"
]
},
{


@@ -209,15 +209,25 @@
},
{
"cell_type": "markdown",
"id": "5f5751e3-2e98-485f-8164-db8094039c25",
"id": "4e3fd064-aa86-448d-8db3-3c55eaa5bc15",
"metadata": {},
"source": [
"API references:\n",
"\n",
"- [QuerySQLDataBaseTool](https://python.langchain.com/api_reference/community/tools/langchain_community.tools.sql_database.tool.QuerySQLDataBaseTool.html)\n",
"- [InfoSQLDatabaseTool](https://python.langchain.com/api_reference/community/tools/langchain_community.tools.sql_database.tool.InfoSQLDatabaseTool.html)\n",
"- [ListSQLDatabaseTool](https://python.langchain.com/api_reference/community/tools/langchain_community.tools.sql_database.tool.ListSQLDatabaseTool.html)\n",
"- [QuerySQLCheckerTool](https://python.langchain.com/api_reference/community/tools/langchain_community.tools.sql_database.tool.QuerySQLCheckerTool.html)"
"You can use the individual tools directly:"
]
},
{
"cell_type": "code",
"execution_count": null,
"id": "7fa8d00c-750c-4803-9b66-057d12b26b06",
"metadata": {},
"outputs": [],
"source": [
"from langchain_community.tools.sql_database.tool import (\n",
" InfoSQLDatabaseTool,\n",
" ListSQLDatabaseTool,\n",
" QuerySQLCheckerTool,\n",
" QuerySQLDataBaseTool,\n",
")"
]
},
{
@@ -604,7 +614,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.10.4"
"version": "3.10.12"
}
},
"nbformat": 4,


@@ -52,7 +52,7 @@
}
],
"source": [
"%pip install --upgrade --quiet langchain-google-spanner"
"%pip install --upgrade --quiet langchain-google-spanner langchain-google-vertexai"
]
},
{
@@ -124,7 +124,8 @@
"PROJECT_ID = \"my-project-id\" # @param {type:\"string\"}\n",
"\n",
"# Set the project id\n",
"!gcloud config set project {PROJECT_ID}"
"!gcloud config set project {PROJECT_ID}\n",
"%env GOOGLE_CLOUD_PROJECT={PROJECT_ID}"
]
},
{
@@ -194,14 +195,16 @@
" instance_id=INSTANCE,\n",
" database_id=DATABASE,\n",
" table_name=TABLE_NAME,\n",
" id_column=\"row_id\",\n",
" metadata_columns=[\n",
" TableColumn(name=\"metadata\", type=\"JSON\", is_null=True),\n",
" TableColumn(name=\"title\", type=\"STRING(MAX)\", is_null=False),\n",
" ],\n",
" secondary_indexes=[\n",
" SecondaryIndex(index_name=\"row_id_and_title\", columns=[\"row_id\", \"title\"])\n",
" ],\n",
" # Customize the table creation\n",
" # id_column=\"row_id\",\n",
" # content_column=\"content_column\",\n",
" # metadata_columns=[\n",
" # TableColumn(name=\"metadata\", type=\"JSON\", is_null=True),\n",
" # TableColumn(name=\"title\", type=\"STRING(MAX)\", is_null=False),\n",
" # ],\n",
" # secondary_indexes=[\n",
" # SecondaryIndex(index_name=\"row_id_and_title\", columns=[\"row_id\", \"title\"])\n",
" # ],\n",
")"
]
},
@@ -262,9 +265,11 @@
" instance_id=INSTANCE,\n",
" database_id=DATABASE,\n",
" table_name=TABLE_NAME,\n",
" ignore_metadata_columns=[],\n",
" embedding_service=embeddings,\n",
" metadata_json_column=\"metadata\",\n",
" # Connect to a custom vector store table\n",
" # id_column=\"row_id\",\n",
" # content_column=\"content\",\n",
" # metadata_columns=[\"metadata\", \"title\"],\n",
")"
]
},
@@ -272,7 +277,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### 🔐 Add Documents\n",
"#### Add Documents\n",
"To add documents in the vector store."
]
},
@@ -289,14 +294,15 @@
"loader = HNLoader(\"https://news.ycombinator.com/item?id=34817881\")\n",
"\n",
"documents = loader.load()\n",
"ids = [str(uuid.uuid4()) for _ in range(len(documents))]"
"ids = [str(uuid.uuid4()) for _ in range(len(documents))]\n",
"db.add_documents(documents, ids)"
]
},
{
"cell_type": "markdown",
"metadata": {},
"source": [
"#### 🔐 Search Documents\n",
"#### Search Documents\n",
"To search documents in the vector store with similarity search."
]
},
@@ -313,7 +319,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### 🔐 Search Documents\n",
"#### Search Documents\n",
"To search documents in the vector store with max marginal relevance search."
]
},
@@ -330,7 +336,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### 🔐 Delete Documents\n",
"#### Delete Documents\n",
"To remove documents from the vector store, use the IDs that correspond to the values in the `row_id`` column when initializing the VectorStore."
]
},
@@ -347,7 +353,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### 🔐 Delete Documents\n",
"#### Delete Documents\n",
"To remove documents from the vector store, you can utilize the documents themselves. The content column and metadata columns provided during VectorStore initialization will be used to find out the rows corresponding to the documents. Any matching rows will then be deleted."
]
},
@@ -377,7 +383,7 @@
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
"version": "3.11.6"
"version": "3.11.8"
}
},
"nbformat": 4,


@@ -438,7 +438,7 @@
"app = workflow.compile(checkpointer=MemorySaver())\n",
"\n",
"# Async invocation:\n",
"output = await app.ainvoke({\"messages\": input_messages}, config):\n",
"output = await app.ainvoke({\"messages\": input_messages}, config)\n",
"output[\"messages\"][-1].pretty_print()\n",
"```\n",
"\n",
@@ -686,7 +686,7 @@
"\n",
"input_messages = [HumanMessage(query)]\n",
"output = app.invoke(\n",
" {\"messages\": input_messages, \"language\": language},\n",
" {\"messages\": input_messages},\n",
" config,\n",
")\n",
"output[\"messages\"][-1].pretty_print()"


@@ -8,7 +8,8 @@ const FEATURE_TABLES = {
chat: {
link: "/docs/integrations/chat",
columns: [
{title: "Provider", formatter: (item) => <a href={item.link}>{item.name}</a>},
{title: "Provider", mode: "category", formatter: (item) => <a href={item.link}>{item.name}</a>},
{title: "Provider", mode: "item", formatter: (item) => <a href={item.link}>{item.name}</a>},
{title: <a href="/docs/how_to/tool_calling">Tool calling</a>, formatter: (item) => item.tool_calling ? "✅" : "❌"},
{title: <a href="/docs/how_to/structured_output/">Structured output</a>, formatter: (item) => item.structured_output ? "✅" : "❌"},
{title: "JSON mode", formatter: (item) => item.json_mode ? "✅" : "❌"},
@@ -221,7 +222,7 @@ const FEATURE_TABLES = {
llms: {
link: "/docs/integrations/llms",
columns: [
{title: "Provider", formatter: (item) => <a href={
{title: "Provider", mode: "category", formatter: (item) => <a href={
item.link
}>{item.name}</a>},
{title: "Package", formatter: (item) => <a href={
@@ -294,7 +295,8 @@ const FEATURE_TABLES = {
text_embedding: {
link: "/docs/integrations/text_embedding",
columns: [
{title: "Provider", formatter: (item) => <a href={item.link}>{item.name}</a>},
{title: "Provider", mode: "category", formatter: (item) => <a href={item.link}>{item.name}</a>},
{title: "Provider", mode: "item", formatter: (item) => <a href={`/docs/integrations/${item.top ? "platforms":"providers"}/${item.link}`}>{item.name}</a>},
{title: "Package", formatter: (item) => <a href={item.apiLink}>{item.package}</a>},
],
items:[
@@ -1120,7 +1122,7 @@ const DEPRECATED_DOC_IDS = [
"integrations/text_embedding/ernie",
];
function toTable(columns, items) {
function toTable(columns, items, mode) {
const headers = columns.map((col) => col.title);
return (
<table>
@@ -1132,7 +1134,7 @@ function toTable(columns, items) {
<tbody>
{items.map((item, i) => (
<tr key={`row-${i}`}>
{columns.map((col, j) => <td key={`cell-${i}-${j}`}>{col.formatter(item)}</td>)}
{columns.filter(col => !col.mode || col.mode === mode).map((col, j) => <td key={`cell-${i}-${j}`}>{col.formatter(item)}</td>)}
</tr>
))}
</tbody>
@@ -1142,7 +1144,7 @@ function toTable(columns, items) {
export function CategoryTable({ category }) {
const cat = FEATURE_TABLES[category];
const rtn = toTable(cat.columns, cat.items);
const rtn = toTable(cat.columns, cat.items, "category");
return rtn;
}
@@ -1152,7 +1154,7 @@ export function ItemTable({ category, item }) {
if (!row) {
throw new Error(`Item ${item} not found in category ${category}`);
}
const rtn = toTable(cat.columns, [row]);
const rtn = toTable(cat.columns, [row], "item");
return rtn;
}
@@ -1185,6 +1187,7 @@ export function IndexTable() {
},
],
rows,
"index",
);
return rtn;
}


@@ -27,11 +27,11 @@ MODEL_COST_PER_1K_TOKENS = {
"gpt-4o-mini-completion": 0.0006,
"gpt-4o-mini-2024-07-18-completion": 0.0006,
# GPT-4o input
"gpt-4o": 0.005,
"gpt-4o": 0.0025,
"gpt-4o-2024-05-13": 0.005,
"gpt-4o-2024-08-06": 0.0025,
# GPT-4o output
"gpt-4o-completion": 0.015,
"gpt-4o-completion": 0.01,
"gpt-4o-2024-05-13-completion": 0.015,
"gpt-4o-2024-08-06-completion": 0.01,
# GPT-4 input


@@ -149,6 +149,7 @@ if TYPE_CHECKING:
)
from langchain_community.chat_models.sambanova import (
ChatSambaNovaCloud,
ChatSambaStudio,
)
from langchain_community.chat_models.snowflake import (
ChatSnowflakeCortex,
@@ -215,6 +216,7 @@ __all__ = [
"ChatPerplexity",
"ChatPremAI",
"ChatSambaNovaCloud",
"ChatSambaStudio",
"ChatSparkLLM",
"ChatSnowflakeCortex",
"ChatTongyi",
@@ -274,6 +276,7 @@ _module_lookup = {
"ChatOpenAI": "langchain_community.chat_models.openai",
"ChatPerplexity": "langchain_community.chat_models.perplexity",
"ChatSambaNovaCloud": "langchain_community.chat_models.sambanova",
"ChatSambaStudio": "langchain_community.chat_models.sambanova",
"ChatSnowflakeCortex": "langchain_community.chat_models.snowflake",
"ChatSparkLLM": "langchain_community.chat_models.sparkllm",
"ChatTongyi": "langchain_community.chat_models.tongyi",


@@ -342,7 +342,7 @@ class ChatLlamaCpp(BaseChatModel):
self,
tools: Sequence[Union[Dict[str, Any], Type[BaseModel], Callable, BaseTool]],
*,
tool_choice: Optional[Union[Dict[str, Dict], bool, str]] = None,
tool_choice: Optional[Union[dict, bool, str]] = None,
**kwargs: Any,
) -> Runnable[LanguageModelInput, BaseMessage]:
"""Bind tool-like objects to this chat model
@@ -538,7 +538,8 @@ class ChatLlamaCpp(BaseChatModel):
"Received None."
)
tool_name = convert_to_openai_tool(schema)["function"]["name"]
llm = self.bind_tools([schema], tool_choice=tool_name)
tool_choice = {"type": "function", "function": {"name": tool_name}}
llm = self.bind_tools([schema], tool_choice=tool_choice)
if is_pydantic_schema:
output_parser: OutputParserLike = PydanticToolsParser(
tools=[cast(Type, schema)], first_tool_only=True

File diff suppressed because it is too large


@@ -430,7 +430,7 @@ class Neo4jGraph(GraphStore):
try:
data, _, _ = self._driver.execute_query(
Query(text=query, timeout=self.timeout),
database=self._database,
database_=self._database,
parameters_=params,
)
json_data = [r.data() for r in data]
@@ -457,7 +457,7 @@ class Neo4jGraph(GraphStore):
):
raise
# fallback to allow implicit transactions
with self._driver.session() as session:
with self._driver.session(database=self._database) as session:
data = session.run(Query(text=query, timeout=self.timeout), params)
json_data = [r.data() for r in data]
if self.sanitize:


@@ -95,7 +95,7 @@ class SQLStore(BaseStore[str, bytes]):
.. code-block:: python
from langchain_rag.storage import SQLStore
from langchain_community.storage import SQLStore
# Instantiate the SQLStore with a namespace and an in-memory SQLite database URL
sql_store = SQLStore(namespace="test", db_url="sqlite:///:memory:")


@@ -1,5 +1,8 @@
import inspect
import json
import logging
import os
import time
from dataclasses import dataclass
from io import StringIO
from typing import TYPE_CHECKING, Any, Dict, List, Literal, Optional
@@ -7,7 +10,7 @@ from typing import TYPE_CHECKING, Any, Dict, List, Literal, Optional
if TYPE_CHECKING:
from databricks.sdk import WorkspaceClient
from databricks.sdk.service.catalog import FunctionInfo
from databricks.sdk.service.sql import StatementParameterListItem
from databricks.sdk.service.sql import StatementParameterListItem, StatementState
EXECUTE_FUNCTION_ARG_NAME = "__execution_args__"
DEFAULT_EXECUTE_FUNCTION_ARGS = {
@@ -15,6 +18,9 @@ DEFAULT_EXECUTE_FUNCTION_ARGS = {
"row_limit": 100,
"byte_limit": 4096,
}
UC_TOOL_CLIENT_EXECUTION_TIMEOUT = "UC_TOOL_CLIENT_EXECUTION_TIMEOUT"
DEFAULT_UC_TOOL_CLIENT_EXECUTION_TIMEOUT = "120"
_logger = logging.getLogger(__name__)
def is_scalar(function: "FunctionInfo") -> bool:
@@ -174,13 +180,42 @@ def execute_function(
parameters=parametrized_statement.parameters,
**execute_statement_args, # type: ignore
)
status = response.status
assert status is not None, f"Statement execution failed: {response}"
if status.state != StatementState.SUCCEEDED:
error = status.error
if response.status and job_pending(response.status.state) and response.statement_id:
statement_id = response.statement_id
wait_time = 0
retry_cnt = 0
client_execution_timeout = int(
os.environ.get(
UC_TOOL_CLIENT_EXECUTION_TIMEOUT,
DEFAULT_UC_TOOL_CLIENT_EXECUTION_TIMEOUT,
)
)
while wait_time < client_execution_timeout:
wait = min(2**retry_cnt, client_execution_timeout - wait_time)
_logger.debug(
f"Retrying {retry_cnt} time to get statement execution "
f"status after {wait} seconds."
)
time.sleep(wait)
response = ws.statement_execution.get_statement(statement_id) # type: ignore
if response.status is None or not job_pending(response.status.state):
break
wait_time += wait
retry_cnt += 1
if response.status and job_pending(response.status.state):
return FunctionExecutionResult(
error=f"Statement execution is still pending after {wait_time} "
"seconds. Please increase the wait_timeout argument for executing "
f"the function or increase {UC_TOOL_CLIENT_EXECUTION_TIMEOUT} "
"environment variable for increasing retrying time, default is "
f"{DEFAULT_UC_TOOL_CLIENT_EXECUTION_TIMEOUT} seconds."
)
assert response.status is not None, f"Statement execution failed: {response}"
if response.status.state != StatementState.SUCCEEDED:
error = response.status.error
assert (
error is not None
), "Statement execution failed but no error message was provided."
), f"Statement execution failed but no error message was provided: {response}"
return FunctionExecutionResult(error=f"{error.error_code}: {error.message}")
manifest = response.manifest
assert manifest is not None
@@ -211,3 +246,9 @@ def execute_function(
return FunctionExecutionResult(
format="CSV", value=csv_buffer.getvalue(), truncated=truncated
)
def job_pending(state: Optional["StatementState"]) -> bool:
from databricks.sdk.service.sql import StatementState
return state in (StatementState.PENDING, StatementState.RUNNING)


@@ -1769,6 +1769,8 @@ def _reorder_results_with_maximal_marginal_relevance(
)
for result in results
]
if not docs:
return []
documents, scores, vectors = map(list, zip(*docs))
# Get the new order of results.


@@ -1,6 +1,7 @@
from __future__ import annotations
import asyncio
import importlib.metadata
import typing
import uuid
from typing import (
@@ -18,6 +19,7 @@ from typing import (
)
import numpy as np
from packaging.version import Version  # this is a langchain-core dependency
if typing.TYPE_CHECKING:
from cassandra.cluster import Session
@@ -30,6 +32,7 @@ from langchain_community.utilities.cassandra import SetupMode
from langchain_community.vectorstores.utils import maximal_marginal_relevance
CVST = TypeVar("CVST", bound="Cassandra")
MIN_CASSIO_VERSION = Version("0.1.10")
class Cassandra(VectorStore):
@@ -110,6 +113,15 @@ class Cassandra(VectorStore):
"Could not import cassio python package. "
"Please install it with `pip install cassio`."
)
cassio_version = Version(importlib.metadata.version("cassio"))
if cassio_version is not None and cassio_version < MIN_CASSIO_VERSION:
msg = (
"Cassio version not supported. Please upgrade cassio "
f"to version {MIN_CASSIO_VERSION} or higher."
)
raise ImportError(msg)
if not table_name:
raise ValueError("Missing required parameter 'table_name'.")
self.embedding = embedding
@@ -143,6 +155,9 @@ class Cassandra(VectorStore):
**kwargs,
)
if self.session is None:
self.session = self.table.session
@property
def embeddings(self) -> Embeddings:
return self.embedding
@@ -231,6 +246,70 @@ class Cassandra(VectorStore):
await self.adelete_by_document_id(document_id)
return True
def delete_by_metadata_filter(
self,
filter: dict[str, Any],
*,
batch_size: int = 50,
) -> int:
"""Delete all documents matching a certain metadata filtering condition.
This operation does not use the vector embeddings in any way; it simply
removes all documents whose metadata match the provided condition.
Args:
filter: Filter on the metadata to apply. The filter cannot be empty.
batch_size: number of deletions per batch (until exhaustion of
the matching documents).
Returns:
The number of deleted documents.
"""
if not filter:
msg = (
"Method `delete_by_metadata_filter` does not accept an empty "
"filter. Use the `clear()` method if you really want to empty "
"the vector store."
)
raise ValueError(msg)
return self.table.find_and_delete_entries(
metadata=filter,
batch_size=batch_size,
)
async def adelete_by_metadata_filter(
self,
filter: dict[str, Any],
*,
batch_size: int = 50,
) -> int:
"""Delete all documents matching a certain metadata filtering condition.
This operation does not use the vector embeddings in any way; it simply
removes all documents whose metadata match the provided condition.
Args:
filter: Filter on the metadata to apply. The filter cannot be empty.
batch_size: number of deletions per batch (until exhaustion of
the matching documents).
Returns:
The number of deleted documents.
"""
if not filter:
msg = (
"Method `delete_by_metadata_filter` does not accept an empty "
"filter. Use the `clear()` method if you really want to empty "
"the vector store."
)
raise ValueError(msg)
return await self.table.afind_and_delete_entries(
metadata=filter,
batch_size=batch_size,
)
def add_texts(
self,
texts: Iterable[str],
@@ -333,6 +412,180 @@ class Cassandra(VectorStore):
await asyncio.gather(*tasks)
return ids
def replace_metadata(
self,
id_to_metadata: dict[str, dict],
*,
batch_size: int = 50,
) -> None:
"""Replace the metadata of documents.
For each document to update, identified by its ID, the new metadata
dictionary completely replaces what is on the store. This includes
passing empty metadata `{}` to erase the currently-stored information.
Args:
id_to_metadata: map from the Document IDs to modify to the
new metadata for updating.
Keys in this dictionary that do not correspond to an existing
document will not cause an error, rather will result in new
rows being written into the Cassandra table but without an
associated vector: hence unreachable through vector search.
batch_size: Number of concurrent requests to send to the server.
Returns:
None if the writes succeed (otherwise an error is raised).
"""
ids_and_metadatas = list(id_to_metadata.items())
for i in range(0, len(ids_and_metadatas), batch_size):
batch_i_m = ids_and_metadatas[i : i + batch_size]
futures = [
self.table.put_async(
row_id=doc_id,
metadata=doc_md,
)
for doc_id, doc_md in batch_i_m
]
for future in futures:
future.result()
return
async def areplace_metadata(
self,
id_to_metadata: dict[str, dict],
*,
concurrency: int = 50,
) -> None:
"""Replace the metadata of documents.
For each document to update, identified by its ID, the new metadata
dictionary completely replaces what is on the store. This includes
passing empty metadata `{}` to erase the currently-stored information.
Args:
id_to_metadata: map from the Document IDs to modify to the
new metadata for updating.
Keys in this dictionary that do not correspond to an existing
document will not cause an error, rather will result in new
rows being written into the Cassandra table but without an
associated vector: hence unreachable through vector search.
concurrency: Number of concurrent queries to the database.
Defaults to 50.
Returns:
None if the writes succeed (otherwise an error is raised).
"""
ids_and_metadatas = list(id_to_metadata.items())
sem = asyncio.Semaphore(concurrency)
async def send_concurrently(doc_id: str, doc_md: dict) -> None:
async with sem:
await self.table.aput(
row_id=doc_id,
metadata=doc_md,
)
for doc_id, doc_md in ids_and_metadatas:
tasks = [asyncio.create_task(send_concurrently(doc_id, doc_md))]
await asyncio.gather(*tasks)
return
@staticmethod
def _row_to_document(row: Dict[str, Any]) -> Document:
return Document(
id=row["row_id"],
page_content=row["body_blob"],
metadata=row["metadata"],
)
def get_by_document_id(self, document_id: str) -> Document | None:
"""Get by document ID.
Args:
document_id: the document ID to get.
"""
row = self.table.get(row_id=document_id)
if row is None:
return None
return self._row_to_document(row=row)
async def aget_by_document_id(self, document_id: str) -> Document | None:
"""Get by document ID.
Args:
document_id: the document ID to get.
"""
row = await self.table.aget(row_id=document_id)
if row is None:
return None
return self._row_to_document(row=row)
def metadata_search(
self,
metadata: dict[str, Any] = {}, # noqa: B006
n: int = 5,
) -> Iterable[Document]:
"""Get documents via a metadata search.
Args:
metadata: the metadata to query for.
"""
rows = self.table.find_entries(metadata=metadata, n=n)
return [self._row_to_document(row=row) for row in rows if row]
async def ametadata_search(
self,
metadata: dict[str, Any] = {}, # noqa: B006
n: int = 5,
) -> Iterable[Document]:
"""Get documents via a metadata search.
Args:
metadata: the metadata to query for.
"""
rows = await self.table.afind_entries(metadata=metadata, n=n)
return [self._row_to_document(row=row) for row in rows]
async def asimilarity_search_with_embedding_id_by_vector(
self,
embedding: List[float],
k: int = 4,
filter: Optional[Dict[str, str]] = None,
body_search: Optional[Union[str, List[str]]] = None,
) -> List[Tuple[Document, List[float], str]]:
"""Return docs most similar to embedding vector.
Args:
embedding: Embedding to look up documents similar to.
k: Number of Documents to return. Defaults to 4.
filter: Filter on the metadata to apply.
body_search: Document textual search terms to apply.
Only supported by Astra DB at the moment.
Returns:
List of (Document, embedding, id), the most similar to the query vector.
"""
kwargs: Dict[str, Any] = {}
if filter is not None:
kwargs["metadata"] = filter
if body_search is not None:
kwargs["body_search"] = body_search
hits = await self.table.aann_search(
vector=embedding,
n=k,
**kwargs,
)
return [
(
self._row_to_document(row=hit),
hit["vector"],
hit["row_id"],
)
for hit in hits
]
@staticmethod
def _search_to_documents(
hits: Iterable[Dict[str, Any]],
@@ -341,10 +594,7 @@ class Cassandra(VectorStore):
# (1=most relevant), as required by this class' contract.
return [
(
Document(
page_content=hit["body_blob"],
metadata=hit["metadata"],
),
Cassandra._row_to_document(row=hit),
0.5 + 0.5 * hit["distance"],
hit["row_id"],
)
@@ -375,7 +625,6 @@ class Cassandra(VectorStore):
kwargs["metadata"] = filter
if body_search is not None:
kwargs["body_search"] = body_search
hits = self.table.metric_ann_search(
vector=embedding,
n=k,
@@ -712,13 +961,7 @@ class Cassandra(VectorStore):
for pf_index, pf_hit in enumerate(prefetch_hits)
if pf_index in mmr_chosen_indices
]
return [
Document(
page_content=hit["body_blob"],
metadata=hit["metadata"],
)
for hit in mmr_hits
]
return [Cassandra._row_to_document(row=hit) for hit in mmr_hits]
def max_marginal_relevance_search_by_vector(
self,

View File

@@ -5,9 +5,10 @@ from __future__ import annotations
import json
import logging
import uuid
from typing import Any, Iterable, List, Optional, Tuple, Type, cast
import warnings
from typing import Any, Iterable, List, Optional, Tuple, Type, Union, cast
import requests
from httpx import Response
from langchain_core.documents import Document
from langchain_core.embeddings import Embeddings
from langchain_core.vectorstores import VectorStore
@@ -49,7 +50,7 @@ class InfinispanVS(VectorStore):
embedding=RGBEmbeddings(),
output_fields: ["texture", "color"],
lambda_key: lambda text,meta: str(meta["_key"]),
lambda_content: lambda item: item["color"]})
lambda_content: lambda item: item["color"])
"""
def __init__(
@@ -58,13 +59,48 @@ class InfinispanVS(VectorStore):
ids: Optional[List[str]] = None,
**kwargs: Any,
):
"""
Parameters
----------
cache_name: str
Embeddings cache name. Default "vector"
entity_name: str
Protobuf entity name for the embeddings. Default "vector"
text_field: str
Protobuf field name for text. Default "text"
vector_field: str
Protobuf field name for vector. Default "vector"
lambda_content: lambda
Lambda returning the content part of an item. Default returns text_field
lambda_metadata: lambda
Lambda returning the metadata part of an item. Default returns the item's
fields except text_field, vector_field, _type
output_fields: List[str]
List of fields to be returned from the item; if None, all fields are returned.
Default None
kwargs: Any
Rest of arguments passed to Infinispan. See docs"""
self.ispn = Infinispan(**kwargs)
self._configuration = kwargs
self._cache_name = str(self._configuration.get("cache_name", "vector"))
self._entity_name = str(self._configuration.get("entity_name", "vector"))
self._embedding = embedding
self._textfield = self._configuration.get("textfield", "text")
self._vectorfield = self._configuration.get("vectorfield", "vector")
self._textfield = self._configuration.get("textfield", "")
if self._textfield == "":
self._textfield = self._configuration.get("text_field", "text")
else:
warnings.warn(
"`textfield` is deprecated. Please use `text_field` " "param.",
DeprecationWarning,
)
self._vectorfield = self._configuration.get("vectorfield", "")
if self._vectorfield == "":
self._vectorfield = self._configuration.get("vector_field", "vector")
else:
warnings.warn(
"`vectorfield` is deprecated. Please use `vector_field` " "param.",
DeprecationWarning,
)
self._to_content = self._configuration.get(
"lambda_content", lambda item: self._default_content(item)
)
@@ -121,7 +157,7 @@ repeated float %s = 1;
metadata_proto += "}\n"
return metadata_proto
def schema_create(self, proto: str) -> requests.Response:
def schema_create(self, proto: str) -> Response:
"""Deploy the schema for the vector db
Args:
proto(str): protobuf schema
@@ -130,14 +166,14 @@ repeated float %s = 1;
"""
return self.ispn.schema_post(self._entity_name + ".proto", proto)
def schema_delete(self) -> requests.Response:
def schema_delete(self) -> Response:
"""Delete the schema for the vector db
Returns:
An http Response containing the result of the operation
"""
return self.ispn.schema_delete(self._entity_name + ".proto")
def cache_create(self, config: str = "") -> requests.Response:
def cache_create(self, config: str = "") -> Response:
"""Create the cache for the vector db
Args:
config(str): configuration of the cache.
@@ -172,14 +208,14 @@ repeated float %s = 1;
)
return self.ispn.cache_post(self._cache_name, config)
def cache_delete(self) -> requests.Response:
def cache_delete(self) -> Response:
"""Delete the cache for the vector db
Returns:
An http Response containing the result of the operation
"""
return self.ispn.cache_delete(self._cache_name)
def cache_clear(self) -> requests.Response:
def cache_clear(self) -> Response:
"""Clear the cache for the vector db
Returns:
An http Response containing the result of the operation
@@ -193,14 +229,14 @@ repeated float %s = 1;
"""
return self.ispn.cache_exists(self._cache_name)
def cache_index_clear(self) -> requests.Response:
def cache_index_clear(self) -> Response:
"""Clear the index for the vector db
Returns:
An http Response containing the result of the operation
"""
return self.ispn.index_clear(self._cache_name)
def cache_index_reindex(self) -> requests.Response:
def cache_index_reindex(self) -> Response:
"""Rebuild the index for the vector db
Returns:
An http Response containing the result of the operation
@@ -325,12 +361,16 @@ repeated float %s = 1;
def configure(self, metadata: dict, dimension: int) -> None:
schema = self.schema_builder(metadata, dimension)
output = self.schema_create(schema)
assert output.ok, "Unable to create schema. Already exists? "
assert (
output.status_code == self.ispn.Codes.OK
), "Unable to create schema. Already exists? Consider using clear_old=True"
assert json.loads(output.text)["error"] is None
if not self.cache_exists():
output = self.cache_create()
assert output.ok, "Unable to create cache. Already exists? "
assert (
output.status_code == self.ispn.Codes.OK
), "Unable to create cache. Already exists? Consider using clear_old=True"
# Ensure index is clean
self.cache_index_clear()
@@ -350,7 +390,24 @@ repeated float %s = 1;
auto_config: Optional[bool] = True,
**kwargs: Any,
) -> InfinispanVS:
"""Return VectorStore initialized from texts and embeddings."""
"""Return VectorStore initialized from texts and embeddings.
In addition to the parameters described by the super method, this
implementation accepts extra configuration parameters when a
non-default setup is needed.
Parameters
----------
ids : List[str]
Additional list of keys associated with the embeddings. If not
provided, UUIDs will be generated
clear_old : bool
Whether old data must be deleted. Default True
auto_config: bool
Whether to do a complete server setup (caches,
protobuf definition...). Default True
kwargs: Any
Rest of arguments passed to InfinispanVS. See docs"""
infinispanvs = cls(embedding=embedding, ids=ids, **kwargs)
if auto_config and len(metadatas or []) > 0:
if clear_old:
@@ -381,20 +438,83 @@ class Infinispan:
https://github.com/rigazilla/infinispan-vector#run-infinispan
"""
def __init__(self, **kwargs: Any):
self._configuration = kwargs
self._schema = str(self._configuration.get("schema", "http"))
self._host = str(self._configuration.get("hosts", ["127.0.0.1:11222"])[0])
self._default_node = self._schema + "://" + self._host
self._cache_url = str(self._configuration.get("cache_url", "/rest/v2/caches"))
self._schema_url = str(self._configuration.get("cache_url", "/rest/v2/schemas"))
self._use_post_for_query = str(
self._configuration.get("use_post_for_query", True)
)
def __init__(
self,
schema: str = "http",
user: str = "",
password: str = "",
hosts: List[str] = ["127.0.0.1:11222"],
cache_url: str = "/rest/v2/caches",
schema_url: str = "/rest/v2/schemas",
use_post_for_query: bool = True,
http2: bool = True,
verify: bool = True,
**kwargs: Any,
):
"""
Parameters
----------
schema: str
Schema for HTTP request: "http" or "https". Default "http"
user, password: str
User and password if auth is required. Default None
hosts: List[str]
List of server addresses. Default ["127.0.0.1:11222"]
cache_url: str
URL endpoint for cache API. Default "/rest/v2/caches"
schema_url: str
URL endpoint for schema API. Default "/rest/v2/schemas"
use_post_for_query: bool
Whether POST method should be used for query. Default True
http2: bool
Whether HTTP/2 protocol should be used. `pip install "httpx[http2]"` is
needed for HTTP/2. Default True
verify: bool
Whether TLS certificate must be verified. Default True
"""
def req_query(
self, query: str, cache_name: str, local: bool = False
) -> requests.Response:
try:
import httpx
except ImportError:
raise ImportError(
"Could not import httpx python package. "
"Please install it with `pip install httpx`"
'or `pip install "httpx[http2]"` if you need HTTP/2.'
)
self.Codes = httpx.codes
self._configuration = kwargs
self._schema = schema
self._user = user
self._password = password
self._host = hosts[0]
self._default_node = self._schema + "://" + self._host
self._cache_url = cache_url
self._schema_url = schema_url
self._use_post_for_query = use_post_for_query
self._http2 = http2
if self._user and self._password:
if self._schema == "http":
auth: Union[Tuple[str, str], httpx.DigestAuth] = httpx.DigestAuth(
username=self._user, password=self._password
)
else:
auth = (self._user, self._password)
self._h2c = httpx.Client(
http2=self._http2,
http1=not self._http2,
auth=auth,
verify=verify,
)
else:
self._h2c = httpx.Client(
http2=self._http2,
http1=not self._http2,
verify=verify,
)
def req_query(self, query: str, cache_name: str, local: bool = False) -> Response:
"""Request a query
Args:
query(str): query requested
@@ -409,7 +529,7 @@ class Infinispan:
def _query_post(
self, query_str: str, cache_name: str, local: bool = False
) -> requests.Response:
) -> Response:
api_url = (
self._default_node
+ self._cache_url
@@ -420,9 +540,9 @@ class Infinispan:
)
data = {"query": query_str}
data_json = json.dumps(data)
response = requests.post(
response = self._h2c.post(
api_url,
data_json,
content=data_json,
headers={"Content-Type": "application/json"},
timeout=REST_TIMEOUT,
)
@@ -430,7 +550,7 @@ class Infinispan:
def _query_get(
self, query_str: str, cache_name: str, local: bool = False
) -> requests.Response:
) -> Response:
api_url = (
self._default_node
+ self._cache_url
@@ -441,10 +561,10 @@ class Infinispan:
+ "&local="
+ str(local)
)
response = requests.get(api_url, timeout=REST_TIMEOUT)
response = self._h2c.get(api_url, timeout=REST_TIMEOUT)
return response
def post(self, key: str, data: str, cache_name: str) -> requests.Response:
def post(self, key: str, data: str, cache_name: str) -> Response:
"""Post an entry
Args:
key(str): key of the entry
@@ -454,15 +574,15 @@ class Infinispan:
An http Response containing the result of the operation
"""
api_url = self._default_node + self._cache_url + "/" + cache_name + "/" + key
response = requests.post(
response = self._h2c.post(
api_url,
data,
content=data,
headers={"Content-Type": "application/json"},
timeout=REST_TIMEOUT,
)
return response
def put(self, key: str, data: str, cache_name: str) -> requests.Response:
def put(self, key: str, data: str, cache_name: str) -> Response:
"""Put an entry
Args:
key(str): key of the entry
@@ -472,15 +592,15 @@ class Infinispan:
An http Response containing the result of the operation
"""
api_url = self._default_node + self._cache_url + "/" + cache_name + "/" + key
response = requests.put(
response = self._h2c.put(
api_url,
data,
content=data,
headers={"Content-Type": "application/json"},
timeout=REST_TIMEOUT,
)
return response
def get(self, key: str, cache_name: str) -> requests.Response:
def get(self, key: str, cache_name: str) -> Response:
"""Get an entry
Args:
key(str): key of the entry
@@ -489,12 +609,12 @@ class Infinispan:
An http Response containing the entry or errors
"""
api_url = self._default_node + self._cache_url + "/" + cache_name + "/" + key
response = requests.get(
response = self._h2c.get(
api_url, headers={"Content-Type": "application/json"}, timeout=REST_TIMEOUT
)
return response
def schema_post(self, name: str, proto: str) -> requests.Response:
def schema_post(self, name: str, proto: str) -> Response:
"""Deploy a schema
Args:
name(str): name of the schema. Will be used as a key
@@ -503,10 +623,10 @@ class Infinispan:
An http Response containing the result of the operation
"""
api_url = self._default_node + self._schema_url + "/" + name
response = requests.post(api_url, proto, timeout=REST_TIMEOUT)
response = self._h2c.post(api_url, content=proto, timeout=REST_TIMEOUT)
return response
def cache_post(self, name: str, config: str) -> requests.Response:
def cache_post(self, name: str, config: str) -> Response:
"""Create a cache
Args:
name(str): name of the cache.
@@ -515,15 +635,15 @@ class Infinispan:
An http Response containing the result of the operation
"""
api_url = self._default_node + self._cache_url + "/" + name
response = requests.post(
response = self._h2c.post(
api_url,
config,
content=config,
headers={"Content-Type": "application/json"},
timeout=REST_TIMEOUT,
)
return response
def schema_delete(self, name: str) -> requests.Response:
def schema_delete(self, name: str) -> Response:
"""Delete a schema
Args:
name(str): name of the schema.
@@ -531,10 +651,10 @@ class Infinispan:
An http Response containing the result of the operation
"""
api_url = self._default_node + self._schema_url + "/" + name
response = requests.delete(api_url, timeout=REST_TIMEOUT)
response = self._h2c.delete(api_url, timeout=REST_TIMEOUT)
return response
def cache_delete(self, name: str) -> requests.Response:
def cache_delete(self, name: str) -> Response:
"""Delete a cache
Args:
name(str): name of the cache.
@@ -542,10 +662,10 @@ class Infinispan:
An http Response containing the result of the operation
"""
api_url = self._default_node + self._cache_url + "/" + name
response = requests.delete(api_url, timeout=REST_TIMEOUT)
response = self._h2c.delete(api_url, timeout=REST_TIMEOUT)
return response
def cache_clear(self, cache_name: str) -> requests.Response:
def cache_clear(self, cache_name: str) -> Response:
"""Clear a cache
Args:
cache_name(str): name of the cache.
@@ -555,7 +675,7 @@ class Infinispan:
api_url = (
self._default_node + self._cache_url + "/" + cache_name + "?action=clear"
)
response = requests.post(api_url, timeout=REST_TIMEOUT)
response = self._h2c.post(api_url, timeout=REST_TIMEOUT)
return response
def cache_exists(self, cache_name: str) -> bool:
@@ -570,18 +690,17 @@ class Infinispan:
)
return self.resource_exists(api_url)
@staticmethod
def resource_exists(api_url: str) -> bool:
def resource_exists(self, api_url: str) -> bool:
"""Check if a resource exists
Args:
api_url(str): url of the resource.
Returns:
true if resource exists
"""
response = requests.head(api_url, timeout=REST_TIMEOUT)
return response.ok
response = self._h2c.head(api_url, timeout=REST_TIMEOUT)
return response.status_code == self.Codes.OK
def index_clear(self, cache_name: str) -> requests.Response:
def index_clear(self, cache_name: str) -> Response:
"""Clear an index on a cache
Args:
cache_name(str): name of the cache.
@@ -595,9 +714,9 @@ class Infinispan:
+ cache_name
+ "/search/indexes?action=clear"
)
return requests.post(api_url, timeout=REST_TIMEOUT)
return self._h2c.post(api_url, timeout=REST_TIMEOUT)
def index_reindex(self, cache_name: str) -> requests.Response:
def index_reindex(self, cache_name: str) -> Response:
"""Rebuild index on a cache
Args:
cache_name(str): name of the cache.
@@ -611,4 +730,4 @@ class Infinispan:
+ cache_name
+ "/search/indexes?action=reindex"
)
return requests.post(api_url, timeout=REST_TIMEOUT)
return self._h2c.post(api_url, timeout=REST_TIMEOUT)
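The REST helpers above now all go through a single shared httpx client (`self._h2c`) instead of per-call `requests` invocations, which is what makes HTTP/2, TLS verification and authentication configurable in one place. A minimal sketch of how such a client could be assembled, using illustrative names (`build_rest_client`, `user`, `password`, `verify`, `http2`) rather than the store's exact constructor:

from typing import Optional

import httpx


def build_rest_client(
    user: Optional[str] = None,
    password: Optional[str] = None,
    verify: bool = True,
    http2: bool = True,
    timeout: float = 10.0,
) -> httpx.Client:
    # One long-lived client reused by every cache/schema call.
    # http2=True requires the h2 extra: pip install "httpx[http2]"
    auth = httpx.DigestAuth(user, password) if user and password else None
    return httpx.Client(http2=http2, auth=auth, verify=verify, timeout=timeout)


# client = build_rest_client(user="user", password="password", verify=False)
# response = client.head("https://localhost:11242/rest/v2/caches/demo", timeout=10.0)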

View File

@@ -623,7 +623,7 @@ class Neo4jVector(VectorStore):
params = params or {}
try:
data, _, _ = self._driver.execute_query(
query, database=self._database, parameters_=params
query, database_=self._database, parameters_=params
)
return [r.data() for r in data]
except Neo4jError as e:
@@ -646,7 +646,7 @@ class Neo4jVector(VectorStore):
):
raise
# Fallback to allow implicit transactions
with self._driver.session() as session:
with self._driver.session(database=self._database) as session:
data = session.run(Query(text=query), params)
return [r.data() for r in data]

View File

@@ -1,6 +1,6 @@
from __future__ import annotations
from typing import Any, Dict, Iterable, List, Optional, Tuple
from typing import Any, Dict, Iterable, List, Optional, Tuple, Union
import numpy as np
from langchain_core.documents import Document
@@ -42,7 +42,7 @@ class USearch(VectorStore):
self,
texts: Iterable[str],
metadatas: Optional[List[Dict]] = None,
ids: Optional[np.ndarray] = None,
ids: Optional[Union[np.ndarray, list[str]]] = None,
**kwargs: Any,
) -> List[str]:
"""Run more texts through the embeddings and add to the vectorstore.
@@ -69,6 +69,8 @@ class USearch(VectorStore):
last_id = int(self.ids[-1]) + 1
if ids is None:
ids = np.array([str(last_id + id) for id, _ in enumerate(texts)])
elif isinstance(ids, list):
ids = np.array(ids)
self.index.add(np.array(ids), np.array(embeddings))
self.docstore.add(dict(zip(ids, documents)))
@@ -134,7 +136,7 @@ class USearch(VectorStore):
texts: List[str],
embedding: Embeddings,
metadatas: Optional[List[Dict]] = None,
ids: Optional[np.ndarray] = None,
ids: Optional[Union[np.ndarray, list[str]]] = None,
metric: str = "cos",
**kwargs: Any,
) -> USearch:
@@ -159,6 +161,8 @@ class USearch(VectorStore):
documents: List[Document] = []
if ids is None:
ids = np.array([str(id) for id, _ in enumerate(texts)])
elif isinstance(ids, list):
ids = np.array(ids)
for i, text in enumerate(texts):
metadata = metadatas[i] if metadatas else {}
documents.append(Document(page_content=text, metadata=metadata))
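With this hunk `ids` may be given either as an `np.ndarray` or as a plain list of strings; list input is converted with `np.array` before being handed to the index. A small usage sketch (the embedding is a placeholder and the `usearch` package must be installed):

import numpy as np

from langchain_core.embeddings import FakeEmbeddings  # placeholder embedding
from langchain_community.vectorstores import USearch  # requires: pip install usearch

store = USearch.from_texts(
    texts=["foo", "bar"],
    embedding=FakeEmbeddings(size=8),
    ids=["doc-1", "doc-2"],  # a plain list of strings is now accepted...
)
store.add_texts(["baz"], ids=np.array(["doc-3"]))  # ...alongside the original ndarray form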

File diff suppressed because it is too large

View File

@@ -4,7 +4,7 @@ build-backend = "poetry.core.masonry.api"
[tool.poetry]
name = "langchain-community"
version = "0.3.1"
version = "0.3.2"
description = "Community contributed LangChain integrations."
authors = []
license = "MIT"
@@ -33,13 +33,13 @@ ignore-words-list = "momento,collison,ned,foor,reworkd,parth,whats,aapply,mysogy
[tool.poetry.dependencies]
python = ">=3.9,<4.0"
langchain-core = "^0.3.6"
langchain = "^0.3.1"
langchain-core = "^0.3.10"
langchain = "^0.3.3"
SQLAlchemy = ">=1.4,<3"
requests = "^2"
PyYAML = ">=5.3"
aiohttp = "^3.8.3"
tenacity = "^8.1.0,!=8.4.0"
tenacity = ">=8.1.0,!=8.4.0,<10"
dataclasses-json = ">= 0.5.7, < 0.7"
pydantic-settings = "^2.4.0"
langsmith = "^0.1.125"

View File

@@ -0,0 +1,19 @@
from pydantic import BaseModel, Field
from langchain_community.chat_models import ChatLlamaCpp
class Joke(BaseModel):
"""Joke to tell user."""
setup: str = Field(description="question to set up a joke")
punchline: str = Field(description="answer to resolve the joke")
# TODO: replace with standard integration tests
# See example in tests/integration_tests/chat_models/test_litellm.py
def test_structured_output() -> None:
llm = ChatLlamaCpp(model_path="/path/to/Meta-Llama-3.1-8B-Instruct.Q4_K_M.gguf")
structured_llm = llm.with_structured_output(Joke)
result = structured_llm.invoke("Tell me a short joke about cats.")
assert isinstance(result, Joke)

View File

@@ -1,6 +1,9 @@
from langchain_core.messages import AIMessage, HumanMessage
from langchain_community.chat_models.sambanova import ChatSambaNovaCloud
from langchain_community.chat_models.sambanova import (
ChatSambaNovaCloud,
ChatSambaStudio,
)
def test_chat_sambanova_cloud() -> None:
@@ -9,3 +12,11 @@ def test_chat_sambanova_cloud() -> None:
response = chat.invoke([message])
assert isinstance(response, AIMessage)
assert isinstance(response.content, str)
def test_chat_sambastudio() -> None:
chat = ChatSambaStudio()
message = HumanMessage(content="Hello")
response = chat.invoke([message])
assert isinstance(response, AIMessage)
assert isinstance(response.content, str)

View File

@@ -0,0 +1,4 @@
#!/bin/sh
cd infinispan
docker compose up

View File

@@ -0,0 +1,2 @@
#Fri May 03 10:19:58 CEST 2024
user=ADMIN,admin

View File

@@ -0,0 +1,62 @@
<infinispan
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
xsi:schemaLocation="urn:infinispan:config:15.0 https://infinispan.org/schemas/infinispan-config-15.0.xsd
urn:infinispan:server:15.0 https://infinispan.org/schemas/infinispan-server-15.0.xsd"
xmlns="urn:infinispan:config:15.0"
xmlns:server="urn:infinispan:server:15.0">
<cache-container name="default" statistics="true">
<transport cluster="${infinispan.cluster.name:cluster}" stack="${infinispan.cluster.stack:tcp}" node-name="${infinispan.node.name:}"/>
</cache-container>
<server xmlns="urn:infinispan:server:15.0">
<interfaces>
<interface name="public">
<inet-address value="${infinispan.bind.address:127.0.0.1}"/>
</interface>
</interfaces>
<socket-bindings default-interface="public" port-offset="${infinispan.socket.binding.port-offset:0}">
<socket-binding name="default" port="${infinispan.bind.port:11222}"/>
<socket-binding name="authenticated" port="11232"/>
<socket-binding name="auth-tls" port="11242"/>
</socket-bindings>
<security>
<credential-stores>
<credential-store name="credentials" path="credentials.pfx">
<clear-text-credential clear-text="secret"/>
</credential-store>
</credential-stores>
<security-realms>
<security-realm name="default">
<properties-realm groups-attribute="Roles">
<user-properties path="/user-config/users.properties"/>
<group-properties path="/user-config/groups.properties"/>
</properties-realm>
</security-realm>
<security-realm name="tls">
<!-- Uncomment to enable TLS on the realm -->
<server-identities>
<ssl>
<keystore path="application.keystore"
password="password" alias="server"
generate-self-signed-certificate-host="localhost"/>
</ssl>
</server-identities>
<properties-realm groups-attribute="Roles">
<user-properties path="/user-config/users.properties"/>
<group-properties path="/user-config/groups.properties"/>
</properties-realm>
</security-realm>
</security-realms>
</security>
<endpoints>
<endpoint socket-binding="default"/>
<endpoint socket-binding="authenticated" security-realm="default"/>
<endpoint socket-binding="auth-tls" security-realm="tls"/>
</endpoints>
</server>
</infinispan>

View File

@@ -0,0 +1,4 @@
#$REALM_NAME=default$
#$ALGORITHM=encrypted$
#Fri May 03 10:19:58 CEST 2024
user=scram-sha-1\:BYGcIAws2gznU/kpezoSb1VQNVd+YMX9r+9SAINFoZtPHaHTAQ\=\=;scram-sha-256\:BYGcIAwRiWiD+8f7dyQEs1Wsum/64MOcjGJ2UcmZFQB6DZJqwRDJ4NrvII4NttmxlA\=\=;scram-sha-384\:BYGcIAz+Eud65N8GWK4TMwhSCZpeE5EFSdynywdryQj3ZwBEgv+KF8hRUuGxiq3EyRxsby6w7DHK3CICGZLsPrM\=;scram-sha-512\:BYGcIAwWxVY9DHn42kHydivyU3s9LSPmyfPPJkIFYyt/XsMASFHGoy5rzk4ahX4HjpJgb+NjdCwhGfi33CY0azUIrn439s62Yg5mq9i+ISto;digest-md5\:AgR1c2VyB2RlZmF1bHSYYyzPjRDR7MhrsdFSK03P;digest-sha\:AgR1c2VyB2RlZmF1bHTga5gDNnNYh7/2HqhBVOdUHjBzhw\=\=;digest-sha-256\:AgR1c2VyB2RlZmF1bHTig5qZQIxqtJBTUp3EMh5UIFoS4qOhz9Uk5aOW9ZKCfw\=\=;digest-sha-384\:AgR1c2VyB2RlZmF1bHT01pAN/pRMLS5afm4Q9S0kuLlA0NokuP8F0AISTwXCb1E8RMsFHlBVPOa5rC6Nyso\=;digest-sha-512\:AgR1c2VyB2RlZmF1bHTi+cHn1Ez2Ze41CvPXb9eP/7JmRys7m1f5qPMQWhAmDOuuUXNWEG4yKSI9k2EZgQvMKTd5hDbR24ul1BsYP8X5;

View File

@@ -0,0 +1,16 @@
version: "3.7"
services:
infinispan:
image: quay.io/infinispan/server:15.0
ports:
- '11222:11222'
- '11232:11232'
- '11242:11242'
deploy:
resources:
limits:
memory: 25Gb
volumes:
- ./conf:/user-config
command: -c /user-config/infinispan.xml

View File

@@ -17,6 +17,17 @@ from tests.integration_tests.vectorstores.fake_embeddings import (
)
def _strip_docs(documents: List[Document]) -> List[Document]:
return [_strip_doc(doc) for doc in documents]
def _strip_doc(document: Document) -> Document:
return Document(
page_content=document.page_content,
metadata=document.metadata,
)
def _vectorstore_from_texts(
texts: List[str],
metadatas: Optional[List[dict]] = None,
@@ -110,9 +121,9 @@ async def test_cassandra() -> None:
texts = ["foo", "bar", "baz"]
docsearch = _vectorstore_from_texts(texts)
output = docsearch.similarity_search("foo", k=1)
assert output == [Document(page_content="foo")]
assert _strip_docs(output) == _strip_docs([Document(page_content="foo")])
output = await docsearch.asimilarity_search("foo", k=1)
assert output == [Document(page_content="foo")]
assert _strip_docs(output) == _strip_docs([Document(page_content="foo")])
async def test_cassandra_with_score() -> None:
@@ -130,13 +141,13 @@ async def test_cassandra_with_score() -> None:
output = docsearch.similarity_search_with_score("foo", k=3)
docs = [o[0] for o in output]
scores = [o[1] for o in output]
assert docs == expected_docs
assert _strip_docs(docs) == _strip_docs(expected_docs)
assert scores[0] > scores[1] > scores[2]
output = await docsearch.asimilarity_search_with_score("foo", k=3)
docs = [o[0] for o in output]
scores = [o[1] for o in output]
assert docs == expected_docs
assert _strip_docs(docs) == _strip_docs(expected_docs)
assert scores[0] > scores[1] > scores[2]
@@ -239,7 +250,7 @@ async def test_cassandra_no_drop_async() -> None:
def test_cassandra_delete() -> None:
"""Test delete methods from vector store."""
texts = ["foo", "bar", "baz", "gni"]
metadatas = [{"page": i} for i in range(len(texts))]
metadatas = [{"page": i, "mod2": i % 2} for i in range(len(texts))]
docsearch = _vectorstore_from_texts([], metadatas=metadatas)
ids = docsearch.add_texts(texts, metadatas)
@@ -263,11 +274,21 @@ def test_cassandra_delete() -> None:
output = docsearch.similarity_search("foo", k=10)
assert len(output) == 0
docsearch.add_texts(texts, metadatas)
num_deleted = docsearch.delete_by_metadata_filter({"mod2": 0}, batch_size=1)
assert num_deleted == 2
output = docsearch.similarity_search("foo", k=10)
assert len(output) == 2
docsearch.clear()
with pytest.raises(ValueError):
docsearch.delete_by_metadata_filter({})
async def test_cassandra_adelete() -> None:
"""Test delete methods from vector store."""
texts = ["foo", "bar", "baz", "gni"]
metadatas = [{"page": i} for i in range(len(texts))]
metadatas = [{"page": i, "mod2": i % 2} for i in range(len(texts))]
docsearch = await _vectorstore_from_texts_async([], metadatas=metadatas)
ids = await docsearch.aadd_texts(texts, metadatas)
@@ -291,6 +312,16 @@ async def test_cassandra_adelete() -> None:
output = docsearch.similarity_search("foo", k=10)
assert len(output) == 0
await docsearch.aadd_texts(texts, metadatas)
num_deleted = await docsearch.adelete_by_metadata_filter({"mod2": 0}, batch_size=1)
assert num_deleted == 2
output = await docsearch.asimilarity_search("foo", k=10)
assert len(output) == 2
await docsearch.aclear()
with pytest.raises(ValueError):
await docsearch.adelete_by_metadata_filter({})
def test_cassandra_metadata_indexing() -> None:
"""Test comparing metadata indexing policies."""
@@ -316,3 +347,107 @@ def test_cassandra_metadata_indexing() -> None:
with pytest.raises(ValueError):
# "Non-indexed metadata fields cannot be used in queries."
vstore_f1.similarity_search("bar", filter={"field2": "b"}, k=2)
def test_cassandra_replace_metadata() -> None:
"""Test of replacing metadata."""
N_DOCS = 100
REPLACE_RATIO = 2 # one in ... will have replaced metadata
BATCH_SIZE = 3
vstore_f1 = _vectorstore_from_texts(
texts=[],
metadata_indexing=("allowlist", ["field1", "field2"]),
table_name="vector_test_table_indexing",
)
orig_documents = [
Document(
page_content=f"doc_{doc_i}",
id=f"doc_id_{doc_i}",
metadata={"field1": f"f1_{doc_i}", "otherf": "pre"},
)
for doc_i in range(N_DOCS)
]
vstore_f1.add_documents(orig_documents)
ids_to_replace = [
f"doc_id_{doc_i}" for doc_i in range(N_DOCS) if doc_i % REPLACE_RATIO == 0
]
# various kinds of replacement at play here:
def _make_new_md(mode: int, doc_id: str) -> dict[str, str]:
if mode == 0:
return {}
elif mode == 1:
return {"field2": f"NEW_{doc_id}"}
elif mode == 2:
return {"field2": f"NEW_{doc_id}", "ofherf2": "post"}
else:
return {"ofherf2": "post"}
ids_to_new_md = {
doc_id: _make_new_md(rep_i % 4, doc_id)
for rep_i, doc_id in enumerate(ids_to_replace)
}
vstore_f1.replace_metadata(ids_to_new_md, batch_size=BATCH_SIZE)
# thorough check
expected_id_to_metadata: dict[str, dict] = {
**{(document.id or ""): document.metadata for document in orig_documents},
**ids_to_new_md,
}
for hit in vstore_f1.similarity_search("doc", k=N_DOCS + 1):
assert hit.id is not None
assert hit.metadata == expected_id_to_metadata[hit.id]
async def test_cassandra_areplace_metadata() -> None:
"""Test of replacing metadata."""
N_DOCS = 100
REPLACE_RATIO = 2 # one in ... will have replaced metadata
BATCH_SIZE = 3
vstore_f1 = _vectorstore_from_texts(
texts=[],
metadata_indexing=("allowlist", ["field1", "field2"]),
table_name="vector_test_table_indexing",
)
orig_documents = [
Document(
page_content=f"doc_{doc_i}",
id=f"doc_id_{doc_i}",
metadata={"field1": f"f1_{doc_i}", "otherf": "pre"},
)
for doc_i in range(N_DOCS)
]
await vstore_f1.aadd_documents(orig_documents)
ids_to_replace = [
f"doc_id_{doc_i}" for doc_i in range(N_DOCS) if doc_i % REPLACE_RATIO == 0
]
# various kinds of replacement at play here:
def _make_new_md(mode: int, doc_id: str) -> dict[str, str]:
if mode == 0:
return {}
elif mode == 1:
return {"field2": f"NEW_{doc_id}"}
elif mode == 2:
return {"field2": f"NEW_{doc_id}", "ofherf2": "post"}
else:
return {"ofherf2": "post"}
ids_to_new_md = {
doc_id: _make_new_md(rep_i % 4, doc_id)
for rep_i, doc_id in enumerate(ids_to_replace)
}
await vstore_f1.areplace_metadata(ids_to_new_md, concurrency=BATCH_SIZE)
# thorough check
expected_id_to_metadata: dict[str, dict] = {
**{(document.id or ""): document.metadata for document in orig_documents},
**ids_to_new_md,
}
for hit in await vstore_f1.asimilarity_search("doc", k=N_DOCS + 1):
assert hit.id is not None
assert hit.metadata == expected_id_to_metadata[hit.id]
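The added assertions exercise the new metadata-level operations: `delete_by_metadata_filter` / `adelete_by_metadata_filter` remove every row matching a filter (an empty filter raises `ValueError`), and `replace_metadata` / `areplace_metadata` overwrite, rather than merge, the metadata of the given document ids. A hedged usage sketch, where `vstore` is assumed to be an already-constructed Cassandra vector store:

# `vstore` is assumed to be an existing langchain_community.vectorstores.Cassandra instance.
n_deleted = vstore.delete_by_metadata_filter({"mod2": 0}, batch_size=50)
print(f"removed {n_deleted} matching rows")

# Replacement is total: the supplied dict becomes the new metadata; an empty dict clears it.
vstore.replace_metadata(
    {"doc_id_0": {"field2": "NEW_doc_id_0"}, "doc_id_2": {}},
    batch_size=50,
)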

View File

@@ -1,7 +1,9 @@
"""Test Infinispan functionality."""
import warnings
from typing import Any, List, Optional
import httpx
import pytest
from langchain_core.documents import Document
@@ -11,9 +13,18 @@ from tests.integration_tests.vectorstores.fake_embeddings import (
fake_texts,
)
"""
cd tests/integration_tests/vectorstores/docker-compose
./infinispan.sh
def _infinispan_setup_noautoconf() -> None:
ispnvs = InfinispanVS(auto_config=False)
The current Infinispan implementation relies on httpx: run `pip install "httpx[http2]"`
if it is not installed. HTTP/2 is enabled by default; if it is not
wanted, use `pip install "httpx"`.
"""
def _infinispan_setup_noautoconf(**kwargs: Any) -> None:
ispnvs = InfinispanVS(http2=_hasHttp2(), auto_config=False, **kwargs)
ispnvs.cache_delete()
ispnvs.schema_delete()
proto = """
@@ -54,64 +65,104 @@ def _infinispanvs_from_texts(
ids=ids,
clear_old=clear_old,
auto_config=auto_config,
http2=_hasHttp2(),
**kwargs,
)
def _hasHttp2() -> bool:
try:
httpx.Client(http2=True)
return True
except Exception:
return False
@pytest.mark.parametrize("autoconfig", [False, True])
@pytest.mark.parametrize(
"conn_opts",
[
{},
{
"user": "user",
"password": "password",
"hosts": ["localhost:11232"],
"schema": "http",
},
{
"user": "user",
"password": "password",
"hosts": ["localhost:11242"],
"schema": "https",
"verify": False,
},
],
)
class TestBasic:
def test_infinispan(self, autoconfig: bool) -> None:
def test_infinispan(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test end to end construction and search."""
if not autoconfig:
_infinispan_setup_noautoconf()
docsearch = _infinispanvs_from_texts(auto_config=autoconfig)
_infinispan_setup_noautoconf(**conn_opts)
docsearch = _infinispanvs_from_texts(auto_config=autoconfig, **conn_opts)
output = docsearch.similarity_search("foo", k=1)
assert output == [Document(page_content="foo")]
def test_infinispan_with_metadata(self, autoconfig: bool) -> None:
def test_infinispan_with_auth(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test end to end construction and search."""
if not autoconfig:
_infinispan_setup_noautoconf(**conn_opts)
docsearch = _infinispanvs_from_texts(auto_config=autoconfig, **conn_opts)
output = docsearch.similarity_search("foo", k=1)
assert output == [Document(page_content="foo")]
def test_infinispan_with_metadata(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test with metadata"""
if not autoconfig:
_infinispan_setup_noautoconf()
_infinispan_setup_noautoconf(**conn_opts)
meta = []
for _ in range(len(fake_texts)):
meta.append({"label": "test"})
docsearch = _infinispanvs_from_texts(metadatas=meta, auto_config=autoconfig)
docsearch = _infinispanvs_from_texts(
metadatas=meta, auto_config=autoconfig, **conn_opts
)
output = docsearch.similarity_search("foo", k=1)
assert output == [Document(page_content="foo", metadata={"label": "test"})]
def test_infinispan_with_metadata_with_output_fields(
self, autoconfig: bool
self, autoconfig: bool, conn_opts: dict
) -> None:
"""Test with metadata"""
if not autoconfig:
_infinispan_setup_noautoconf()
_infinispan_setup_noautoconf(**conn_opts)
metadatas = [
{"page": i, "label": "label" + str(i)} for i in range(len(fake_texts))
]
c = {"output_fields": ["label", "page", "text"]}
docsearch = _infinispanvs_from_texts(
metadatas=metadatas, configuration=c, auto_config=autoconfig
metadatas=metadatas, configuration=c, auto_config=autoconfig, **conn_opts
)
output = docsearch.similarity_search("foo", k=1)
assert output == [
Document(page_content="foo", metadata={"label": "label0", "page": 0})
]
def test_infinispanvs_with_id(self, autoconfig: bool) -> None:
def test_infinispanvs_with_id(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test with ids"""
ids = ["id_" + str(i) for i in range(len(fake_texts))]
docsearch = _infinispanvs_from_texts(ids=ids, auto_config=autoconfig)
docsearch = _infinispanvs_from_texts(
ids=ids, auto_config=autoconfig, **conn_opts
)
output = docsearch.similarity_search("foo", k=1)
assert output == [Document(page_content="foo")]
def test_infinispan_with_score(self, autoconfig: bool) -> None:
def test_infinispan_with_score(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test end to end construction and search with scores and IDs."""
if not autoconfig:
_infinispan_setup_noautoconf()
_infinispan_setup_noautoconf(**conn_opts)
texts = ["foo", "bar", "baz"]
metadatas = [{"page": i} for i in range(len(texts))]
docsearch = _infinispanvs_from_texts(
metadatas=metadatas, auto_config=autoconfig
metadatas=metadatas, auto_config=autoconfig, **conn_opts
)
output = docsearch.similarity_search_with_score("foo", k=3)
docs = [o[0] for o in output]
@@ -123,14 +174,14 @@ class TestBasic:
]
assert scores[0] >= scores[1] >= scores[2]
def test_infinispan_add_texts(self, autoconfig: bool) -> None:
def test_infinispan_add_texts(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test end to end construction and MRR search."""
if not autoconfig:
_infinispan_setup_noautoconf()
_infinispan_setup_noautoconf(**conn_opts)
texts = ["foo", "bar", "baz"]
metadatas = [{"page": i} for i in range(len(texts))]
docsearch = _infinispanvs_from_texts(
metadatas=metadatas, auto_config=autoconfig
metadatas=metadatas, auto_config=autoconfig, **conn_opts
)
docsearch.add_texts(texts, metadatas)
@@ -138,19 +189,22 @@ class TestBasic:
output = docsearch.similarity_search("foo", k=10)
assert len(output) == 6
def test_infinispan_no_clear_old(self, autoconfig: bool) -> None:
def test_infinispan_no_clear_old(self, autoconfig: bool, conn_opts: dict) -> None:
"""Test end to end construction and MRR search."""
if not autoconfig:
_infinispan_setup_noautoconf()
_infinispan_setup_noautoconf(**conn_opts)
texts = ["foo", "bar", "baz"]
metadatas = [{"page": i} for i in range(len(texts))]
docsearch = _infinispanvs_from_texts(
metadatas=metadatas, auto_config=autoconfig
metadatas=metadatas, auto_config=autoconfig, **conn_opts
)
del docsearch
try:
docsearch = _infinispanvs_from_texts(
metadatas=metadatas, clear_old=False, auto_config=autoconfig
metadatas=metadatas,
clear_old=False,
auto_config=autoconfig,
**conn_opts,
)
except AssertionError:
if autoconfig:
@@ -159,3 +213,12 @@ class TestBasic:
raise
output = docsearch.similarity_search("foo", k=10)
assert len(output) == 6
class TestHttp2:
def test_http2(self) -> None:
try:
httpx.Client(http2=True)
except Exception:
warnings.warn('pip install "httpx[http2]" if you need HTTP/2')
pass
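The `conn_opts` parametrization mirrors how a caller would target the three endpoints exposed by the docker-compose setup above: anonymous on 11222, authenticated on 11232, and authenticated over TLS on 11242. A rough sketch of the corresponding construction, assuming the keyword names the tests pass through (`user`, `password`, `hosts`, `schema`, `verify`, `http2`) and a placeholder embedding:

from langchain_core.embeddings import FakeEmbeddings  # placeholder embedding
from langchain_community.vectorstores import InfinispanVS

vs = InfinispanVS.from_texts(
    texts=["foo", "bar"],
    embedding=FakeEmbeddings(size=8),
    user="user",
    password="password",
    hosts=["localhost:11242"],
    schema="https",   # "http" with hosts=["localhost:11232"] for the plain authenticated endpoint
    verify=False,     # the compose setup generates a self-signed certificate
    http2=True,       # requires: pip install "httpx[http2]"
)
print(vs.similarity_search("foo", k=1))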

View File

@@ -35,6 +35,7 @@ EXPECTED_ALL = [
"ChatPerplexity",
"ChatPremAI",
"ChatSambaNovaCloud",
"ChatSambaStudio",
"ChatSparkLLM",
"ChatTongyi",
"ChatVertexAI",

View File

@@ -46,7 +46,7 @@ lint lint_diff lint_package lint_tests:
format format_diff:
[ "$(PYTHON_FILES)" = "" ] || poetry run ruff format $(PYTHON_FILES)
[ "$(PYTHON_FILES)" = "" ] || poetry run ruff check --select I --fix $(PYTHON_FILES)
[ "$(PYTHON_FILES)" = "" ] || poetry run ruff check --fix $(PYTHON_FILES)
spell_check:
poetry run codespell --toml pyproject.toml

View File

@@ -51,15 +51,18 @@ def _validate_deprecation_params(
) -> None:
"""Validate the deprecation parameters."""
if pending and removal:
raise ValueError("A pending deprecation cannot have a scheduled removal")
msg = "A pending deprecation cannot have a scheduled removal"
raise ValueError(msg)
if alternative and alternative_import:
raise ValueError("Cannot specify both alternative and alternative_import")
msg = "Cannot specify both alternative and alternative_import"
raise ValueError(msg)
if alternative_import and "." not in alternative_import:
raise ValueError(
msg = (
"alternative_import must be a fully qualified module path. Got "
f" {alternative_import}"
)
raise ValueError(msg)
def deprecated(
@@ -222,7 +225,8 @@ def deprecated(
if not _obj_type:
_obj_type = "attribute"
if not _name:
raise ValueError(f"Field {obj} must have a name to be deprecated.")
msg = f"Field {obj} must have a name to be deprecated."
raise ValueError(msg)
old_doc = obj.description
def finalize(wrapper: Callable[..., Any], new_doc: str) -> T:
@@ -241,7 +245,8 @@ def deprecated(
if not _obj_type:
_obj_type = "attribute"
if not _name:
raise ValueError(f"Field {obj} must have a name to be deprecated.")
msg = f"Field {obj} must have a name to be deprecated."
raise ValueError(msg)
old_doc = obj.description
def finalize(wrapper: Callable[..., Any], new_doc: str) -> T:
@@ -428,10 +433,11 @@ def warn_deprecated(
if not pending:
if not removal:
removal = f"in {removal}" if removal else "within ?? minor releases"
raise NotImplementedError(
msg = (
f"Need to determine which default deprecation schedule to use. "
f"{removal}"
)
raise NotImplementedError(msg)
else:
removal = f"in {removal}"
@@ -523,9 +529,8 @@ def rename_parameter(
@functools.wraps(f)
def wrapper(*args: _P.args, **kwargs: _P.kwargs) -> _R:
if new in kwargs and old in kwargs:
raise TypeError(
f"{f.__name__}() got multiple values for argument {new!r}"
)
msg = f"{f.__name__}() got multiple values for argument {new!r}"
raise TypeError(msg)
if old in kwargs:
warn_deprecated(
since,

View File

@@ -59,7 +59,8 @@ def _key_from_id(id_: str) -> str:
elif wout_prefix.endswith(CONTEXT_CONFIG_SUFFIX_SET):
return wout_prefix[: -len(CONTEXT_CONFIG_SUFFIX_SET)]
else:
raise ValueError(f"Invalid context config id {id_}")
msg = f"Invalid context config id {id_}"
raise ValueError(msg)
def _config_with_context(
@@ -103,16 +104,15 @@ def _config_with_context(
for dep in deps_by_key[key]:
if key in deps_by_key[dep]:
raise ValueError(
f"Deadlock detected between context keys {key} and {dep}"
)
msg = f"Deadlock detected between context keys {key} and {dep}"
raise ValueError(msg)
if len(setters) != 1:
raise ValueError(f"Expected exactly one setter for context key {key}")
msg = f"Expected exactly one setter for context key {key}"
raise ValueError(msg)
setter_idx = setters[0][1]
if any(getter_idx < setter_idx for _, getter_idx in getters):
raise ValueError(
f"Context setter for key {key} must be defined after all getters."
)
msg = f"Context setter for key {key} must be defined after all getters."
raise ValueError(msg)
if getters:
context_funcs[getters[0][0].id] = partial(getter, events[key], values)
@@ -271,9 +271,8 @@ class ContextSet(RunnableSerializable):
if spec.id.endswith(CONTEXT_CONFIG_SUFFIX_GET):
getter_key = spec.id.split("/")[1]
if getter_key in self.keys:
raise ValueError(
f"Circular reference in context setter for key {getter_key}"
)
msg = f"Circular reference in context setter for key {getter_key}"
raise ValueError(msg)
return super().config_specs + [
ConfigurableFieldSpec(
id=id_,

View File

@@ -160,7 +160,8 @@ class InMemoryCache(BaseCache):
"""
self._cache: dict[tuple[str, str], RETURN_VAL_TYPE] = {}
if maxsize is not None and maxsize <= 0:
raise ValueError("maxsize must be greater than 0")
msg = "maxsize must be greater than 0"
raise ValueError(msg)
self._maxsize = maxsize
def lookup(self, prompt: str, llm_string: str) -> Optional[RETURN_VAL_TYPE]:

View File

@@ -275,9 +275,8 @@ class CallbackManagerMixin:
"""
# NotImplementedError is thrown intentionally
# Callback handler will fall back to on_llm_start if this is exception is thrown
raise NotImplementedError(
f"{self.__class__.__name__} does not implement `on_chat_model_start`"
)
msg = f"{self.__class__.__name__} does not implement `on_chat_model_start`"
raise NotImplementedError(msg)
def on_retriever_start(
self,
@@ -523,9 +522,8 @@ class AsyncCallbackHandler(BaseCallbackHandler):
"""
# NotImplementedError is thrown intentionally
# Callback handler will fall back to on_llm_start if this is exception is thrown
raise NotImplementedError(
f"{self.__class__.__name__} does not implement `on_chat_model_start`"
)
msg = f"{self.__class__.__name__} does not implement `on_chat_model_start`"
raise NotImplementedError(msg)
async def on_llm_new_token(
self,

View File

@@ -1510,11 +1510,12 @@ class CallbackManager(BaseCallbackManager):
.. versionadded:: 0.2.14
"""
if kwargs:
raise ValueError(
msg = (
"The dispatcher API does not accept additional keyword arguments."
"Please do not pass any additional keyword arguments, instead "
"include them in the data field."
)
raise ValueError(msg)
if run_id is None:
run_id = uuid.uuid4()
@@ -1729,7 +1730,12 @@ class AsyncCallbackManager(BaseCallbackManager):
to each prompt.
"""
tasks = []
inline_tasks = []
non_inline_tasks = []
inline_handlers = [handler for handler in self.handlers if handler.run_inline]
non_inline_handlers = [
handler for handler in self.handlers if not handler.run_inline
]
managers = []
for prompt in prompts:
@@ -1739,20 +1745,36 @@ class AsyncCallbackManager(BaseCallbackManager):
else:
run_id_ = uuid.uuid4()
tasks.append(
ahandle_event(
self.handlers,
"on_llm_start",
"ignore_llm",
serialized,
[prompt],
run_id=run_id_,
parent_run_id=self.parent_run_id,
tags=self.tags,
metadata=self.metadata,
**kwargs,
if inline_handlers:
inline_tasks.append(
ahandle_event(
inline_handlers,
"on_llm_start",
"ignore_llm",
serialized,
[prompt],
run_id=run_id_,
parent_run_id=self.parent_run_id,
tags=self.tags,
metadata=self.metadata,
**kwargs,
)
)
else:
non_inline_tasks.append(
ahandle_event(
non_inline_handlers,
"on_llm_start",
"ignore_llm",
serialized,
[prompt],
run_id=run_id_,
parent_run_id=self.parent_run_id,
tags=self.tags,
metadata=self.metadata,
**kwargs,
)
)
)
managers.append(
AsyncCallbackManagerForLLMRun(
@@ -1767,7 +1789,13 @@ class AsyncCallbackManager(BaseCallbackManager):
)
)
await asyncio.gather(*tasks)
# Run inline tasks sequentially
for inline_task in inline_tasks:
await inline_task
# Run non-inline tasks concurrently
if non_inline_tasks:
await asyncio.gather(*non_inline_tasks)
return managers
@@ -1791,7 +1819,8 @@ class AsyncCallbackManager(BaseCallbackManager):
async callback managers, one for each LLM Run
corresponding to each inner message list.
"""
tasks = []
inline_tasks = []
non_inline_tasks = []
managers = []
for message_list in messages:
@@ -1801,9 +1830,9 @@ class AsyncCallbackManager(BaseCallbackManager):
else:
run_id_ = uuid.uuid4()
tasks.append(
ahandle_event(
self.handlers,
for handler in self.handlers:
task = ahandle_event(
[handler],
"on_chat_model_start",
"ignore_chat_model",
serialized,
@@ -1814,7 +1843,10 @@ class AsyncCallbackManager(BaseCallbackManager):
metadata=self.metadata,
**kwargs,
)
)
if handler.run_inline:
inline_tasks.append(task)
else:
non_inline_tasks.append(task)
managers.append(
AsyncCallbackManagerForLLMRun(
@@ -1829,7 +1861,14 @@ class AsyncCallbackManager(BaseCallbackManager):
)
)
await asyncio.gather(*tasks)
# Run inline tasks sequentially
for task in inline_tasks:
await task
# Run non-inline tasks concurrently
if non_inline_tasks:
await asyncio.gather(*non_inline_tasks)
return managers
async def on_chain_start(
@@ -1951,11 +1990,12 @@ class AsyncCallbackManager(BaseCallbackManager):
run_id = uuid.uuid4()
if kwargs:
raise ValueError(
msg = (
"The dispatcher API does not accept additional keyword arguments."
"Please do not pass any additional keyword arguments, instead "
"include them in the data field."
)
raise ValueError(msg)
await ahandle_event(
self.handlers,
"on_custom_event",
@@ -2298,11 +2338,12 @@ def _configure(
if v1_tracing_enabled_ and not tracing_v2_enabled_:
# if both are enabled, can silently ignore the v1 tracer
raise RuntimeError(
msg = (
"Tracing using LangChainTracerV1 is no longer supported. "
"Please set the LANGCHAIN_TRACING_V2 environment variable to enable "
"tracing instead."
)
raise RuntimeError(msg)
tracer_project = _get_tracer_project()
debug = _get_debug()
@@ -2481,13 +2522,14 @@ async def adispatch_custom_event(
# within a tool or a lambda and have the metadata events associated
# with the parent run rather than have a new run id generated for each.
if callback_manager.parent_run_id is None:
raise RuntimeError(
msg = (
"Unable to dispatch an adhoc event without a parent run id."
"This function can only be called from within an existing run (e.g.,"
"inside a tool or a RunnableLambda or a RunnableGenerator.)"
"If you are doing that and still seeing this error, try explicitly"
"passing the config parameter to this function."
)
raise RuntimeError(msg)
await callback_manager.on_custom_event(
name,
@@ -2550,13 +2592,14 @@ def dispatch_custom_event(
# within a tool or a lambda and have the metadata events associated
# with the parent run rather than have a new run id generated for each.
if callback_manager.parent_run_id is None:
raise RuntimeError(
msg = (
"Unable to dispatch an adhoc event without a parent run id."
"This function can only be called from within an existing run (e.g.,"
"inside a tool or a RunnableLambda or a RunnableGenerator.)"
"If you are doing that and still seeing this error, try explicitly"
"passing the config parameter to this function."
)
raise RuntimeError(msg)
callback_manager.on_custom_event(
name,
data,
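Earlier in this file, the reworked `on_llm_start` / `on_chat_model_start` fan-out splits handlers by their `run_inline` flag: inline handlers are awaited one at a time to preserve ordering, while the rest are still gathered concurrently. Stripped of the callback plumbing, the scheduling pattern is roughly as follows (the handler attributes here are illustrative):

import asyncio


async def dispatch(handlers, event):
    inline = [h.handle(event) for h in handlers if h.run_inline]
    non_inline = [h.handle(event) for h in handlers if not h.run_inline]

    for coro in inline:          # sequential: preserves ordering for inline handlers
        await coro
    if non_inline:               # concurrent: ordering does not matter here
        await asyncio.gather(*non_inline)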

View File

@@ -157,10 +157,11 @@ class BaseChatMessageHistory(ABC):
# method, so we should use it.
self.add_messages([message])
else:
raise NotImplementedError(
msg = (
"add_message is not implemented for this class. "
"Please implement add_message or add_messages."
)
raise NotImplementedError(msg)
def add_messages(self, messages: Sequence[BaseMessage]) -> None:
"""Add a list of messages.

View File

@@ -53,11 +53,12 @@ class BaseLoader(ABC): # noqa: B024
try:
from langchain_text_splitters import RecursiveCharacterTextSplitter
except ImportError as e:
raise ImportError(
msg = (
"Unable to import from langchain_text_splitters. Please specify "
"text_splitter or install langchain_text_splitters with "
"`pip install -U langchain-text-splitters`."
) from e
)
raise ImportError(msg) from e
_text_splitter: TextSplitter = RecursiveCharacterTextSplitter()
else:
@@ -71,9 +72,8 @@ class BaseLoader(ABC): # noqa: B024
"""A lazy loader for Documents."""
if type(self).load != BaseLoader.load:
return iter(self.load())
raise NotImplementedError(
f"{self.__class__.__name__} does not implement lazy_load()"
)
msg = f"{self.__class__.__name__} does not implement lazy_load()"
raise NotImplementedError(msg)
async def alazy_load(self) -> AsyncIterator[Document]:
"""A lazy loader for Documents."""

View File

@@ -142,7 +142,8 @@ class Blob(BaseMedia):
def check_blob_is_valid(cls, values: dict[str, Any]) -> Any:
"""Verify that either data or path is provided."""
if "data" not in values and "path" not in values:
raise ValueError("Either data or path must be provided")
msg = "Either data or path must be provided"
raise ValueError(msg)
return values
def as_string(self) -> str:
@@ -155,7 +156,8 @@ class Blob(BaseMedia):
elif isinstance(self.data, str):
return self.data
else:
raise ValueError(f"Unable to get string for blob {self}")
msg = f"Unable to get string for blob {self}"
raise ValueError(msg)
def as_bytes(self) -> bytes:
"""Read data as bytes."""
@@ -167,7 +169,8 @@ class Blob(BaseMedia):
with open(str(self.path), "rb") as f:
return f.read()
else:
raise ValueError(f"Unable to get bytes for blob {self}")
msg = f"Unable to get bytes for blob {self}"
raise ValueError(msg)
@contextlib.contextmanager
def as_bytes_io(self) -> Generator[Union[BytesIO, BufferedReader], None, None]:
@@ -178,7 +181,8 @@ class Blob(BaseMedia):
with open(str(self.path), "rb") as f:
yield f
else:
raise NotImplementedError(f"Unable to convert blob {self}")
msg = f"Unable to convert blob {self}"
raise NotImplementedError(msg)
@classmethod
def from_path(

View File

@@ -53,7 +53,7 @@ class FakeEmbeddings(Embeddings, BaseModel):
def _get_embedding(self) -> list[float]:
import numpy as np # type: ignore[import-not-found, import-untyped]
return list(np.random.normal(size=self.size))
return list(np.random.default_rng().normal(size=self.size))
def embed_documents(self, texts: list[str]) -> list[list[float]]:
return [self._get_embedding() for _ in texts]
@@ -109,8 +109,8 @@ class DeterministicFakeEmbedding(Embeddings, BaseModel):
import numpy as np # type: ignore[import-not-found, import-untyped]
# set the seed for the random generator
np.random.seed(seed)
return list(np.random.normal(size=self.size))
rng = np.random.default_rng(seed)
return list(rng.normal(size=self.size))
def _get_seed(self, text: str) -> int:
"""Get a seed for the random generator, using the hash of the text."""

View File

@@ -41,10 +41,11 @@ class OutputParserException(ValueError, LangChainException): # noqa: N818
):
super().__init__(error)
if send_to_llm and (observation is None or llm_output is None):
raise ValueError(
msg = (
"Arguments 'observation' & 'llm_output'"
" are required if 'send_to_llm' is True"
)
raise ValueError(msg)
self.observation = observation
self.llm_output = llm_output
self.send_to_llm = send_to_llm

View File

@@ -73,20 +73,22 @@ class _HashedDocument(Document):
for key in forbidden_keys:
if key in metadata:
raise ValueError(
msg = (
f"Metadata cannot contain key {key} as it "
f"is reserved for internal use."
)
raise ValueError(msg)
content_hash = str(_hash_string_to_uuid(content))
try:
metadata_hash = str(_hash_nested_dict_to_uuid(metadata))
except Exception as e:
raise ValueError(
msg = (
f"Failed to hash metadata: {e}. "
f"Please use a dict that can be serialized using json."
) from e
)
raise ValueError(msg) from e
values["content_hash"] = content_hash
values["metadata_hash"] = metadata_hash
@@ -154,10 +156,11 @@ def _get_source_id_assigner(
elif callable(source_id_key):
return source_id_key
else:
raise ValueError(
msg = (
f"source_id_key should be either None, a string or a callable. "
f"Got {source_id_key} of type {type(source_id_key)}."
)
raise ValueError(msg)
def _deduplicate_in_order(
@@ -198,6 +201,7 @@ def index(
source_id_key: Union[str, Callable[[Document], str], None] = None,
cleanup_batch_size: int = 1_000,
force_update: bool = False,
upsert_kwargs: Optional[dict[str, Any]] = None,
) -> IndexingResult:
"""Index data from the loader into the vector store.
@@ -249,6 +253,12 @@ def index(
force_update: Force update documents even if they are present in the
record manager. Useful if you are re-indexing with updated embeddings.
Default is False.
upsert_kwargs: Additional keyword arguments to pass to the add_documents
method of the VectorStore or the upsert method of the
DocumentIndex. For example, you can use this to
specify a custom vector_field:
upsert_kwargs={"vector_field": "embedding"}
.. versionadded:: 0.3.10
Returns:
Indexing result which contains information about how many documents
@@ -262,13 +272,15 @@ def index(
ValueError: If source_id_key is not None, but is not a string or callable.
"""
if cleanup not in {"incremental", "full", None}:
raise ValueError(
msg = (
f"cleanup should be one of 'incremental', 'full' or None. "
f"Got {cleanup}."
)
raise ValueError(msg)
if cleanup == "incremental" and source_id_key is None:
raise ValueError("Source id key is required when cleanup mode is incremental.")
msg = "Source id key is required when cleanup mode is incremental."
raise ValueError(msg)
destination = vector_store # Renaming internally for clarity
@@ -279,21 +291,24 @@ def index(
for method in methods:
if not hasattr(destination, method):
raise ValueError(
msg = (
f"Vectorstore {destination} does not have required method {method}"
)
raise ValueError(msg)
if type(destination).delete == VectorStore.delete:
# Checking if the vectorstore has overridden the default delete method
# implementation which just raises a NotImplementedError
raise ValueError("Vectorstore has not implemented the delete method")
msg = "Vectorstore has not implemented the delete method"
raise ValueError(msg)
elif isinstance(destination, DocumentIndex):
pass
else:
raise TypeError(
msg = (
f"Vectorstore should be either a VectorStore or a DocumentIndex. "
f"Got {type(destination)}."
)
raise TypeError(msg)
if isinstance(docs_source, BaseLoader):
try:
@@ -327,12 +342,13 @@ def index(
# If the cleanup mode is incremental, source ids are required.
for source_id, hashed_doc in zip(source_ids, hashed_docs):
if source_id is None:
raise ValueError(
msg = (
"Source ids are required when cleanup mode is incremental. "
f"Document that starts with "
f"content: {hashed_doc.page_content[:100]} was not assigned "
f"as source id."
)
raise ValueError(msg)
# source ids cannot be None after for loop above.
source_ids = cast(Sequence[str], source_ids) # type: ignore[assignment]
@@ -363,10 +379,16 @@ def index(
if docs_to_index:
if isinstance(destination, VectorStore):
destination.add_documents(
docs_to_index, ids=uids, batch_size=batch_size
docs_to_index,
ids=uids,
batch_size=batch_size,
**(upsert_kwargs or {}),
)
elif isinstance(destination, DocumentIndex):
destination.upsert(docs_to_index)
destination.upsert(
docs_to_index,
**(upsert_kwargs or {}),
)
num_added += len(docs_to_index) - len(seen_docs)
num_updated += len(seen_docs)
@@ -387,7 +409,8 @@ def index(
# mypy isn't good enough to determine that source ids cannot be None
# here due to a check that's happening above, so we check again.
if any(source_id is None for source_id in source_ids):
raise AssertionError("Source ids cannot be if cleanup=='incremental'.")
msg = "Source ids cannot be if cleanup=='incremental'."
raise AssertionError(msg)
indexed_source_ids = cast(
Sequence[str], [source_id_assigner(doc) for doc in docs_to_index]
@@ -438,6 +461,7 @@ async def aindex(
source_id_key: Union[str, Callable[[Document], str], None] = None,
cleanup_batch_size: int = 1_000,
force_update: bool = False,
upsert_kwargs: Optional[dict[str, Any]] = None,
) -> IndexingResult:
"""Async index data from the loader into the vector store.
@@ -480,6 +504,12 @@ async def aindex(
force_update: Force update documents even if they are present in the
record manager. Useful if you are re-indexing with updated embeddings.
Default is False.
upsert_kwargs: Additional keyword arguments to pass to the aadd_documents
method of the VectorStore or the aupsert method of the
DocumentIndex. For example, you can use this to
specify a custom vector_field:
upsert_kwargs={"vector_field": "embedding"}
.. versionadded:: 0.3.10
Returns:
Indexing result which contains information about how many documents
@@ -494,13 +524,15 @@ async def aindex(
"""
if cleanup not in {"incremental", "full", None}:
raise ValueError(
msg = (
f"cleanup should be one of 'incremental', 'full' or None. "
f"Got {cleanup}."
)
raise ValueError(msg)
if cleanup == "incremental" and source_id_key is None:
raise ValueError("Source id key is required when cleanup mode is incremental.")
msg = "Source id key is required when cleanup mode is incremental."
raise ValueError(msg)
destination = vector_store # Renaming internally for clarity
@@ -512,21 +544,24 @@ async def aindex(
for method in methods:
if not hasattr(destination, method):
raise ValueError(
msg = (
f"Vectorstore {destination} does not have required method {method}"
)
raise ValueError(msg)
if type(destination).adelete == VectorStore.adelete:
# Checking if the vectorstore has overridden the default delete method
# implementation which just raises a NotImplementedError
raise ValueError("Vectorstore has not implemented the delete method")
msg = "Vectorstore has not implemented the delete method"
raise ValueError(msg)
elif isinstance(destination, DocumentIndex):
pass
else:
raise TypeError(
msg = (
f"Vectorstore should be either a VectorStore or a DocumentIndex. "
f"Got {type(destination)}."
)
raise TypeError(msg)
async_doc_iterator: AsyncIterator[Document]
if isinstance(docs_source, BaseLoader):
try:
@@ -568,12 +603,13 @@ async def aindex(
# If the cleanup mode is incremental, source ids are required.
for source_id, hashed_doc in zip(source_ids, hashed_docs):
if source_id is None:
raise ValueError(
msg = (
"Source ids are required when cleanup mode is incremental. "
f"Document that starts with "
f"content: {hashed_doc.page_content[:100]} was not assigned "
f"as source id."
)
raise ValueError(msg)
# source ids cannot be None after for loop above.
source_ids = cast(Sequence[str], source_ids)
@@ -604,10 +640,16 @@ async def aindex(
if docs_to_index:
if isinstance(destination, VectorStore):
await destination.aadd_documents(
docs_to_index, ids=uids, batch_size=batch_size
docs_to_index,
ids=uids,
batch_size=batch_size,
**(upsert_kwargs or {}),
)
elif isinstance(destination, DocumentIndex):
await destination.aupsert(docs_to_index)
await destination.aupsert(
docs_to_index,
**(upsert_kwargs or {}),
)
num_added += len(docs_to_index) - len(seen_docs)
num_updated += len(seen_docs)
@@ -628,7 +670,8 @@ async def aindex(
# mypy isn't good enough to determine that source ids cannot be None
# here due to a check that's happening above, so we check again.
if any(source_id is None for source_id in source_ids):
raise AssertionError("Source ids cannot be if cleanup=='incremental'.")
msg = "Source ids cannot be if cleanup=='incremental'."
raise AssertionError(msg)
indexed_source_ids = cast(
Sequence[str], [source_id_assigner(doc) for doc in docs_to_index]
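Both `index` and the async `aindex` now accept `upsert_kwargs`, which is forwarded to `add_documents` / `aadd_documents` on a `VectorStore` or to `upsert` / `aupsert` on a `DocumentIndex`. A short sketch of the call shape, with the loader, record manager and store assumed to be set up elsewhere:

from langchain_core.indexing import index

# `docs`, `record_manager` and `vector_store` are assumed to exist already.
result = index(
    docs,
    record_manager,
    vector_store,
    cleanup="incremental",
    source_id_key="source",
    upsert_kwargs={"vector_field": "embedding"},  # passed through to add_documents
)
# result is an IndexingResult with num_added / num_updated / num_skipped / num_deleted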

View File

@@ -290,11 +290,13 @@ class InMemoryRecordManager(RecordManager):
"""
if group_ids and len(keys) != len(group_ids):
raise ValueError("Length of keys must match length of group_ids")
msg = "Length of keys must match length of group_ids"
raise ValueError(msg)
for index, key in enumerate(keys):
group_id = group_ids[index] if group_ids else None
if time_at_least and time_at_least > self.get_time():
raise ValueError("time_at_least must be in the past")
msg = "time_at_least must be in the past"
raise ValueError(msg)
self.records[key] = {"group_id": group_id, "updated_at": self.get_time()}
async def aupdate(

View File

@@ -47,7 +47,8 @@ class InMemoryDocumentIndex(DocumentIndex):
def delete(self, ids: Optional[list[str]] = None, **kwargs: Any) -> DeleteResponse:
"""Delete by ID."""
if ids is None:
raise ValueError("IDs must be provided for deletion")
msg = "IDs must be provided for deletion"
raise ValueError(msg)
ok_ids = []

View File

@@ -60,11 +60,12 @@ def get_tokenizer() -> Any:
try:
from transformers import GPT2TokenizerFast # type: ignore[import]
except ImportError as e:
raise ImportError(
msg = (
"Could not import transformers python package. "
"This is needed in order to calculate get_token_ids. "
"Please install it with `pip install transformers`."
) from e
)
raise ImportError(msg) from e
# create a GPT-2 tokenizer instance
return GPT2TokenizerFast.from_pretrained("gpt2")
@@ -236,7 +237,7 @@ class BaseLanguageModel(
"""Not implemented on this class."""
# Implement this on child class if there is a way of steering the model to
# generate responses that match a given schema.
raise NotImplementedError()
raise NotImplementedError
@deprecated("0.1.7", alternative="invoke", removal="1.0")
@abstractmethod

View File

@@ -89,7 +89,8 @@ def generate_from_stream(stream: Iterator[ChatGenerationChunk]) -> ChatResult:
if generation:
generation += list(stream)
if generation is None:
raise ValueError("No generations found in stream.")
msg = "No generations found in stream."
raise ValueError(msg)
return ChatResult(
generations=[
ChatGeneration(
@@ -265,10 +266,11 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
elif isinstance(input, Sequence):
return ChatPromptValue(messages=convert_to_messages(input))
else:
raise ValueError(
msg = (
f"Invalid input type {type(input)}. "
"Must be a PromptValue, str, or list of BaseMessages."
)
raise ValueError(msg)
def invoke(
self,
@@ -817,9 +819,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
elif self.cache is None:
pass
else:
raise ValueError(
"Asked to cache, but no cache found at `langchain.cache`."
)
msg = "Asked to cache, but no cache found at `langchain.cache`."
raise ValueError(msg)
# Apply the rate limiter after checking the cache, since
# we usually don't want to rate limit cache lookups, but
@@ -891,9 +892,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
elif self.cache is None:
pass
else:
raise ValueError(
"Asked to cache, but no cache found at `langchain.cache`."
)
msg = "Asked to cache, but no cache found at `langchain.cache`."
raise ValueError(msg)
# Apply the rate limiter after checking the cache, since
# we usually don't want to rate limit cache lookups, but
@@ -977,7 +977,7 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
run_manager: Optional[CallbackManagerForLLMRun] = None,
**kwargs: Any,
) -> Iterator[ChatGenerationChunk]:
raise NotImplementedError()
raise NotImplementedError
async def _astream(
self,
@@ -1020,7 +1020,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
if isinstance(generation, ChatGeneration):
return generation.message
else:
raise ValueError("Unexpected generation type")
msg = "Unexpected generation type"
raise ValueError(msg)
async def _call_async(
self,
@@ -1036,7 +1037,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
if isinstance(generation, ChatGeneration):
return generation.message
else:
raise ValueError("Unexpected generation type")
msg = "Unexpected generation type"
raise ValueError(msg)
@deprecated("0.1.7", alternative="invoke", removal="1.0")
def call_as_llm(
@@ -1053,7 +1055,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
if isinstance(result.content, str):
return result.content
else:
raise ValueError("Cannot use predict when output is not a string.")
msg = "Cannot use predict when output is not a string."
raise ValueError(msg)
@deprecated("0.1.7", alternative="invoke", removal="1.0")
def predict_messages(
@@ -1077,7 +1080,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
if isinstance(result.content, str):
return result.content
else:
raise ValueError("Cannot use predict when output is not a string.")
msg = "Cannot use predict when output is not a string."
raise ValueError(msg)
@deprecated("0.1.7", alternative="ainvoke", removal="1.0")
async def apredict_messages(
@@ -1108,7 +1112,7 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
],
**kwargs: Any,
) -> Runnable[LanguageModelInput, BaseMessage]:
raise NotImplementedError()
raise NotImplementedError
def with_structured_output(
self,
@@ -1220,7 +1224,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
# }
""" # noqa: E501
if kwargs:
raise ValueError(f"Received unsupported arguments {kwargs}")
msg = f"Received unsupported arguments {kwargs}"
raise ValueError(msg)
from langchain_core.output_parsers.openai_tools import (
JsonOutputKeyToolsParser,
@@ -1228,9 +1233,8 @@ class BaseChatModel(BaseLanguageModel[BaseMessage], ABC):
)
if self.bind_tools is BaseChatModel.bind_tools:
raise NotImplementedError(
"with_structured_output is not implemented for this model."
)
msg = "with_structured_output is not implemented for this model."
raise NotImplementedError(msg)
llm = self.bind_tools([schema], tool_choice="any")
if isinstance(schema, type) and is_basemodel_subclass(schema):
output_parser: OutputParserLike = PydanticToolsParser(

View File

@@ -238,18 +238,20 @@ class GenericFakeChatModel(BaseChatModel):
messages, stop=stop, run_manager=run_manager, **kwargs
)
if not isinstance(chat_result, ChatResult):
raise ValueError(
msg = (
f"Expected generate to return a ChatResult, "
f"but got {type(chat_result)} instead."
)
raise ValueError(msg)
message = chat_result.generations[0].message
if not isinstance(message, AIMessage):
raise ValueError(
msg = (
f"Expected invoke to return an AIMessage, "
f"but got {type(message)} instead."
)
raise ValueError(msg)
content = message.content

View File

@@ -135,15 +135,17 @@ def _resolve_cache(cache: Union[BaseCache, bool, None]) -> Optional[BaseCache]:
elif cache is True:
llm_cache = get_llm_cache()
if llm_cache is None:
raise ValueError(
msg = (
"No global cache was configured. Use `set_llm_cache`."
"to set a global cache if you want to use a global cache."
"Otherwise either pass a cache object or set cache to False/None"
)
raise ValueError(msg)
elif cache is False:
llm_cache = None
else:
raise ValueError(f"Unsupported cache value {cache}")
msg = f"Unsupported cache value {cache}"
raise ValueError(msg)
return llm_cache
@@ -332,10 +334,11 @@ class BaseLLM(BaseLanguageModel[str], ABC):
elif isinstance(input, Sequence):
return ChatPromptValue(messages=convert_to_messages(input))
else:
raise ValueError(
msg = (
f"Invalid input type {type(input)}. "
"Must be a PromptValue, str, or list of BaseMessages."
)
raise ValueError(msg)
def _get_ls_params(
self,
@@ -695,7 +698,7 @@ class BaseLLM(BaseLanguageModel[str], ABC):
Returns:
An iterator of GenerationChunks.
"""
raise NotImplementedError()
raise NotImplementedError
async def _astream(
self,
@@ -842,10 +845,11 @@ class BaseLLM(BaseLanguageModel[str], ABC):
prompt and additional model provider-specific output.
"""
if not isinstance(prompts, list):
raise ValueError(
msg = (
"Argument 'prompts' is expected to be of type List[str], received"
f" argument of type {type(prompts)}."
)
raise ValueError(msg)
# Create callback managers
if isinstance(metadata, list):
metadata = [
@@ -989,10 +993,11 @@ class BaseLLM(BaseLanguageModel[str], ABC):
return [None] * len(prompts)
if isinstance(run_id, list):
if len(run_id) != len(prompts):
raise ValueError(
msg = (
"Number of manually provided run_id's does not match batch length."
f" {len(run_id)} != {len(prompts)}"
)
raise ValueError(msg)
return run_id
return [run_id] + [None] * (len(prompts) - 1)
@@ -1262,11 +1267,12 @@ class BaseLLM(BaseLanguageModel[str], ABC):
ValueError: If the prompt is not a string.
"""
if not isinstance(prompt, str):
raise ValueError(
msg = (
"Argument `prompt` is expected to be a string. Instead found "
f"{type(prompt)}. If you want to run the LLM on multiple prompts, use "
"`generate` instead."
)
raise ValueError(msg)
return (
self.generate(
[prompt],
@@ -1387,7 +1393,8 @@ class BaseLLM(BaseLanguageModel[str], ABC):
with open(file_path, "w") as f:
yaml.dump(prompt_dict, f, default_flow_style=False)
else:
raise ValueError(f"{save_path} must be json or yaml")
msg = f"{save_path} must be json or yaml"
raise ValueError(msg)
class LLM(BaseLLM):

View File

@@ -37,7 +37,8 @@ def dumps(obj: Any, *, pretty: bool = False, **kwargs: Any) -> str:
ValueError: If `default` is passed as a kwarg.
"""
if "default" in kwargs:
raise ValueError("`default` should not be passed to dumps")
msg = "`default` should not be passed to dumps"
raise ValueError(msg)
try:
if pretty:
indent = kwargs.pop("indent", 2)

View File

@@ -96,17 +96,19 @@ class Reviver:
else:
if self.secrets_from_env and key in os.environ and os.environ[key]:
return os.environ[key]
raise KeyError(f'Missing key "{key}" in load(secrets_map)')
msg = f'Missing key "{key}" in load(secrets_map)'
raise KeyError(msg)
if (
value.get("lc") == 1
and value.get("type") == "not_implemented"
and value.get("id") is not None
):
raise NotImplementedError(
msg = (
"Trying to load an object that doesn't implement "
f"serialization: {value}"
)
raise NotImplementedError(msg)
if (
value.get("lc") == 1
@@ -121,7 +123,8 @@ class Reviver:
# The root namespace ["langchain"] is not a valid identifier.
or namespace == ["langchain"]
):
raise ValueError(f"Invalid namespace: {value}")
msg = f"Invalid namespace: {value}"
raise ValueError(msg)
# Has explicit import path.
elif mapping_key in self.import_mappings:
import_path = self.import_mappings[mapping_key]
@@ -130,11 +133,12 @@ class Reviver:
# Import module
mod = importlib.import_module(".".join(import_dir))
elif namespace[0] in DISALLOW_LOAD_FROM_PATH:
raise ValueError(
msg = (
"Trying to deserialize something that cannot "
"be deserialized in current version of langchain-core: "
f"{mapping_key}."
)
raise ValueError(msg)
# Otherwise, treat namespace as path.
else:
mod = importlib.import_module(".".join(namespace))
@@ -143,7 +147,8 @@ class Reviver:
# The class must be a subclass of Serializable.
if not issubclass(cls, Serializable):
raise ValueError(f"Invalid namespace: {value}")
msg = f"Invalid namespace: {value}"
raise ValueError(msg)
# We don't need to recurse on kwargs
# as json.loads will do that for us.

View File

@@ -215,11 +215,12 @@ class Serializable(BaseModel, ABC):
for attr in deprecated_attributes:
if hasattr(cls, attr):
raise ValueError(
msg = (
f"Class {self.__class__} has a deprecated "
f"attribute {attr}. Please use the corresponding "
f"classmethod instead."
)
raise ValueError(msg)
# Get a reference to self bound to each class in the MRO
this = cast(Serializable, self if cls is None else super(cls, self))

View File

@@ -1,5 +1,6 @@
import json
from typing import Any, Literal, Optional, Union
import operator
from typing import Any, Literal, Optional, Union, cast
from pydantic import model_validator
from typing_extensions import NotRequired, Self, TypedDict
@@ -27,6 +28,7 @@ from langchain_core.messages.tool import (
)
from langchain_core.utils._merge import merge_dicts, merge_lists
from langchain_core.utils.json import parse_partial_json
from langchain_core.utils.usage import _dict_int_op
class InputTokenDetails(TypedDict, total=False):
@@ -373,7 +375,8 @@ class AIMessageChunk(AIMessage, BaseMessageChunk):
)
)
else:
raise ValueError("Malformed args.")
msg = "Malformed args."
raise ValueError(msg)
except Exception:
invalid_tool_calls.append(
create_invalid_tool_call(
@@ -402,9 +405,8 @@ def add_ai_message_chunks(
) -> AIMessageChunk:
"""Add multiple AIMessageChunks together."""
if any(left.example != o.example for o in others):
raise ValueError(
"Cannot concatenate AIMessageChunks with different example values."
)
msg = "Cannot concatenate AIMessageChunks with different example values."
raise ValueError(msg)
content = merge_content(left.content, *(o.content for o in others))
additional_kwargs = merge_dicts(
@@ -432,17 +434,9 @@ def add_ai_message_chunks(
# Token usage
if left.usage_metadata or any(o.usage_metadata is not None for o in others):
usage_metadata_: UsageMetadata = left.usage_metadata or UsageMetadata(
input_tokens=0, output_tokens=0, total_tokens=0
)
usage_metadata: Optional[UsageMetadata] = left.usage_metadata
for other in others:
if other.usage_metadata is not None:
usage_metadata_["input_tokens"] += other.usage_metadata["input_tokens"]
usage_metadata_["output_tokens"] += other.usage_metadata[
"output_tokens"
]
usage_metadata_["total_tokens"] += other.usage_metadata["total_tokens"]
usage_metadata: Optional[UsageMetadata] = usage_metadata_
usage_metadata = add_usage(usage_metadata, other.usage_metadata)
else:
usage_metadata = None
@@ -455,3 +449,115 @@ def add_ai_message_chunks(
usage_metadata=usage_metadata,
id=left.id,
)
def add_usage(
left: Optional[UsageMetadata], right: Optional[UsageMetadata]
) -> UsageMetadata:
"""Recursively add two UsageMetadata objects.
Example:
.. code-block:: python
from langchain_core.messages.ai import add_usage
left = UsageMetadata(
input_tokens=5,
output_tokens=0,
total_tokens=5,
input_token_details=InputTokenDetails(cache_read=3)
)
right = UsageMetadata(
input_tokens=0,
output_tokens=10,
total_tokens=10,
output_token_details=OutputTokenDetails(reasoning=4)
)
add_usage(left, right)
results in
.. code-block:: python
UsageMetadata(
input_tokens=5,
output_tokens=10,
total_tokens=15,
input_token_details=InputTokenDetails(cache_read=3),
output_token_details=OutputTokenDetails(reasoning=4)
)
"""
if not (left or right):
return UsageMetadata(input_tokens=0, output_tokens=0, total_tokens=0)
if not (left and right):
return cast(UsageMetadata, left or right)
return UsageMetadata(
**cast(
UsageMetadata,
_dict_int_op(
cast(dict, left),
cast(dict, right),
operator.add,
),
)
)
def subtract_usage(
left: Optional[UsageMetadata], right: Optional[UsageMetadata]
) -> UsageMetadata:
"""Recursively subtract two UsageMetadata objects.
Token counts cannot be negative so the actual operation is max(left - right, 0).
Example:
.. code-block:: python
from langchain_core.messages.ai import subtract_usage
left = UsageMetadata(
input_tokens=5,
output_tokens=10,
total_tokens=15,
input_token_details=InputTokenDetails(cache_read=4)
)
right = UsageMetadata(
input_tokens=3,
output_tokens=8,
total_tokens=11,
output_token_details=OutputTokenDetails(reasoning=4)
)
subtract_usage(left, right)
results in
.. code-block:: python
UsageMetadata(
input_tokens=2,
output_tokens=2,
total_tokens=4,
input_token_details=InputTokenDetails(cache_read=4),
output_token_details=OutputTokenDetails(reasoning=0)
)
"""
if not (left or right):
return UsageMetadata(input_tokens=0, output_tokens=0, total_tokens=0)
if not (left and right):
return cast(UsageMetadata, left or right)
return UsageMetadata(
**cast(
UsageMetadata,
_dict_int_op(
cast(dict, left),
cast(dict, right),
(lambda le, ri: max(le - ri, 0)),
),
)
)

View File
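The add_usage and subtract_usage helpers in the file above delegate the element-wise arithmetic to _dict_int_op, which is imported from langchain_core.utils.usage but not shown in this diff. A rough sketch of the contract it appears to satisfy, offered as an assumption for illustration rather than the actual implementation:

from typing import Callable

def dict_int_op_sketch(left: dict, right: dict, op: Callable[[int, int], int]) -> dict:
    # Assumed behaviour: apply op to integer values key by key, recurse into
    # nested dicts (the token-detail sub-dicts), and treat missing keys as 0.
    out: dict = {}
    for key in set(left) | set(right):
        lv, rv = left.get(key, 0), right.get(key, 0)
        if isinstance(lv, dict) or isinstance(rv, dict):
            out[key] = dict_int_op_sketch(lv or {}, rv or {}, op)
        else:
            out[key] = op(lv, rv)
    return out

With operator.add this reproduces the add_usage docstring example above; with the clamped subtraction lambda it reproduces subtract_usage.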

@@ -223,11 +223,12 @@ class BaseMessageChunk(BaseMessage):
response_metadata=response_metadata,
)
else:
raise TypeError(
msg = (
'unsupported operand type(s) for +: "'
f"{self.__class__.__name__}"
f'" and "{other.__class__.__name__}"'
)
raise TypeError(msg)
def message_to_dict(message: BaseMessage) -> dict:

View File

@@ -48,9 +48,8 @@ class ChatMessageChunk(ChatMessage, BaseMessageChunk):
def __add__(self, other: Any) -> BaseMessageChunk: # type: ignore
if isinstance(other, ChatMessageChunk):
if self.role != other.role:
raise ValueError(
"Cannot concatenate ChatMessageChunks with different roles."
)
msg = "Cannot concatenate ChatMessageChunks with different roles."
raise ValueError(msg)
return self.__class__(
role=self.role,

View File

@@ -54,9 +54,8 @@ class FunctionMessageChunk(FunctionMessage, BaseMessageChunk):
def __add__(self, other: Any) -> BaseMessageChunk: # type: ignore
if isinstance(other, FunctionMessageChunk):
if self.name != other.name:
raise ValueError(
"Cannot concatenate FunctionMessageChunks with different names."
)
msg = "Cannot concatenate FunctionMessageChunks with different names."
raise ValueError(msg)
return self.__class__(
name=self.name,

View File

@@ -20,7 +20,8 @@ class RemoveMessage(BaseMessage):
ValueError: If the 'content' field is passed in kwargs.
"""
if kwargs.pop("content", None):
raise ValueError("RemoveMessage does not support 'content' field.")
msg = "RemoveMessage does not support 'content' field."
raise ValueError(msg)
return super().__init__("", id=id, **kwargs)

View File

@@ -94,11 +94,12 @@ class ToolMessage(BaseMessage):
try:
values["content"] = str(content)
except ValueError as e:
raise ValueError(
msg = (
"ToolMessage content should be a string or a list of string/dicts. "
f"Received:\n\n{content=}\n\n which could not be coerced into a "
"string."
) from e
)
raise ValueError(msg) from e
elif isinstance(content, list):
values["content"] = []
for i, x in enumerate(content):
@@ -106,12 +107,13 @@ class ToolMessage(BaseMessage):
try:
values["content"].append(str(x))
except ValueError as e:
raise ValueError(
msg = (
"ToolMessage content should be a string or a list of "
"string/dicts. Received a list but "
f"element ToolMessage.content[{i}] is not a dict and could "
f"not be coerced to a string.:\n\n{x}"
) from e
)
raise ValueError(msg) from e
else:
values["content"].append(x)
else:
@@ -147,9 +149,8 @@ class ToolMessageChunk(ToolMessage, BaseMessageChunk):
def __add__(self, other: Any) -> BaseMessageChunk: # type: ignore
if isinstance(other, ToolMessageChunk):
if self.tool_call_id != other.tool_call_id:
raise ValueError(
"Cannot concatenate ToolMessageChunks with different names."
)
msg = "Cannot concatenate ToolMessageChunks with different names."
raise ValueError(msg)
return self.__class__(
tool_call_id=self.tool_call_id,

View File

@@ -51,10 +51,11 @@ def _get_type(v: Any) -> str:
elif hasattr(v, "type"):
return v.type
else:
raise TypeError(
msg = (
f"Expected either a dictionary with a 'type' key or an object "
f"with a 'type' attribute. Instead got type {type(v)}."
)
raise TypeError(msg)
AnyMessage = Annotated[
@@ -120,7 +121,8 @@ def get_buffer_string(
elif isinstance(m, ChatMessage):
role = m.role
else:
raise ValueError(f"Got unsupported message type: {m}")
msg = f"Got unsupported message type: {m}"
raise ValueError(msg)
message = f"{role}: {m.content}"
if isinstance(m, AIMessage) and "function_call" in m.additional_kwargs:
message += f"{m.additional_kwargs['function_call']}"
@@ -158,7 +160,8 @@ def _message_from_dict(message: dict) -> BaseMessage:
elif _type == "ChatMessageChunk":
return ChatMessageChunk(**message["data"])
else:
raise ValueError(f"Got unexpected message type: {_type}")
msg = f"Got unexpected message type: {_type}"
raise ValueError(msg)
def messages_from_dict(messages: Sequence[dict]) -> list[BaseMessage]:
@@ -266,10 +269,11 @@ def _create_message_from_message_type(
elif message_type == "remove":
message = RemoveMessage(**kwargs)
else:
raise ValueError(
msg = (
f"Unexpected message type: '{message_type}'. Use one of 'human',"
f" 'user', 'ai', 'assistant', 'function', 'tool', or 'system'."
)
raise ValueError(msg)
return message
@@ -312,14 +316,14 @@ def _convert_to_message(message: MessageLikeRepresentation) -> BaseMessage:
# None msg content is not allowed
msg_content = msg_kwargs.pop("content") or ""
except KeyError as e:
raise ValueError(
f"Message dict must contain 'role' and 'content' keys, got {message}"
) from e
msg = f"Message dict must contain 'role' and 'content' keys, got {message}"
raise ValueError(msg) from e
_message = _create_message_from_message_type(
msg_type, msg_content, **msg_kwargs
)
else:
raise NotImplementedError(f"Unsupported message type: {type(message)}")
msg = f"Unsupported message type: {type(message)}"
raise NotImplementedError(msg)
return _message
@@ -820,11 +824,12 @@ def trim_messages(
else:
list_token_counter = token_counter # type: ignore[assignment]
else:
raise ValueError(
msg = (
f"'token_counter' expected to be a model that implements "
f"'get_num_tokens_from_messages()' or a function. Received object of type "
f"{type(token_counter)}."
)
raise ValueError(msg)
try:
from langchain_text_splitters import TextSplitter
@@ -859,9 +864,8 @@ def trim_messages(
text_splitter=text_splitter_fn,
)
else:
raise ValueError(
f"Unrecognized {strategy=}. Supported strategies are 'last' and 'first'."
)
msg = f"Unrecognized {strategy=}. Supported strategies are 'last' and 'first'."
raise ValueError(msg)
def _first_max_tokens(
@@ -995,10 +999,11 @@ def _msg_to_chunk(message: BaseMessage) -> BaseMessageChunk:
if isinstance(message, msg_cls):
return chunk_cls(**message.model_dump(exclude={"type"}))
raise ValueError(
msg = (
f"Unrecognized message class {message.__class__}. Supported classes are "
f"{list(_MSG_CHUNK_MAP.keys())}"
)
raise ValueError(msg)
def _chunk_to_msg(chunk: BaseMessageChunk) -> BaseMessage:
@@ -1010,10 +1015,11 @@ def _chunk_to_msg(chunk: BaseMessageChunk) -> BaseMessage:
if isinstance(chunk, chunk_cls):
return msg_cls(**chunk.model_dump(exclude={"type", "tool_call_chunks"}))
raise ValueError(
msg = (
f"Unrecognized message chunk class {chunk.__class__}. Supported classes are "
f"{list(_CHUNK_MSG_MAP.keys())}"
)
raise ValueError(msg)
def _default_text_splitter(text: str) -> list[str]:

View File
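For context on the trim_messages hunks above, a hedged usage sketch: the counter function here is invented for illustration, while in real use a chat model implementing get_num_tokens_from_messages, or any callable over a message list, is passed as token_counter.

from langchain_core.messages import AIMessage, HumanMessage, trim_messages

def approx_token_counter(messages) -> int:
    # Crude stand-in counter: roughly one token per four characters of content.
    return sum(len(str(m.content)) // 4 for m in messages)

history = [
    HumanMessage("hi"),
    AIMessage("hello!"),
    HumanMessage("please summarize our conversation so far"),
]

trimmed = trim_messages(
    history,
    max_tokens=10,
    strategy="last",              # keep the most recent messages, per the hunk above
    token_counter=approx_token_counter,
)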

@@ -177,10 +177,11 @@ class BaseOutputParser(
if "args" in metadata and len(metadata["args"]) > 0:
return metadata["args"][0]
raise TypeError(
msg = (
f"Runnable {self.__class__.__name__} doesn't have an inferable OutputType. "
"Override the OutputType property to specify the output type."
)
raise TypeError(msg)
def invoke(
self,
@@ -310,10 +311,11 @@ class BaseOutputParser(
@property
def _type(self) -> str:
"""Return the output parser type for serialization."""
raise NotImplementedError(
msg = (
f"_type property is not implemented in class {self.__class__.__name__}."
" This is required for serialization."
)
raise NotImplementedError(msg)
def dict(self, **kwargs: Any) -> dict:
"""Return dictionary representation of output parser."""

View File

@@ -36,16 +36,14 @@ class OutputFunctionsParser(BaseGenerationOutputParser[Any]):
"""
generation = result[0]
if not isinstance(generation, ChatGeneration):
raise OutputParserException(
"This output parser can only be used with a chat generation."
)
msg = "This output parser can only be used with a chat generation."
raise OutputParserException(msg)
message = generation.message
try:
func_call = copy.deepcopy(message.additional_kwargs["function_call"])
except KeyError as exc:
raise OutputParserException(
f"Could not parse function call: {exc}"
) from exc
msg = f"Could not parse function call: {exc}"
raise OutputParserException(msg) from exc
if self.args_only:
return func_call["arguments"]
@@ -88,14 +86,12 @@ class JsonOutputFunctionsParser(BaseCumulativeTransformOutputParser[Any]):
"""
if len(result) != 1:
raise OutputParserException(
f"Expected exactly one result, but got {len(result)}"
)
msg = f"Expected exactly one result, but got {len(result)}"
raise OutputParserException(msg)
generation = result[0]
if not isinstance(generation, ChatGeneration):
raise OutputParserException(
"This output parser can only be used with a chat generation."
)
msg = "This output parser can only be used with a chat generation."
raise OutputParserException(msg)
message = generation.message
try:
function_call = message.additional_kwargs["function_call"]
@@ -103,9 +99,8 @@ class JsonOutputFunctionsParser(BaseCumulativeTransformOutputParser[Any]):
if partial:
return None
else:
raise OutputParserException(
f"Could not parse function call: {exc}"
) from exc
msg = f"Could not parse function call: {exc}"
raise OutputParserException(msg) from exc
try:
if partial:
try:
@@ -129,9 +124,8 @@ class JsonOutputFunctionsParser(BaseCumulativeTransformOutputParser[Any]):
function_call["arguments"], strict=self.strict
)
except (json.JSONDecodeError, TypeError) as exc:
raise OutputParserException(
f"Could not parse function call data: {exc}"
) from exc
msg = f"Could not parse function call data: {exc}"
raise OutputParserException(msg) from exc
else:
try:
return {
@@ -141,9 +135,8 @@ class JsonOutputFunctionsParser(BaseCumulativeTransformOutputParser[Any]):
),
}
except (json.JSONDecodeError, TypeError) as exc:
raise OutputParserException(
f"Could not parse function call data: {exc}"
) from exc
msg = f"Could not parse function call data: {exc}"
raise OutputParserException(msg) from exc
except KeyError:
return None
@@ -158,7 +151,7 @@ class JsonOutputFunctionsParser(BaseCumulativeTransformOutputParser[Any]):
Returns:
The parsed JSON object.
"""
raise NotImplementedError()
raise NotImplementedError
class JsonKeyOutputFunctionsParser(JsonOutputFunctionsParser):
@@ -253,10 +246,11 @@ class PydanticOutputFunctionsParser(OutputFunctionsParser):
and issubclass(schema, BaseModel)
)
elif values["args_only"] and isinstance(schema, dict):
raise ValueError(
msg = (
"If multiple pydantic schemas are provided then args_only should be"
" False."
)
raise ValueError(msg)
return values
def parse_result(self, result: list[Generation], *, partial: bool = False) -> Any:

View File
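A second recurring change in the parser files above and in several files below drops the empty call parentheses when raising, i.e. raise NotImplementedError instead of raise NotImplementedError(). Both are equivalent at runtime, since Python instantiates the exception class automatically; the bare form is what a lint rule such as ruff's RSE102 prefers. For illustration only:

class BaseParserSketch:
    def parse(self, text: str):
        # Subclasses must override; raising the bare class is equivalent to
        # raise NotImplementedError() but avoids the redundant call.
        raise NotImplementedError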

@@ -52,11 +52,12 @@ def parse_tool_call(
raw_tool_call["function"]["arguments"], strict=strict
)
except JSONDecodeError as e:
raise OutputParserException(
msg = (
f"Function {raw_tool_call['function']['name']} arguments:\n\n"
f"{raw_tool_call['function']['arguments']}\n\nare not valid JSON. "
f"Received JSONDecodeError {e}"
) from e
)
raise OutputParserException(msg) from e
parsed = {
"name": raw_tool_call["function"]["name"] or "",
"args": function_args or {},
@@ -170,9 +171,8 @@ class JsonOutputToolsParser(BaseCumulativeTransformOutputParser[Any]):
generation = result[0]
if not isinstance(generation, ChatGeneration):
raise OutputParserException(
"This output parser can only be used with a chat generation."
)
msg = "This output parser can only be used with a chat generation."
raise OutputParserException(msg)
message = generation.message
if isinstance(message, AIMessage) and message.tool_calls:
tool_calls = [dict(tc) for tc in message.tool_calls]
@@ -207,7 +207,7 @@ class JsonOutputToolsParser(BaseCumulativeTransformOutputParser[Any]):
Returns:
The parsed tool calls.
"""
raise NotImplementedError()
raise NotImplementedError
class JsonOutputKeyToolsParser(JsonOutputToolsParser):
@@ -285,10 +285,11 @@ class PydanticToolsParser(JsonOutputToolsParser):
for res in json_results:
try:
if not isinstance(res["args"], dict):
raise ValueError(
msg = (
f"Tool arguments must be specified as a dict, received: "
f"{res['args']}"
)
raise ValueError(msg)
pydantic_objects.append(name_dict[res["type"]](**res["args"]))
except (ValidationError, ValueError) as e:
if partial:

View File

@@ -29,10 +29,9 @@ class PydanticOutputParser(JsonOutputParser, Generic[TBaseModel]):
elif issubclass(self.pydantic_object, pydantic.v1.BaseModel):
return self.pydantic_object.parse_obj(obj)
else:
raise OutputParserException(
f"Unsupported model version for PydanticOutputParser: \
msg = f"Unsupported model version for PydanticOutputParser: \
{self.pydantic_object.__class__}"
)
raise OutputParserException(msg)
except (pydantic.ValidationError, pydantic.v1.ValidationError) as e:
raise self._parser_exception(e, obj) from e
else: # pydantic v1

View File

@@ -106,7 +106,7 @@ class BaseCumulativeTransformOutputParser(BaseTransformOutputParser[T]):
Returns:
The diff between the previous and current parsed output.
"""
raise NotImplementedError()
raise NotImplementedError
def _transform(self, input: Iterator[Union[str, BaseMessage]]) -> Iterator[Any]:
prev_parsed = None

View File

@@ -49,11 +49,12 @@ class _StreamingParser:
try:
import defusedxml # type: ignore
except ImportError as e:
raise ImportError(
msg = (
"defusedxml is not installed. "
"Please install it to use the defusedxml parser."
"You can install it with `pip install defusedxml` "
) from e
)
raise ImportError(msg) from e
_parser = defusedxml.ElementTree.DefusedXMLParser(target=TreeBuilder())
else:
_parser = None
@@ -190,12 +191,13 @@ class XMLOutputParser(BaseTransformOutputParser):
try:
from defusedxml import ElementTree # type: ignore
except ImportError as e:
raise ImportError(
msg = (
"defusedxml is not installed. "
"Please install it to use the defusedxml parser."
"You can install it with `pip install defusedxml`"
"See https://github.com/tiran/defusedxml for more details"
) from e
)
raise ImportError(msg) from e
_et = ElementTree # Use the defusedxml parser
else:
_et = ET # Use the standard library parser

View File
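Both hunks in the XML parser file above steer users toward defusedxml, which hardens parsing against entity-expansion and similar XML attacks. A minimal sketch, assuming defusedxml has been installed with pip install defusedxml:

from defusedxml import ElementTree  # safer drop-in for xml.etree.ElementTree

root = ElementTree.fromstring("<result><item>42</item></result>")
print(root.find("item").text)  # -> 42

Feeding the same call a document that relies on external entities or billion-laughs expansion raises an exception instead of being expanded.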

@@ -65,7 +65,8 @@ class ChatGeneration(Generation):
pass
self.text = text
except (KeyError, AttributeError) as e:
raise ValueError("Error while initializing ChatGeneration") from e
msg = "Error while initializing ChatGeneration"
raise ValueError(msg) from e
return self
@classmethod
@@ -114,6 +115,7 @@ class ChatGenerationChunk(ChatGeneration):
generation_info=generation_info or None,
)
else:
raise TypeError(
msg = (
f"unsupported operand type(s) for +: '{type(self)}' and '{type(other)}'"
)
raise TypeError(msg)

View File

@@ -64,6 +64,7 @@ class GenerationChunk(Generation):
generation_info=generation_info or None,
)
else:
raise TypeError(
msg = (
f"unsupported operand type(s) for +: '{type(self)}' and '{type(other)}'"
)
raise TypeError(msg)

View File

@@ -70,21 +70,22 @@ class BasePromptTemplate(
def validate_variable_names(self) -> Self:
"""Validate variable names do not include restricted names."""
if "stop" in self.input_variables:
raise ValueError(
msg = (
"Cannot have an input variable named 'stop', as it is used internally,"
" please rename."
)
raise ValueError(msg)
if "stop" in self.partial_variables:
raise ValueError(
msg = (
"Cannot have an partial variable named 'stop', as it is used "
"internally, please rename."
)
raise ValueError(msg)
overall = set(self.input_variables).intersection(self.partial_variables)
if overall:
raise ValueError(
f"Found overlapping input and partial variables: {overall}"
)
msg = f"Found overlapping input and partial variables: {overall}"
raise ValueError(msg)
return self
@classmethod
@@ -143,10 +144,11 @@ class BasePromptTemplate(
inner_input = {var_name: inner_input}
else:
raise TypeError(
msg = (
f"Expected mapping type as input to {self.__class__.__name__}. "
f"Received {type(inner_input)}."
)
raise TypeError(msg)
missing = set(self.input_variables).difference(inner_input)
if missing:
msg = (
@@ -341,12 +343,14 @@ class BasePromptTemplate(
prompt.save(file_path="path/prompt.yaml")
"""
if self.partial_variables:
raise ValueError("Cannot save prompt with partial variables.")
msg = "Cannot save prompt with partial variables."
raise ValueError(msg)
# Fetch dictionary to save
prompt_dict = self.dict()
if "_type" not in prompt_dict:
raise NotImplementedError(f"Prompt {self} does not support saving.")
msg = f"Prompt {self} does not support saving."
raise NotImplementedError(msg)
# Convert file to Path object.
save_path = Path(file_path) if isinstance(file_path, str) else file_path
@@ -361,7 +365,8 @@ class BasePromptTemplate(
with open(file_path, "w") as f:
yaml.dump(prompt_dict, f, default_flow_style=False)
else:
raise ValueError(f"{save_path} must be json or yaml")
msg = f"{save_path} must be json or yaml"
raise ValueError(msg)
def _get_document_info(doc: Document, prompt: BasePromptTemplate[str]) -> dict:
@@ -371,11 +376,12 @@ def _get_document_info(doc: Document, prompt: BasePromptTemplate[str]) -> dict:
required_metadata = [
iv for iv in prompt.input_variables if iv != "page_content"
]
raise ValueError(
msg = (
f"Document prompt requires documents to have metadata variables: "
f"{required_metadata}. Received document with missing metadata: "
f"{list(missing_metadata)}."
)
raise ValueError(msg)
return {k: base_info[k] for k in prompt.input_variables}

View File

@@ -236,10 +236,11 @@ class MessagesPlaceholder(BaseMessagePromptTemplate):
else kwargs[self.variable_name]
)
if not isinstance(value, list):
raise ValueError(
msg = (
f"variable {self.variable_name} should be a list of base messages, "
f"got {value} of type {type(value)}"
)
raise ValueError(msg)
value = convert_to_messages(value)
if self.n_messages:
value = value[-self.n_messages :]
@@ -514,9 +515,8 @@ class _StringImageMessagePromptTemplate(BaseMessagePromptTemplate):
return cls(prompt=prompt, **kwargs)
elif isinstance(template, list):
if (partial_variables is not None) and len(partial_variables) > 0:
raise ValueError(
"Partial variables are not supported for list of templates."
)
msg = "Partial variables are not supported for list of templates."
raise ValueError(msg)
prompt = []
for tmpl in template:
if isinstance(tmpl, str) or isinstance(tmpl, dict) and "text" in tmpl:
@@ -536,11 +536,12 @@ class _StringImageMessagePromptTemplate(BaseMessagePromptTemplate):
vars = get_template_variables(img_template, "f-string")
if vars:
if len(vars) > 1:
raise ValueError(
msg = (
"Only one format variable allowed per image"
f" template.\nGot: {vars}"
f"\nFrom: {tmpl}"
)
raise ValueError(msg)
input_variables = [vars[0]]
img_template = {"url": img_template}
img_template_obj = ImagePromptTemplate(
@@ -559,13 +560,16 @@ class _StringImageMessagePromptTemplate(BaseMessagePromptTemplate):
input_variables=input_variables, template=img_template
)
else:
raise ValueError(f"Invalid image template: {tmpl}")
msg = f"Invalid image template: {tmpl}"
raise ValueError(msg)
prompt.append(img_template_obj)
else:
raise ValueError(f"Invalid template: {tmpl}")
msg = f"Invalid template: {tmpl}"
raise ValueError(msg)
return cls(prompt=prompt, **kwargs)
else:
raise ValueError(f"Invalid template: {template}")
msg = f"Invalid template: {template}"
raise ValueError(msg)
@classmethod
def from_template_file(
@@ -1042,7 +1046,8 @@ class ChatPromptTemplate(BaseChatPromptTemplate):
prompt = HumanMessagePromptTemplate.from_template(other)
return ChatPromptTemplate(messages=self.messages + [prompt]) # type: ignore[call-arg]
else:
raise NotImplementedError(f"Unsupported operand type for +: {type(other)}")
msg = f"Unsupported operand type for +: {type(other)}"
raise NotImplementedError(msg)
@model_validator(mode="before")
@classmethod
@@ -1085,11 +1090,12 @@ class ChatPromptTemplate(BaseChatPromptTemplate):
input_vars = input_vars - optional_variables
if "input_variables" in values and values.get("validate_template"):
if input_vars != set(values["input_variables"]):
raise ValueError(
msg = (
"Got mismatched input_variables. "
f"Expected: {input_vars}. "
f"Got: {values['input_variables']}"
)
raise ValueError(msg)
else:
values["input_variables"] = sorted(input_vars)
if optional_variables:
@@ -1214,7 +1220,8 @@ class ChatPromptTemplate(BaseChatPromptTemplate):
message = message_template.format_messages(**kwargs)
result.extend(message)
else:
raise ValueError(f"Unexpected input: {message_template}")
msg = f"Unexpected input: {message_template}"
raise ValueError(msg)
return result
async def aformat_messages(self, **kwargs: Any) -> list[BaseMessage]:
@@ -1241,7 +1248,8 @@ class ChatPromptTemplate(BaseChatPromptTemplate):
message = await message_template.aformat_messages(**kwargs)
result.extend(message)
else:
raise ValueError(f"Unexpected input: {message_template}")
msg = f"Unexpected input: {message_template}"
raise ValueError(msg)
return result
def partial(self, **kwargs: Any) -> ChatPromptTemplate:
@@ -1328,7 +1336,7 @@ class ChatPromptTemplate(BaseChatPromptTemplate):
Args:
file_path: path to file.
"""
raise NotImplementedError()
raise NotImplementedError
def pretty_repr(self, html: bool = False) -> str:
"""Human-readable representation.
@@ -1376,38 +1384,43 @@ def _create_template_from_message_type(
elif message_type == "placeholder":
if isinstance(template, str):
if template[0] != "{" or template[-1] != "}":
raise ValueError(
msg = (
f"Invalid placeholder template: {template}."
" Expected a variable name surrounded by curly braces."
)
raise ValueError(msg)
var_name = template[1:-1]
message = MessagesPlaceholder(variable_name=var_name, optional=True)
elif len(template) == 2 and isinstance(template[1], bool):
var_name_wrapped, is_optional = template
if not isinstance(var_name_wrapped, str):
raise ValueError(
msg = (
"Expected variable name to be a string." f" Got: {var_name_wrapped}"
)
raise ValueError(msg)
if var_name_wrapped[0] != "{" or var_name_wrapped[-1] != "}":
raise ValueError(
msg = (
f"Invalid placeholder template: {var_name_wrapped}."
" Expected a variable name surrounded by curly braces."
)
raise ValueError(msg)
var_name = var_name_wrapped[1:-1]
message = MessagesPlaceholder(variable_name=var_name, optional=is_optional)
else:
raise ValueError(
msg = (
"Unexpected arguments for placeholder message type."
" Expected either a single string variable name"
" or a list of [variable_name: str, is_optional: bool]."
f" Got: {template}"
)
raise ValueError(msg)
else:
raise ValueError(
msg = (
f"Unexpected message type: {message_type}. Use one of 'human',"
f" 'user', 'ai', 'assistant', or 'system'."
)
raise ValueError(msg)
return message
@@ -1448,7 +1461,8 @@ def _convert_to_message(
)
elif isinstance(message, tuple):
if len(message) != 2:
raise ValueError(f"Expected 2-tuple of (role, template), got {message}")
msg = f"Expected 2-tuple of (role, template), got {message}"
raise ValueError(msg)
message_type_str, template = message
if isinstance(message_type_str, str):
_message = _create_template_from_message_type(
@@ -1461,6 +1475,7 @@ def _convert_to_message(
)
)
else:
raise NotImplementedError(f"Unsupported message type: {type(message)}")
msg = f"Unsupported message type: {type(message)}"
raise NotImplementedError(msg)
return _message

View File

@@ -62,14 +62,12 @@ class _FewShotPromptTemplateMixin(BaseModel):
examples = values.get("examples")
example_selector = values.get("example_selector")
if examples and example_selector:
raise ValueError(
"Only one of 'examples' and 'example_selector' should be provided"
)
msg = "Only one of 'examples' and 'example_selector' should be provided"
raise ValueError(msg)
if examples is None and example_selector is None:
raise ValueError(
"One of 'examples' and 'example_selector' should be provided"
)
msg = "One of 'examples' and 'example_selector' should be provided"
raise ValueError(msg)
return values
@@ -90,9 +88,8 @@ class _FewShotPromptTemplateMixin(BaseModel):
elif self.example_selector is not None:
return self.example_selector.select_examples(kwargs)
else:
raise ValueError(
"One of 'examples' and 'example_selector' should be provided"
)
msg = "One of 'examples' and 'example_selector' should be provided"
raise ValueError(msg)
async def _aget_examples(self, **kwargs: Any) -> list[dict]:
"""Async get the examples to use for formatting the prompt.
@@ -111,9 +108,8 @@ class _FewShotPromptTemplateMixin(BaseModel):
elif self.example_selector is not None:
return await self.example_selector.aselect_examples(kwargs)
else:
raise ValueError(
"One of 'examples' and 'example_selector' should be provided"
)
msg = "One of 'examples' and 'example_selector' should be provided"
raise ValueError(msg)
class FewShotPromptTemplate(_FewShotPromptTemplateMixin, StringPromptTemplate):
@@ -243,7 +239,8 @@ class FewShotPromptTemplate(_FewShotPromptTemplateMixin, StringPromptTemplate):
ValueError: If example_selector is provided.
"""
if self.example_selector:
raise ValueError("Saving an example selector is not currently supported")
msg = "Saving an example selector is not currently supported"
raise ValueError(msg)
return super().save(file_path)
@@ -467,4 +464,4 @@ class FewShotChatMessagePromptTemplate(
Returns:
A pretty representation of the prompt template.
"""
raise NotImplementedError()
raise NotImplementedError

View File

@@ -54,14 +54,12 @@ class FewShotPromptWithTemplates(StringPromptTemplate):
examples = values.get("examples")
example_selector = values.get("example_selector")
if examples and example_selector:
raise ValueError(
"Only one of 'examples' and 'example_selector' should be provided"
)
msg = "Only one of 'examples' and 'example_selector' should be provided"
raise ValueError(msg)
if examples is None and example_selector is None:
raise ValueError(
"One of 'examples' and 'example_selector' should be provided"
)
msg = "One of 'examples' and 'example_selector' should be provided"
raise ValueError(msg)
return values
@@ -76,10 +74,11 @@ class FewShotPromptWithTemplates(StringPromptTemplate):
expected_input_variables |= set(self.prefix.input_variables)
missing_vars = expected_input_variables.difference(input_variables)
if missing_vars:
raise ValueError(
msg = (
f"Got input_variables={input_variables}, but based on "
f"prefix/suffix expected {expected_input_variables}"
)
raise ValueError(msg)
else:
self.input_variables = sorted(
set(self.suffix.input_variables)
@@ -216,5 +215,6 @@ class FewShotPromptWithTemplates(StringPromptTemplate):
ValueError: If example_selector is provided.
"""
if self.example_selector:
raise ValueError("Saving an example selector is not currently supported")
msg = "Saving an example selector is not currently supported"
raise ValueError(msg)
return super().save(file_path)

View File

@@ -20,11 +20,12 @@ class ImagePromptTemplate(BasePromptTemplate[ImageURL]):
overlap = set(kwargs["input_variables"]) & {"url", "path", "detail"}
if overlap:
raise ValueError(
msg = (
"input_variables for the image template cannot contain"
" any of 'url', 'path', or 'detail'."
f" Found: {overlap}"
)
raise ValueError(msg)
super().__init__(**kwargs)
@property
@@ -91,13 +92,16 @@ class ImagePromptTemplate(BasePromptTemplate[ImageURL]):
path = kwargs.get("path") or formatted.get("path")
detail = kwargs.get("detail") or formatted.get("detail")
if not url and not path:
raise ValueError("Must provide either url or path.")
msg = "Must provide either url or path."
raise ValueError(msg)
if not url:
if not isinstance(path, str):
raise ValueError("path must be a string.")
msg = "path must be a string."
raise ValueError(msg)
url = image_utils.image_to_data_url(path)
if not isinstance(url, str):
raise ValueError("url must be a string.")
msg = "url must be a string."
raise ValueError(msg)
output: ImageURL = {"url": url}
if detail:
# Don't check literal values here: let the API check them
@@ -128,4 +132,4 @@ class ImagePromptTemplate(BasePromptTemplate[ImageURL]):
Returns:
A pretty representation of the prompt.
"""
raise NotImplementedError()
raise NotImplementedError

View File

@@ -34,7 +34,8 @@ def load_prompt_from_config(config: dict) -> BasePromptTemplate:
config_type = config.pop("_type", "prompt")
if config_type not in type_to_loader_dict:
raise ValueError(f"Loading {config_type} prompt not supported")
msg = f"Loading {config_type} prompt not supported"
raise ValueError(msg)
prompt_loader = type_to_loader_dict[config_type]
return prompt_loader(config)
@@ -46,9 +47,8 @@ def _load_template(var_name: str, config: dict) -> dict:
if f"{var_name}_path" in config:
# If it does, make sure template variable doesn't also exist.
if var_name in config:
raise ValueError(
f"Both `{var_name}_path` and `{var_name}` cannot be provided."
)
msg = f"Both `{var_name}_path` and `{var_name}` cannot be provided."
raise ValueError(msg)
# Pop the template path from the config.
template_path = Path(config.pop(f"{var_name}_path"))
# Load the template.
@@ -73,12 +73,12 @@ def _load_examples(config: dict) -> dict:
elif config["examples"].endswith((".yaml", ".yml")):
examples = yaml.safe_load(f)
else:
raise ValueError(
"Invalid file format. Only json or yaml formats are supported."
)
msg = "Invalid file format. Only json or yaml formats are supported."
raise ValueError(msg)
config["examples"] = examples
else:
raise ValueError("Invalid examples format. Only list or string are supported.")
msg = "Invalid examples format. Only list or string are supported."
raise ValueError(msg)
return config
@@ -90,7 +90,8 @@ def _load_output_parser(config: dict) -> dict:
if output_parser_type == "default":
output_parser = StrOutputParser(**_config)
else:
raise ValueError(f"Unsupported output parser {output_parser_type}")
msg = f"Unsupported output parser {output_parser_type}"
raise ValueError(msg)
config["output_parser"] = output_parser
return config
@@ -103,10 +104,11 @@ def _load_few_shot_prompt(config: dict) -> FewShotPromptTemplate:
# Load the example prompt.
if "example_prompt_path" in config:
if "example_prompt" in config:
raise ValueError(
msg = (
"Only one of example_prompt and example_prompt_path should "
"be specified."
)
raise ValueError(msg)
config["example_prompt"] = load_prompt(config.pop("example_prompt_path"))
else:
config["example_prompt"] = load_prompt_from_config(config["example_prompt"])
@@ -126,11 +128,12 @@ def _load_prompt(config: dict) -> PromptTemplate:
if template_format == "jinja2":
# Disabled due to:
# https://github.com/langchain-ai/langchain/issues/4394
raise ValueError(
msg = (
f"Loading templates with '{template_format}' format is no longer supported "
f"since it can lead to arbitrary code execution. Please migrate to using "
f"the 'f-string' template format, which does not suffer from this issue."
)
raise ValueError(msg)
return PromptTemplate(**config)
@@ -151,11 +154,12 @@ def load_prompt(
RuntimeError: If the path is a Lang Chain Hub path.
"""
if isinstance(path, str) and path.startswith("lc://"):
raise RuntimeError(
msg = (
"Loading from the deprecated github-based Hub is no longer supported. "
"Please use the new LangChain Hub at https://smith.langchain.com/hub "
"instead."
)
raise RuntimeError(msg)
return _load_prompt_from_file(path, encoding)
@@ -173,7 +177,8 @@ def _load_prompt_from_file(
with open(file_path, encoding=encoding) as f:
config = yaml.safe_load(f)
else:
raise ValueError(f"Got unsupported file type {file_path.suffix}")
msg = f"Got unsupported file type {file_path.suffix}"
raise ValueError(msg)
# Load the prompt from the config now.
return load_prompt_from_config(config)
@@ -186,7 +191,8 @@ def _load_chat_prompt(config: dict) -> ChatPromptTemplate:
config.pop("input_variables")
if not template:
raise ValueError("Can't load chat prompt without template")
msg = "Can't load chat prompt without template"
raise ValueError(msg)
return ChatPromptTemplate.from_template(template=template, **config)

View File

@@ -89,12 +89,12 @@ class PromptTemplate(StringPromptTemplate):
if values.get("validate_template"):
if values["template_format"] == "mustache":
raise ValueError("Mustache templates cannot be validated.")
msg = "Mustache templates cannot be validated."
raise ValueError(msg)
if "input_variables" not in values:
raise ValueError(
"Input variables must be provided to validate the template."
)
msg = "Input variables must be provided to validate the template."
raise ValueError(msg)
all_inputs = values["input_variables"] + list(values["partial_variables"])
check_valid_template(
@@ -131,13 +131,11 @@ class PromptTemplate(StringPromptTemplate):
# Allow for easy combining
if isinstance(other, PromptTemplate):
if self.template_format != "f-string":
raise ValueError(
"Adding prompt templates only supported for f-strings."
)
msg = "Adding prompt templates only supported for f-strings."
raise ValueError(msg)
if other.template_format != "f-string":
raise ValueError(
"Adding prompt templates only supported for f-strings."
)
msg = "Adding prompt templates only supported for f-strings."
raise ValueError(msg)
input_variables = list(
set(self.input_variables) | set(other.input_variables)
)
@@ -147,7 +145,8 @@ class PromptTemplate(StringPromptTemplate):
partial_variables = dict(self.partial_variables.items())
for k, v in other.partial_variables.items():
if k in partial_variables:
raise ValueError("Cannot have same variable partialed twice.")
msg = "Cannot have same variable partialed twice."
raise ValueError(msg)
else:
partial_variables[k] = v
return PromptTemplate(
@@ -161,7 +160,8 @@ class PromptTemplate(StringPromptTemplate):
prompt = PromptTemplate.from_template(other)
return self + prompt
else:
raise NotImplementedError(f"Unsupported operand type for +: {type(other)}")
msg = f"Unsupported operand type for +: {type(other)}"
raise NotImplementedError(msg)
@property
def _prompt_type(self) -> str:

View File

@@ -42,13 +42,14 @@ def jinja2_formatter(template: str, /, **kwargs: Any) -> str:
try:
from jinja2.sandbox import SandboxedEnvironment
except ImportError as e:
raise ImportError(
msg = (
"jinja2 not installed, which is needed to use the jinja2_formatter. "
"Please install it with `pip install jinja2`."
"Please be cautious when using jinja2 templates. "
"Do not expand jinja2 templates using unverified or user-controlled "
"inputs as that can result in arbitrary Python code execution."
) from e
)
raise ImportError(msg) from e
# This uses a sandboxed environment to prevent arbitrary code execution.
# Jinja2 uses an opt-out rather than opt-in approach for sand-boxing.
@@ -89,10 +90,11 @@ def _get_jinja2_variables_from_template(template: str) -> set[str]:
try:
from jinja2 import Environment, meta
except ImportError as e:
raise ImportError(
msg = (
"jinja2 not installed, which is needed to use the jinja2_formatter. "
"Please install it with `pip install jinja2`."
) from e
)
raise ImportError(msg) from e
env = Environment()
ast = env.parse(template)
variables = meta.find_undeclared_variables(ast)
@@ -217,17 +219,19 @@ def check_valid_template(
try:
validator_func = DEFAULT_VALIDATOR_MAPPING[template_format]
except KeyError as exc:
raise ValueError(
msg = (
f"Invalid template format {template_format!r}, should be one of"
f" {list(DEFAULT_FORMATTER_MAPPING)}."
) from exc
)
raise ValueError(msg) from exc
try:
validator_func(template, input_variables)
except (KeyError, IndexError) as exc:
raise ValueError(
msg = (
"Invalid prompt schema; check for mismatched or missing input parameters"
f" from {input_variables}."
) from exc
)
raise ValueError(msg) from exc
def get_template_variables(template: str, template_format: str) -> list[str]:
@@ -253,7 +257,8 @@ def get_template_variables(template: str, template_format: str) -> list[str]:
elif template_format == "mustache":
input_variables = mustache_template_vars(template)
else:
raise ValueError(f"Unsupported template format: {template_format}")
msg = f"Unsupported template format: {template_format}"
raise ValueError(msg)
return sorted(input_variables)

View File
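The jinja2 hunks above keep template rendering inside jinja2's sandboxed environment precisely so that untrusted templates cannot execute arbitrary Python. A minimal sketch of that idea, with a made-up template string:

from jinja2.sandbox import SandboxedEnvironment

env = SandboxedEnvironment()
template = env.from_string("Hello {{ name }}!")
print(template.render(name="world"))  # -> Hello world!

# A template that tries to reach Python internals through attribute access,
# e.g. {{ ''.__class__ }}, is blocked by the sandbox (typically surfacing as
# jinja2's SecurityError) instead of being evaluated.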

@@ -156,6 +156,5 @@ class StructuredPrompt(ChatPromptTemplate):
name=name,
)
else:
raise NotImplementedError(
"Structured prompts need to be piped to a language model."
)
msg = "Structured prompts need to be piped to a language model."
raise NotImplementedError(msg)

View File

@@ -249,7 +249,13 @@ class InMemoryRateLimiter(BaseRateLimiter):
return self._consume()
while not self._consume():
await asyncio.sleep(self.check_every_n_seconds)
# This code ignores the ASYNC110 warning which is a false positive in this
# case.
# There is no external actor that can mark that the Event is done
# since the tokens are managed by the rate limiter itself.
# It needs to wake up to re-fill the tokens.
# https://docs.astral.sh/ruff/rules/async-busy-wait/
await asyncio.sleep(self.check_every_n_seconds) # ruff: noqa: ASYNC110
return True

View File
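The rate-limiter hunk above documents why the async busy-wait loop is intentional and why ASYNC110 is suppressed: the limiter itself refills the token bucket, so there is no external event to wait on. A hedged usage sketch of the class; the constructor argument names are written from memory and may differ slightly from the released API:

from langchain_core.rate_limiters import InMemoryRateLimiter

limiter = InMemoryRateLimiter(
    requests_per_second=0.5,      # allow roughly one request every two seconds
    check_every_n_seconds=0.1,    # how often the loop above re-checks the bucket
    max_bucket_size=1,            # no bursting
)

limiter.acquire()   # blocks until a token is available (aacquire() is the async twin)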

@@ -292,10 +292,11 @@ class Runnable(Generic[Input, Output], ABC):
if type_args and len(type_args) == 2:
return type_args[0]
raise TypeError(
msg = (
f"Runnable {self.get_name()} doesn't have an inferable InputType. "
"Override the InputType property to specify the input type."
)
raise TypeError(msg)
@property
def OutputType(self) -> type[Output]: # noqa: N802
@@ -313,10 +314,11 @@ class Runnable(Generic[Input, Output], ABC):
if type_args and len(type_args) == 2:
return type_args[1]
raise TypeError(
msg = (
f"Runnable {self.get_name()} doesn't have an inferable OutputType. "
"Override the OutputType property to specify the output type."
)
raise TypeError(msg)
@property
def input_schema(self) -> type[BaseModel]:
@@ -1161,7 +1163,7 @@ class Runnable(Generic[Input, Output], ABC):
- ``data``: **Dict[str, Any]**
Below is a table that illustrates some evens that might be emitted by various
Below is a table that illustrates some events that might be emitted by various
chains. Metadata fields have been omitted from the table for brevity.
Chain definitions have been included after the table.
@@ -1379,9 +1381,8 @@ class Runnable(Generic[Input, Output], ABC):
**kwargs,
)
else:
raise NotImplementedError(
'Only versions "v1" and "v2" of the schema is currently supported.'
)
msg = 'Only versions "v1" and "v2" of the schema is currently supported.'
raise NotImplementedError(msg)
async with aclosing(event_stream):
async for event in event_stream:
@@ -2513,10 +2514,11 @@ class RunnableSerializable(Serializable, Runnable[Input, Output]):
for key in kwargs:
if key not in self.model_fields:
raise ValueError(
msg = (
f"Configuration key {key} not found in {self}: "
f"available keys are {self.model_fields.keys()}"
)
raise ValueError(msg)
return RunnableConfigurableFields(default=self, fields=kwargs)
@@ -2772,9 +2774,8 @@ class RunnableSequence(RunnableSerializable[Input, Output]):
else:
steps_flat.append(coerce_to_runnable(step))
if len(steps_flat) < 2:
raise ValueError(
f"RunnableSequence must have at least 2 steps, got {len(steps_flat)}"
)
msg = f"RunnableSequence must have at least 2 steps, got {len(steps_flat)}"
raise ValueError(msg)
super().__init__( # type: ignore[call-arg]
first=steps_flat[0],
middle=list(steps_flat[1:-1]),
@@ -2923,7 +2924,8 @@ class RunnableSequence(RunnableSerializable[Input, Output]):
step_graph.trim_last_node()
step_first_node, _ = graph.extend(step_graph)
if not step_first_node:
raise ValueError(f"Runnable {step} has no first node")
msg = f"Runnable {step} has no first node"
raise ValueError(msg)
if current_last_node:
graph.add_edge(current_last_node, step_first_node)
@@ -3655,9 +3657,11 @@ class RunnableParallel(RunnableSerializable[Input, dict[str, Any]]):
else:
step_first_node, step_last_node = graph.extend(step_graph)
if not step_first_node:
raise ValueError(f"Runnable {step} has no first node")
msg = f"Runnable {step} has no first node"
raise ValueError(msg)
if not step_last_node:
raise ValueError(f"Runnable {step} has no last node")
msg = f"Runnable {step} has no last node"
raise ValueError(msg)
graph.add_edge(input_node, step_first_node)
graph.add_edge(step_last_node, output_node)
@@ -4049,10 +4053,11 @@ class RunnableGenerator(Runnable[Input, Output]):
self._transform = transform
func_for_name = transform
else:
raise TypeError(
msg = (
"Expected a generator function type for `transform`."
f"Instead got an unsupported type: {type(transform)}"
)
raise TypeError(msg)
try:
self.name = name or func_for_name.__name__
@@ -4161,7 +4166,8 @@ class RunnableGenerator(Runnable[Input, Output]):
**kwargs: Any,
) -> Iterator[Output]:
if not hasattr(self, "_transform"):
raise NotImplementedError(f"{repr(self)} only supports async methods.")
msg = f"{repr(self)} only supports async methods."
raise NotImplementedError(msg)
return self._transform_stream_with_config(
input,
self._transform, # type: ignore[arg-type]
@@ -4192,7 +4198,8 @@ class RunnableGenerator(Runnable[Input, Output]):
**kwargs: Any,
) -> AsyncIterator[Output]:
if not hasattr(self, "_atransform"):
raise NotImplementedError(f"{repr(self)} only supports sync methods.")
msg = f"{repr(self)} only supports sync methods."
raise NotImplementedError(msg)
return self._atransform_stream_with_config(
input, self._atransform, config, **kwargs
@@ -4320,21 +4327,23 @@ class RunnableLambda(Runnable[Input, Output]):
if is_async_callable(func) or is_async_generator(func):
if afunc is not None:
raise TypeError(
msg = (
"Func was provided as a coroutine function, but afunc was "
"also provided. If providing both, func should be a regular "
"function to avoid ambiguity."
)
raise TypeError(msg)
self.afunc = func
func_for_name = func
elif callable(func):
self.func = cast(Callable[[Input], Output], func)
func_for_name = func
else:
raise TypeError(
msg = (
"Expected a callable type for `func`."
f"Instead got an unsupported type: {type(func)}"
)
raise TypeError(msg)
try:
if name is not None:
@@ -4497,9 +4506,11 @@ class RunnableLambda(Runnable[Input, Output]):
else:
dep_first_node, dep_last_node = graph.extend(dep_graph)
if not dep_first_node:
raise ValueError(f"Runnable {dep} has no first node")
msg = f"Runnable {dep} has no first node"
raise ValueError(msg)
if not dep_last_node:
raise ValueError(f"Runnable {dep} has no last node")
msg = f"Runnable {dep} has no last node"
raise ValueError(msg)
graph.add_edge(input_node, dep_first_node)
graph.add_edge(dep_last_node, output_node)
else:
@@ -4560,9 +4571,10 @@ class RunnableLambda(Runnable[Input, Output]):
if isinstance(output, Runnable):
recursion_limit = config["recursion_limit"]
if recursion_limit <= 0:
raise RecursionError(
msg = (
f"Recursion limit reached when invoking {self} with input {input}."
)
raise RecursionError(msg)
output = output.invoke(
input,
patch_config(
@@ -4659,9 +4671,10 @@ class RunnableLambda(Runnable[Input, Output]):
if isinstance(output, Runnable):
recursion_limit = config["recursion_limit"]
if recursion_limit <= 0:
raise RecursionError(
msg = (
f"Recursion limit reached when invoking {self} with input {input}."
)
raise RecursionError(msg)
output = await output.ainvoke(
input,
patch_config(
@@ -4704,10 +4717,11 @@ class RunnableLambda(Runnable[Input, Output]):
**kwargs,
)
else:
raise TypeError(
msg = (
"Cannot invoke a coroutine function synchronously."
"Use `ainvoke` instead."
)
raise TypeError(msg)
async def ainvoke(
self,
@@ -4778,10 +4792,11 @@ class RunnableLambda(Runnable[Input, Output]):
if isinstance(output, Runnable):
recursion_limit = config["recursion_limit"]
if recursion_limit <= 0:
raise RecursionError(
msg = (
f"Recursion limit reached when invoking "
f"{self} with input {final}."
)
raise RecursionError(msg)
for chunk in output.stream(
final,
patch_config(
@@ -4809,10 +4824,11 @@ class RunnableLambda(Runnable[Input, Output]):
**kwargs,
)
else:
raise TypeError(
msg = (
"Cannot stream a coroutine function synchronously."
"Use `astream` instead."
)
raise TypeError(msg)
def stream(
self,
@@ -4849,10 +4865,11 @@ class RunnableLambda(Runnable[Input, Output]):
afunc = self.afunc
else:
if inspect.isgeneratorfunction(self.func):
raise TypeError(
msg = (
"Cannot stream from a generator function asynchronously."
"Use .stream() instead."
)
raise TypeError(msg)
def func(
input: Input,
@@ -4899,10 +4916,11 @@ class RunnableLambda(Runnable[Input, Output]):
if isinstance(output, Runnable):
recursion_limit = config["recursion_limit"]
if recursion_limit <= 0:
raise RecursionError(
msg = (
f"Recursion limit reached when invoking "
f"{self} with input {final}."
)
raise RecursionError(msg)
async for chunk in output.astream(
final,
patch_config(
@@ -5061,9 +5079,8 @@ class RunnableEachBase(RunnableSerializable[list[Input], list[Output]]):
**kwargs: Optional[Any],
) -> AsyncIterator[StreamEvent]:
for _ in range(1):
raise NotImplementedError(
"RunnableEach does not support astream_events yet."
)
msg = "RunnableEach does not support astream_events yet."
raise NotImplementedError(msg)
yield
@@ -5819,10 +5836,11 @@ def coerce_to_runnable(thing: RunnableLike) -> Runnable[Input, Output]:
elif isinstance(thing, dict):
return cast(Runnable[Input, Output], RunnableParallel(thing))
else:
raise TypeError(
msg = (
f"Expected a Runnable, callable or dict."
f"Instead got an unsupported type: {type(thing)}"
)
raise TypeError(msg)
@overload

View File

@@ -92,7 +92,8 @@ class RunnableBranch(RunnableSerializable[Input, Output]):
ValueError: If a branch is not of length 2.
"""
if len(branches) < 2:
raise ValueError("RunnableBranch requires at least two branches")
msg = "RunnableBranch requires at least two branches"
raise ValueError(msg)
default = branches[-1]
@@ -100,9 +101,8 @@ class RunnableBranch(RunnableSerializable[Input, Output]):
default,
(Runnable, Callable, Mapping), # type: ignore[arg-type]
):
raise TypeError(
"RunnableBranch default must be Runnable, callable or mapping."
)
msg = "RunnableBranch default must be Runnable, callable or mapping."
raise TypeError(msg)
default_ = cast(
Runnable[Input, Output], coerce_to_runnable(cast(RunnableLike, default))
@@ -112,16 +112,18 @@ class RunnableBranch(RunnableSerializable[Input, Output]):
for branch in branches[:-1]:
if not isinstance(branch, (tuple, list)): # type: ignore[arg-type]
raise TypeError(
msg = (
f"RunnableBranch branches must be "
f"tuples or lists, not {type(branch)}"
)
raise TypeError(msg)
if not len(branch) == 2:
raise ValueError(
if len(branch) != 2:
msg = (
f"RunnableBranch branches must be "
f"tuples or lists of length 2, not {len(branch)}"
)
raise ValueError(msg)
condition, runnable = branch
condition = cast(Runnable[Input, bool], coerce_to_runnable(condition))
runnable = coerce_to_runnable(runnable)
@@ -185,7 +187,8 @@ class RunnableBranch(RunnableSerializable[Input, Output]):
and s.id.endswith(CONTEXT_CONFIG_SUFFIX_SET)
for s in specs
):
raise ValueError("RunnableBranch cannot contain context setters.")
msg = "RunnableBranch cannot contain context setters."
raise ValueError(msg)
return specs
def invoke(

View File
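The stricter validation above enforces that every non-default branch is a (condition, runnable) pair of length 2, with a lone default at the end. A hedged usage sketch; the lambdas are invented for illustration:

from langchain_core.runnables import RunnableBranch, RunnableLambda

branch = RunnableBranch(
    (lambda x: isinstance(x, str), RunnableLambda(lambda x: x.upper())),
    (lambda x: isinstance(x, int), RunnableLambda(lambda x: x + 1)),
    RunnableLambda(lambda x: x),   # default branch, deliberately not a 2-tuple
)

branch.invoke("hello")   # -> "HELLO"
branch.invoke(41)        # -> 42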

@@ -219,12 +219,14 @@ def get_config_list(
"""
if length < 0:
raise ValueError(f"length must be >= 0, but got {length}")
msg = f"length must be >= 0, but got {length}"
raise ValueError(msg)
if isinstance(config, Sequence) and len(config) != length:
raise ValueError(
msg = (
f"config must be a list of the same length as inputs, "
f"but got {len(config)} configs for {length} inputs"
)
raise ValueError(msg)
if isinstance(config, Sequence):
return list(map(ensure_config, config))

View File

@@ -632,7 +632,8 @@ class RunnableConfigurableAlternatives(DynamicRunnable[Input, Output]):
else:
return (alt(), config)
else:
raise ValueError(f"Unknown alternative: {which}")
msg = f"Unknown alternative: {which}"
raise ValueError(msg)
def _strremoveprefix(s: str, prefix: str) -> str:

View File

@@ -152,10 +152,11 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
self, input: Input, config: Optional[RunnableConfig] = None, **kwargs: Any
) -> Output:
if self.exception_key is not None and not isinstance(input, dict):
raise ValueError(
msg = (
"If 'exception_key' is specified then input must be a dictionary."
f"However found a type of {type(input)} for input"
)
raise ValueError(msg)
# setup callbacks
config = ensure_config(config)
callback_manager = get_callback_manager_for_config(config)
@@ -192,7 +193,8 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
run_manager.on_chain_end(output)
return output
if first_error is None:
raise ValueError("No error stored at end of fallbacks.")
msg = "No error stored at end of fallbacks."
raise ValueError(msg)
run_manager.on_chain_error(first_error)
raise first_error
@@ -203,10 +205,11 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
**kwargs: Optional[Any],
) -> Output:
if self.exception_key is not None and not isinstance(input, dict):
raise ValueError(
msg = (
"If 'exception_key' is specified then input must be a dictionary."
f"However found a type of {type(input)} for input"
)
raise ValueError(msg)
# setup callbacks
config = ensure_config(config)
callback_manager = get_async_callback_manager_for_config(config)
@@ -243,7 +246,8 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
await run_manager.on_chain_end(output)
return output
if first_error is None:
raise ValueError("No error stored at end of fallbacks.")
msg = "No error stored at end of fallbacks."
raise ValueError(msg)
await run_manager.on_chain_error(first_error)
raise first_error
@@ -260,10 +264,11 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
if self.exception_key is not None and not all(
isinstance(input, dict) for input in inputs
):
raise ValueError(
msg = (
"If 'exception_key' is specified then inputs must be dictionaries."
f"However found a type of {type(inputs[0])} for input"
)
raise ValueError(msg)
if not inputs:
return []
@@ -352,10 +357,11 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
if self.exception_key is not None and not all(
isinstance(input, dict) for input in inputs
):
raise ValueError(
msg = (
"If 'exception_key' is specified then inputs must be dictionaries."
f"However found a type of {type(inputs[0])} for input"
)
raise ValueError(msg)
if not inputs:
return []
@@ -447,10 +453,11 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
) -> Iterator[Output]:
""""""
if self.exception_key is not None and not isinstance(input, dict):
raise ValueError(
msg = (
"If 'exception_key' is specified then input must be a dictionary."
f"However found a type of {type(input)} for input"
)
raise ValueError(msg)
# setup callbacks
config = ensure_config(config)
callback_manager = get_callback_manager_for_config(config)
@@ -510,10 +517,11 @@ class RunnableWithFallbacks(RunnableSerializable[Input, Output]):
**kwargs: Optional[Any],
) -> AsyncIterator[Output]:
if self.exception_key is not None and not isinstance(input, dict):
raise ValueError(
msg = (
"If 'exception_key' is specified then input must be a dictionary."
f"However found a type of {type(input)} for input"
)
raise ValueError(msg)
# setup callbacks
config = ensure_config(config)
callback_manager = get_async_callback_manager_for_config(config)

View File

@@ -330,7 +330,8 @@ class Graph:
ValueError: If a node with the same id already exists.
"""
if id is not None and id in self.nodes:
raise ValueError(f"Node with id {id} already exists")
msg = f"Node with id {id} already exists"
raise ValueError(msg)
id = id or self.next_id()
node = Node(id=id, data=data, metadata=metadata, name=node_data_str(id, data))
self.nodes[node.id] = node
@@ -371,9 +372,11 @@ class Graph:
ValueError: If the source or target node is not in the graph.
"""
if source.id not in self.nodes:
raise ValueError(f"Source node {source.id} not in graph")
msg = f"Source node {source.id} not in graph"
raise ValueError(msg)
if target.id not in self.nodes:
raise ValueError(f"Target node {target.id} not in graph")
msg = f"Target node {target.id} not in graph"
raise ValueError(msg)
edge = Edge(
source=source.id, target=target.id, data=data, conditional=conditional
)

View File

@@ -161,9 +161,8 @@ def _build_sugiyama_layout(
route_with_lines,
)
except ImportError as exc:
raise ImportError(
"Install grandalf to draw graphs: `pip install grandalf`."
) from exc
msg = "Install grandalf to draw graphs: `pip install grandalf`."
raise ImportError(msg) from exc
#
# Just a reminder about naming conventions:

Some files were not shown because too many files have changed in this diff.