huggingface[patch]: Support for HuggingFacePipeline in ChatHuggingFace. (#22194)

- **Description:** Added support for using HuggingFacePipeline in ChatHuggingFace (previously it was only usable with API endpoints, probably by oversight).
- **Issue:** #19997
- **Dependencies:** none
- **Twitter handle:** none

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>
2025-09-02 19:47:13 +00:00 · 2024-06-04 02:47:35 +02:00
parent 0061ded002
commit 98b2e7b195
2 changed files with 69 additions and 3 deletions
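
For orientation, here is a minimal sketch of how the change might be used: a locally loaded `HuggingFacePipeline` is wrapped in `ChatHuggingFace` so it can be driven with chat messages. The helper names `generation_kwargs`, `build_chat_model`, and `demo` are hypothetical and exist only for this illustration. The model ID and generation settings mirror the notebook cell added in this commit, and the prompt text is made up. Loading the model assumes `langchain-huggingface` and `transformers` are installed and that the machine has enough memory for a 7B model.

```python
from typing import Any, Dict, Optional


def generation_kwargs(max_new_tokens: int = 512) -> Dict[str, Any]:
    """Generation settings mirroring the notebook cell added in this commit."""
    return dict(
        max_new_tokens=max_new_tokens,
        do_sample=False,
        repetition_penalty=1.03,
    )


def build_chat_model(
    model_id: str = "HuggingFaceH4/zephyr-7b-beta",
    model_kwargs: Optional[Dict[str, Any]] = None,
):
    """Hypothetical helper: load a local pipeline and wrap it as a chat model."""
    # Imported lazily so this module can be imported without the heavy dependencies.
    from langchain_huggingface import ChatHuggingFace, HuggingFacePipeline

    llm = HuggingFacePipeline.from_model_id(
        model_id=model_id,
        task="text-generation",
        pipeline_kwargs=generation_kwargs(),
        # Optional extra arguments for model loading, e.g. a quantization config.
        model_kwargs=model_kwargs,
    )
    # The point of this commit: ChatHuggingFace accepts a HuggingFacePipeline.
    return ChatHuggingFace(llm=llm)


def demo() -> None:
    """Downloads and runs the model; call this only on a suitable machine."""
    from langchain_core.messages import HumanMessage, SystemMessage

    chat = build_chat_model()
    reply = chat.invoke(
        [
            SystemMessage(content="You are a helpful assistant."),
            HumanMessage(content="Summarize what a tokenizer does in one sentence."),
        ]
    )
    print(reply.content)
```

Calling `demo()` triggers the model download and a single generation. Keeping the heavy imports inside the functions is a choice made for this sketch, not something the commit prescribes.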
--- a/docs/docs/integrations/chat/huggingface.ipynb
+++ b/docs/docs/integrations/chat/huggingface.ipynb
@@ -58,6 +58,62 @@
    ")"
   ]
  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "### `HuggingFacePipeline`"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "from langchain_huggingface import HuggingFacePipeline\n",
+    "\n",
+    "llm = HuggingFacePipeline.from_model_id(\n",
+    "    model_id=\"HuggingFaceH4/zephyr-7b-beta\",\n",
+    "    task=\"text-generation\",\n",
+    "    pipeline_kwargs=dict(\n",
+    "        max_new_tokens=512,\n",
+    "        do_sample=False,\n",
+    "        repetition_penalty=1.03,\n",
+    "    ),\n",
+    ")"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "To run a quantized version, you might specify a `bitsandbytes` quantization config as follows:\n",
+    "\n",
+    "```python\n",
+    "from transformers import BitsAndBytesConfig\n",
+    "\n",
+    "quantization_config = BitsAndBytesConfig(\n",
+    "    load_in_4bit=True,\n",
+    "    bnb_4bit_quant_type=\"nf4\",\n",
+    "    bnb_4bit_compute_dtype=\"float16\",\n",
+    "    bnb_4bit_use_double_quant=True\n",
+    ")\n",
+    "```\n",
+    "\n",
+    "and pass it to the `HuggingFacePipeline` as a part of its `model_kwargs`:\n",
+    "\n",
+    "```python\n",
+    "pipeline = HuggingFacePipeline.from_model_id(\n",
+    "    ...\n",
+    "\n",
+    "    model_kwargs={\"quantization_config\": quantization_config},\n",
+    "    \n",
+    "    ...\n",
+    ")\n",
+    "```"
+   ]
+  },
  {
   "cell_type": "markdown",
   "metadata": {},