docs: document_loaders improvements (#4200)

- made notebooks consistent: titles, service/format descriptions. - corrected short names to full names, for example, `Word` -> `Microsoft Word` - added missed descriptions - renamed notebook files to make ToC correctly sorted
2025-10-05 12:19:26 +00:00 · 2023-05-05 17:44:54 -07:00
parent eeb7c96e0c
commit 59204a5033
43 changed files with 151 additions and 72 deletions
--- a/docs/modules/indexes/document_loaders/examples/hugging_face_dataset.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/hugging_face_dataset.ipynb
@@ -5,12 +5,11 @@
   "id": "04c9fdc5",
   "metadata": {},
   "source": [
-    "# HuggingFace dataset \n",
+    "# HuggingFace dataset\n",
    "\n",
-    "The [Hugging Face Hub](https://huggingface.co/docs/hub/index) hosts a large number of community-curated datasets for a diverse range of tasks such as translation,\n",
+    ">The [Hugging Face Hub](https://huggingface.co/docs/hub/index) is home to over 5,000 [datasets](https://huggingface.co/docs/hub/index#datasets) in more than 100 languages that can be used for a broad range of tasks across NLP, Computer Vision, and Audio. They used for a diverse range of tasks such as translation,\n",
    "automatic speech recognition, and image classification.\n",
    "\n",
-    ">The `Hugging Face Hub` is home to over 5,000 [datasets](https://huggingface.co/docs/hub/index#datasets) in more than 100 languages that can be used for a broad range of tasks across NLP, Computer Vision, and Audio.\n",
    "\n",
    "This notebook shows how to load `Hugging Face Hub` datasets to LangChain."
   ]