docs: document_loaders improvements (#4200)

- made notebooks consistent: titles, service/format descriptions.
- corrected short names to full names, for example, `Word` -> `Microsoft
Word`
- added missed descriptions
- renamed notebook files to make ToC correctly sorted
This commit is contained in:
Leonid Ganeline
2023-05-05 17:44:54 -07:00
committed by GitHub
parent eeb7c96e0c
commit 59204a5033
43 changed files with 151 additions and 72 deletions

View File

@@ -5,12 +5,11 @@
"id": "04c9fdc5",
"metadata": {},
"source": [
"# HuggingFace dataset \n",
"# HuggingFace dataset\n",
"\n",
"The [Hugging Face Hub](https://huggingface.co/docs/hub/index) hosts a large number of community-curated datasets for a diverse range of tasks such as translation,\n",
">The [Hugging Face Hub](https://huggingface.co/docs/hub/index) is home to over 5,000 [datasets](https://huggingface.co/docs/hub/index#datasets) in more than 100 languages that can be used for a broad range of tasks across NLP, Computer Vision, and Audio. They used for a diverse range of tasks such as translation,\n",
"automatic speech recognition, and image classification.\n",
"\n",
">The `Hugging Face Hub` is home to over 5,000 [datasets](https://huggingface.co/docs/hub/index#datasets) in more than 100 languages that can be used for a broad range of tasks across NLP, Computer Vision, and Audio.\n",
"\n",
"This notebook shows how to load `Hugging Face Hub` datasets to LangChain."
]