Vwp/docs improved document loaders (#4006)

Huge thanks to @leo-gan for improving the document loaders notebooks --------- Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com>
2025-10-04 20:00:25 +00:00 · 2023-05-02 15:24:53 -07:00
parent 1c68cbdb28
commit aa38355999
57 changed files with 1227 additions and 779 deletions
--- a/docs/modules/indexes/document_loaders/examples/readthedocs_documentation.ipynb
+++ b/docs/modules/indexes/document_loaders/examples/readthedocs_documentation.ipynb
@@ -6,11 +6,24 @@
   "metadata": {},
   "source": [
    "# ReadTheDocs Documentation\n",
-    "This notebook covers how to load content from html that was generated as part of a Read-The-Docs build.\n",
+    "\n",
+    ">[Read the Docs](https://readthedocs.org/) is an open-sourced free software documentation hosting platform. It generates documentation written with the `Sphinx` documentation generator.\n",
+    "\n",
+    "This notebook covers how to load content from HTML that was generated as part of a `Read-The-Docs` build.\n",
    "\n",
    "For an example of this in the wild, see [here](https://github.com/hwchase17/chat-langchain).\n",
    "\n",
-    "This assumes that the html has already been scraped into a folder. This can be done by uncommenting and running the following command"
+    "This assumes that the HTML has already been scraped into a folder. This can be done by uncommenting and running the following command"
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "id": "3d153e07-8339-4cbe-8481-fc08644ba927",
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "#!pip install beautifulsoup4"
   ]
  },
  {
@@ -25,9 +38,11 @@
  },
  {
   "cell_type": "code",
-   "execution_count": 2,
+   "execution_count": 1,
   "id": "92dd950b",
-   "metadata": {},
+   "metadata": {
+    "tags": []
+   },
   "outputs": [],
   "source": [
    "from langchain.document_loaders import ReadTheDocsLoader"
@@ -70,7 +85,7 @@
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
-   "version": "3.10.9"
+   "version": "3.10.6"
  }
 },
 "nbformat": 4,