docs: update scrapegraph.ipynb (#32617)

I updated ScrapeGraphAI for checking the new ScrapeGraphAI tool
This commit is contained in:
Marco Vinciguerra
2025-09-11 04:33:57 +02:00
committed by GitHub
parent e4a90490c3
commit 64fe1e9a80

View File

@@ -32,6 +32,7 @@
"| [SmartScraperTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n", "| [SmartScraperTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n",
"| [SmartCrawlerTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n", "| [SmartCrawlerTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n",
"| [MarkdownifyTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n", "| [MarkdownifyTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n",
"| [AgenticScraperTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n",
"| [GetCreditsTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n", "| [GetCreditsTool](https://python.langchain.com/docs/integrations/tools/scrapegraph) | langchain-scrapegraph | ✅ | ❌ | ![PyPI - Version](https://img.shields.io/pypi/v/langchain-scrapegraph?style=flat-square&label=%20) |\n",
"\n", "\n",
"### Tool features\n", "### Tool features\n",
@@ -41,6 +42,7 @@
"| SmartScraperTool | Extract structured data from websites | URL + prompt | JSON |\n", "| SmartScraperTool | Extract structured data from websites | URL + prompt | JSON |\n",
"| SmartCrawlerTool | Extract data from multiple pages with crawling | URL + prompt + crawl options | JSON |\n", "| SmartCrawlerTool | Extract data from multiple pages with crawling | URL + prompt + crawl options | JSON |\n",
"| MarkdownifyTool | Convert webpages to markdown | URL | Markdown text |\n", "| MarkdownifyTool | Convert webpages to markdown | URL | Markdown text |\n",
"| AgenticScraperTool | Extract specifying steps | URL | Markdown text |\n",
"| GetCreditsTool | Check API credits | None | Credit info |\n", "| GetCreditsTool | Check API credits | None | Credit info |\n",
"\n", "\n",
"\n", "\n",
@@ -51,7 +53,7 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": 3, "execution_count": null,
"id": "f85b4089", "id": "f85b4089",
"metadata": {}, "metadata": {},
"outputs": [ "outputs": [
@@ -79,7 +81,7 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": 4, "execution_count": null,
"id": "e0b178a2", "id": "e0b178a2",
"metadata": {}, "metadata": {},
"outputs": [], "outputs": [],
@@ -285,7 +287,7 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": 7, "execution_count": null,
"id": "f90e33a7", "id": "f90e33a7",
"metadata": {}, "metadata": {},
"outputs": [ "outputs": [
@@ -329,7 +331,7 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": 5, "execution_count": null,
"id": "af3123ad", "id": "af3123ad",
"metadata": {}, "metadata": {},
"outputs": [ "outputs": [
@@ -353,7 +355,7 @@
}, },
{ {
"cell_type": "code", "cell_type": "code",
"execution_count": 8, "execution_count": null,
"id": "fdbf35b5", "id": "fdbf35b5",
"metadata": {}, "metadata": {},
"outputs": [ "outputs": [