diff --git a/docs/docs/integrations/providers/scrapeless.mdx b/docs/docs/integrations/providers/scrapeless.mdx
new file mode 100644
index 00000000000..7b25f8a5b52
--- /dev/null
+++ b/docs/docs/integrations/providers/scrapeless.mdx
@@ -0,0 +1,26 @@
+# Scrapeless
+
+[Scrapeless](https://scrapeless.com) offers flexible and feature-rich data acquisition services with extensive parameter customization and multi-format export support.
+
+## Installation and Setup
+
+```bash
+pip install langchain-scrapeless
+```
+
+You'll need to set up your Scrapeless API key:
+
+```python
+import os
+os.environ["SCRAPELESS_API_KEY"] = "your-api-key"
+```
+
+## Tools
+
+The Scrapeless integration provides several tools:
+
+- [ScrapelessDeepSerpGoogleSearchTool](/docs/integrations/tools/scrapeless_scraping_api) - Enables comprehensive extraction of Google SERP data across all result types.
+- [ScrapelessDeepSerpGoogleTrendsTool](/docs/integrations/tools/scrapeless_scraping_api) - Retrieves keyword trend data from Google, including popularity over time, regional interest, and related searches.
+- [ScrapelessUniversalScrapingTool](/docs/integrations/tools/scrapeless_universal_scraping) - Access and extract data from JS-Render websites that typically block bots.
+- [ScrapelessCrawlerCrawlTool](/docs/integrations/tools/scrapeless_crawl) - Crawl a website and its linked pages to extract comprehensive data.
+- [ScrapelessCrawlerScrapeTool](/docs/integrations/tools/scrapeless_crawl) - Extract information from a single webpage.
diff --git a/docs/docs/integrations/tools/scrapeless_crawl.ipynb b/docs/docs/integrations/tools/scrapeless_crawl.ipynb
new file mode 100644
index 00000000000..efed9d88ff9
--- /dev/null
+++ b/docs/docs/integrations/tools/scrapeless_crawl.ipynb
@@ -0,0 +1,446 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "a6f91f20",
+ "metadata": {},
+ "source": [
+ "# Scrapeless\n",
+ "\n",
+ "[**Scrapeless**](https://www.scrapeless.com/) offers flexible and feature-rich data acquisition services with extensive parameter customization and multi-format export support. These capabilities empower LangChain to integrate and leverage external data more effectively. The core functional modules include:\n",
+ "\n",
+ "**DeepSerp**\n",
+ "- **Google Search**: Enables comprehensive extraction of Google SERP data across all result types.\n",
+ " - Supports selection of localized Google domains (e.g., `google.com`, `google.ad`) to retrieve region-specific search results.\n",
+ " - Pagination supported for retrieving results beyond the first page.\n",
+ " - Supports a search result filtering toggle to control whether to exclude duplicate or similar content.\n",
+ "- **Google Trends**: Retrieves keyword trend data from Google, including popularity over time, regional interest, and related searches.\n",
+ " - Supports multi-keyword comparison.\n",
+ " - Supports multiple data types: `interest_over_time`, `interest_by_region`, `related_queries`, and `related_topics`.\n",
+ " - Allows filtering by specific Google properties (Web, YouTube, News, Shopping) for source-specific trend analysis.\n",
+ "\n",
+ "**Universal Scraping**\n",
+ "- Designed for modern, JavaScript-heavy websites, allowing dynamic content extraction.\n",
+ " - Global premium proxy support for bypassing geo-restrictions and improving reliability.\n",
+ "\n",
+ "**Crawler**\n",
+ "- **Crawl**: Recursively crawl a website and its linked pages to extract site-wide content.\n",
+ " - Supports configurable crawl depth and scoped URL targeting.\n",
+ "- **Scrape**: Extract content from a single webpage with high precision.\n",
+ " - Supports \"main content only\" extraction to exclude ads, footers, and other non-essential elements.\n",
+ " - Allows batch scraping of multiple standalone URLs.\n",
+ "\n",
+ "## Overview\n",
+ "\n",
+ "### Integration details\n",
+ "\n",
+ "| Class | Package | Serializable | JS support | Package latest |\n",
+ "| :--- | :--- | :---: | :---: | :---: |\n",
+ "| [ScrapelessCrawlerScrapeTool](https://pypi.org/project/langchain-scrapeless/) | [langchain-scrapeless](https://pypi.org/project/langchain-scrapeless/) | ✅ | ❌ |  |\n",
+ "| [ScrapelessCrawlerCrawlTool](https://pypi.org/project/langchain-scrapeless/) | [langchain-scrapeless](https://pypi.org/project/langchain-scrapeless/) | ✅ | ❌ |  |\n",
+ "\n",
+ "### Tool features\n",
+ "\n",
+ "|Native async|Returns artifact|Return data|\n",
+ "|:-:|:-:|:-:|\n",
+ "|✅|✅|markdown, rawHtml, screenshot@fullPage, json, links, screenshot, html|\n",
+ "\n",
+ "\n",
+ "## Setup\n",
+ "\n",
+ "The integration lives in the `langchain-scrapeless` package."
+ ]
+ },
+ {
+ "cell_type": "raw",
+ "id": "ca676665",
+ "metadata": {
+ "vscode": {
+ "languageId": "raw"
+ }
+ },
+ "source": [
+ "!pip install langchain-scrapeless"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "b15e9266",
+ "metadata": {},
+ "source": [
+ "### Credentials\n",
+ "\n",
+ "You'll need a Scrapeless API key to use this tool. You can set it as an environment variable:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "e0b178a2-8816-40ca-b57c-ccdd86dde9c9",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import os\n",
+ "\n",
+ "os.environ[\"SCRAPELESS_API_KEY\"] = \"your-api-key\""
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "1c97218f-f366-479d-8bf7-fe9f2f6df73f",
+ "metadata": {},
+ "source": [
+ "## Instantiation\n",
+ "\n",
+ "### ScrapelessCrawlerScrapeTool\n",
+ "\n",
+ "The ScrapelessCrawlerScrapeTool allows you to scrape content from one or multiple websites using Scrapeless’s Crawler Scrape API. You can extract the main content, control formatting, headers, wait times, and output types.\n",
+ "\n",
+ "\n",
+ "The tool accepts the following parameters:\n",
+ "- `urls` (required, List[str]): One or more URLs of websites you want to scrape.\n",
+ "- `formats` (optional, List[str]): Defines the format(s) of the scraped output. Default is `['markdown']`. Options include:\n",
+ " - `'markdown'`\n",
+ " - `'rawHtml'`\n",
+ " - `'screenshot@fullPage'`\n",
+ " - `'json'`\n",
+ " - `'links'`\n",
+ " - `'screenshot'`\n",
+ " - `'html'`\n",
+ "- `only_main_content` (optional, bool): Whether to return only the main page content, excluding headers, navs, footers, etc. Default is True.\n",
+ "- `include_tags` (optional, List[str]): A list of HTML tags to include in the output (e.g., `['h1', 'p']`). If set to None, no tags are explicitly included.\n",
+ "- `exclude_tags` (optional, List[str]): A list of HTML tags to exclude from the output. If set to None, no tags are explicitly excluded.\n",
+ "- `headers` (optional, Dict[str, str]): Custom headers to send with the request (e.g., for cookies or user-agent). Default is None.\n",
+ "- `wait_for` (optional, int): Time to wait in milliseconds before scraping. Useful for giving the page time to fully load. Default is `0`.\n",
+ "- `timeout` (optional, int): Request timeout in milliseconds. Default is `30000`.\n",
+ "\n",
+ "### ScrapelessCrawlerCrawlTool\n",
+ "\n",
+ "The ScrapelessCrawlerCrawlTool allows you to crawl a website starting from a base URL using Scrapeless’s Crawler Crawl API. It supports advanced filtering of URLs, crawl depth control, content scraping options, headers customization, and more.\n",
+ "\n",
+ "The tool accepts the following parameters:\n",
+ "- `url` (required, str): The base URL to start crawling from.\n",
+ "\n",
+ "- `limit` (optional, int): Maximum number of pages to crawl. Default is `10000`.\n",
+ "- `include_paths` (optional, List[str]): URL pathname regex patterns to include matching URLs in the crawl. Only URLs matching these patterns will be included. For example, setting `[\"blog/.*\"]` will only include URLs under the `/blog/` path. Default is None.\n",
+ "- `exclude_paths` (optional, List[str]): URL pathname regex patterns to exclude matching URLs from the crawl. For example, setting `[\"blog/.*\"]` will exclude URLs under the `/blog/` path. Default is None.\n",
+ "- `max_depth` (optional, int): Maximum crawl depth relative to the base URL, measured by the number of slashes in the URL path. Default is `10`.\n",
+ "- `max_discovery_depth` (optional, int): Maximum crawl depth based on discovery order. Root and sitemapped pages have depth `0`. For example, setting to `1` and ignoring sitemap will crawl only the entered URL and its immediate links. Default is None.\n",
+ "- `ignore_sitemap` (optional, bool): Whether to ignore the website sitemap during crawling. Default is False.\n",
+ "- `ignore_query_params` (optional, bool): Whether to ignore query parameter differences to avoid re-scraping similar URLs. Default is False.\n",
+ "- `deduplicate_similar_urls` (optional, bool): Whether to deduplicate similar URLs. Default is True.\n",
+ "- `regex_on_full_url` (optional, bool): Whether regex matching applies to the full URL instead of just the path. Default is True.\n",
+ "- `allow_backward_links` (optional, bool): Whether to allow crawling backlinks outside the URL hierarchy. Default is False.\n",
+ "- `allow_external_links` (optional, bool): Whether to allow crawling links to external websites. Default is False.\n",
+ "- `delay` (optional, int): Delay in seconds between page scrapes to respect rate limits. Default is `1`.\n",
+ "- `formats` (optional, List[str]): The format(s) of the scraped content. Default is [\"markdown\"]. Options include:\n",
+ " - `'markdown'`\n",
+ " - `'rawHtml'`\n",
+ " - `'screenshot@fullPage'`\n",
+ " - `'json'`\n",
+ " - `'links'`\n",
+ " - `'screenshot'`\n",
+ " - `'html'`\n",
+ "- `only_main_content` (optional, bool): Whether to return only the main content, excluding headers, navigation bars, footers, etc. Default is True.\n",
+ "- `include_tags` (optional, List[str]): List of HTML tags to include in the output (e.g., `['h1', 'p']`). Default is None (no explicit include filter).\n",
+ "- `exclude_tags` (optional, List[str]): List of HTML tags to exclude from the output. Default is None (no explicit exclude filter).\n",
+ "- `headers` (optional, Dict[str, str]): Custom HTTP headers to send with the requests, such as cookies or user-agent strings. Default is None.\n",
+ "- `wait_for` (optional, int): Time in milliseconds to wait before scraping the content, allowing the page to load fully. Default is `0`.\n",
+ "- `timeout` (optional, int):Request timeout in milliseconds. Default is `30000`."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "74147a1a",
+ "metadata": {},
+ "source": [
+ "## Invocation\n",
+ "\n",
+ "### ScrapelessCrawlerCrawlTool\n",
+ "\n",
+ "#### Usage with Parameters"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "65310a8b-eb0c-4d9e-a618-4f4abe2414fc",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'success': True, 'status': 'completed', 'completed': 1, 'total': 1, 'data': [{'markdown': '# Well hello there.\\n\\nWelcome to exmaple.com.\\n\\nChances are you got here by mistake (example.com, anyone?)', 'metadata': {'scrapeId': '547b2478-a41a-4a17-8015-8db378ee455f', 'sourceURL': 'https://exmaple.com', 'url': 'https://exmaple.com', 'statusCode': 200}}]}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessCrawlerCrawlTool\n",
+ "\n",
+ "tool = ScrapelessCrawlerCrawlTool()\n",
+ "\n",
+ "# Advanced usage\n",
+ "result = tool.invoke({\"url\": \"https://exmaple.com\", \"limit\": 4})\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "659f9fbd-6fcf-445f-aa8c-72d8e60154bd",
+ "metadata": {},
+ "source": [
+ "#### Use within an agent"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "af3123ad-7a02-40e5-b58e-7d56e23e5830",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "================================\u001b[1m Human Message \u001b[0m=================================\n",
+ "\n",
+ "Use the scrapeless crawler crawl tool to crawl the website https://example.com and output the markdown content as a string.\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "Tool Calls:\n",
+ " scrapeless_crawler_crawl (call_Ne5HbxqsYDOKFaGDSuc4xppB)\n",
+ " Call ID: call_Ne5HbxqsYDOKFaGDSuc4xppB\n",
+ " Args:\n",
+ " url: https://example.com\n",
+ " formats: ['markdown']\n",
+ " limit: 1\n",
+ "=================================\u001b[1m Tool Message \u001b[0m=================================\n",
+ "Name: scrapeless_crawler_crawl\n",
+ "\n",
+ "{\"success\": true, \"status\": \"completed\", \"completed\": 1, \"total\": 1, \"data\": [{\"markdown\": \"# Example Domain\\n\\nThis domain is for use in illustrative examples in documents. You may use this\\ndomain in literature without prior coordination or asking for permission.\\n\\n[More information...](https://www.iana.org/domains/example)\", \"metadata\": {\"viewport\": \"width=device-width, initial-scale=1\", \"title\": \"Example Domain\", \"scrapeId\": \"00561460-9166-492b-8fed-889667383e55\", \"sourceURL\": \"https://example.com\", \"url\": \"https://example.com\", \"statusCode\": 200}}]}\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "\n",
+ "The crawl of the website https://example.com has been completed. Here is the markdown content extracted from the website:\n",
+ "\n",
+ "```\n",
+ "# Example Domain\n",
+ "\n",
+ "This domain is for use in illustrative examples in documents. You may use this\n",
+ "domain in literature without prior coordination or asking for permission.\n",
+ "\n",
+ "[More information...](https://www.iana.org/domains/example)\n",
+ "```\n",
+ "\n",
+ "You can find more information on the website [here](https://www.iana.org/domains/example).\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_openai import ChatOpenAI\n",
+ "from langchain_scrapeless import ScrapelessCrawlerCrawlTool\n",
+ "from langgraph.prebuilt import create_react_agent\n",
+ "\n",
+ "llm = ChatOpenAI()\n",
+ "\n",
+ "tool = ScrapelessCrawlerCrawlTool()\n",
+ "\n",
+ "# Use the tool with an agent\n",
+ "tools = [tool]\n",
+ "agent = create_react_agent(llm, tools)\n",
+ "\n",
+ "for chunk in agent.stream(\n",
+ " {\n",
+ " \"messages\": [\n",
+ " (\n",
+ " \"human\",\n",
+ " \"Use the scrapeless crawler crawl tool to crawl the website https://example.com and output the markdown content as a string.\",\n",
+ " )\n",
+ " ]\n",
+ " },\n",
+ " stream_mode=\"values\",\n",
+ "):\n",
+ " chunk[\"messages\"][-1].pretty_print()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "769b2246",
+ "metadata": {},
+ "source": [
+ "### ScrapelessCrawlerScrapeTool\n",
+ "\n",
+ "#### Usage with Parameters"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "ca993de7",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'parameters': {'engine': 'google.trends.search', 'hl': 'en', 'data_type': 'INTEREST_OVER_TIME', 'tz': '0', 'cat': '0', 'date': 'today 1-m', 'q': 'Funny 2048,negamon monster trainer'}, 'interest_over_time': {'timeline_data': [{'date': 'Jul 11, 2025', 'timestamp': '1752192000', 'value': [0, 0]}, {'date': 'Jul 12, 2025', 'timestamp': '1752278400', 'value': [0, 0]}, {'date': 'Jul 13, 2025', 'timestamp': '1752364800', 'value': [0, 0]}, {'date': 'Jul 14, 2025', 'timestamp': '1752451200', 'value': [0, 0]}, {'date': 'Jul 15, 2025', 'timestamp': '1752537600', 'value': [0, 0]}, {'date': 'Jul 16, 2025', 'timestamp': '1752624000', 'value': [0, 0]}, {'date': 'Jul 17, 2025', 'timestamp': '1752710400', 'value': [0, 0]}, {'date': 'Jul 18, 2025', 'timestamp': '1752796800', 'value': [0, 0]}, {'date': 'Jul 19, 2025', 'timestamp': '1752883200', 'value': [0, 0]}, {'date': 'Jul 20, 2025', 'timestamp': '1752969600', 'value': [0, 0]}, {'date': 'Jul 21, 2025', 'timestamp': '1753056000', 'value': [0, 0]}, {'date': 'Jul 22, 2025', 'timestamp': '1753142400', 'value': [0, 0]}, {'date': 'Jul 23, 2025', 'timestamp': '1753228800', 'value': [0, 0]}, {'date': 'Jul 24, 2025', 'timestamp': '1753315200', 'value': [0, 0]}, {'date': 'Jul 25, 2025', 'timestamp': '1753401600', 'value': [0, 0]}, {'date': 'Jul 26, 2025', 'timestamp': '1753488000', 'value': [0, 0]}, {'date': 'Jul 27, 2025', 'timestamp': '1753574400', 'value': [0, 0]}, {'date': 'Jul 28, 2025', 'timestamp': '1753660800', 'value': [0, 0]}, {'date': 'Jul 29, 2025', 'timestamp': '1753747200', 'value': [0, 0]}, {'date': 'Jul 30, 2025', 'timestamp': '1753833600', 'value': [0, 0]}, {'date': 'Jul 31, 2025', 'timestamp': '1753920000', 'value': [0, 0]}, {'date': 'Aug 1, 2025', 'timestamp': '1754006400', 'value': [0, 0]}, {'date': 'Aug 2, 2025', 'timestamp': '1754092800', 'value': [0, 0]}, {'date': 'Aug 3, 2025', 'timestamp': '1754179200', 'value': [0, 0]}, {'date': 'Aug 4, 2025', 'timestamp': '1754265600', 'value': [0, 0]}, {'date': 'Aug 5, 2025', 'timestamp': '1754352000', 'value': [0, 0]}, {'date': 'Aug 6, 2025', 'timestamp': '1754438400', 'value': [0, 0]}, {'date': 'Aug 7, 2025', 'timestamp': '1754524800', 'value': [0, 0]}, {'date': 'Aug 8, 2025', 'timestamp': '1754611200', 'value': [0, 0]}, {'date': 'Aug 9, 2025', 'timestamp': '1754697600', 'value': [0, 0]}, {'date': 'Aug 10, 2025', 'timestamp': '1754784000', 'value': [0, 100]}, {'date': 'Aug 11, 2025', 'timestamp': '1754870400', 'value': [0, 0]}], 'averages': [{'value': 0}, {'value': 3}], 'isPartial': True}}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleTrendsTool\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleTrendsTool()\n",
+ "\n",
+ "# Basic usage\n",
+ "result = tool.invoke(\"Funny 2048,negamon monster trainer\")\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "1a4db36f",
+ "metadata": {},
+ "source": [
+ "#### Advanced Usage with Parameters"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "42c83c46",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'success': True, 'status': 'completed', 'completed': 1, 'total': 1, 'data': [{'markdown': \"[🩵 Don't just take our word for it. See what our users say on Product Hunt.](https://www.producthunt.com/posts/scrapeless-deep-serpapi)\\n\\n# Effortless Web Scraping Toolkit for Business and Developers\\n\\nThe ultimate scraper's companion: an expandable suite of tools, including\\n\\nScraping Browser, Scraping API, Universal Scraping API\\n\\nand Anti-Bot Solutions—designed to work together or independently.\\n\\n[**4.8**](https://www.g2.com/products/scrapeless/reviews) [**4.5**](https://www.trustpilot.com/review/scrapeless.com) [**4.8**](https://slashdot.org/software/p/Scrapeless/) [**8.5**](https://tekpon.com/software/scrapeless/reviews/)\\n\\nNo credit card required\\n\\n## A Flexible Toolkit for Accessing Public Web Data\\n\\nAI-powered seamless data extraction, effortlessly bypassing blocks with a single API call.\\n\\n[scrapeless](https://www.scrapeless.com/en)\\n\\n[\\\\\\\\\\n\\\\\\\\\\nView more\\\\\\\\\\n\\\\\\\\\\n20+ custom parameters\\\\\\\\\\n\\\\\\\\\\n20+ Google SERP scenarios\\\\\\\\\\n\\\\\\\\\\nPrecision Search Fueling LLM & RAG AI\\\\\\\\\\n\\\\\\\\\\n1-2s response; $0.1/1k queries](https://www.scrapeless.com/en/product/deep-serp-api) [\\\\\\\\\\n\\\\\\\\\\nView more\\\\\\\\\\n\\\\\\\\\\nHuman-like Behavior\\\\\\\\\\n\\\\\\\\\\nHigh Performance\\\\\\\\\\n\\\\\\\\\\nBypassing Risk Control\\\\\\\\\\n\\\\\\\\\\nConnect using the CDP Protocol](https://www.scrapeless.com/en/product/scraping-browser) [\\\\\\\\\\n\\\\\\\\\\nView more\\\\\\\\\\n\\\\\\\\\\nSession Mode\\\\\\\\\\n\\\\\\\\\\nCustom TLS\\\\\\\\\\n\\\\\\\\\\nJs Render](https://www.scrapeless.com/en/product/universal-scraping-api)\\n\\n### Customized Services\\n\\nContact our technical experts for custom solutions.\\n\\nBook a demo\\n\\n## From Simple Data Scraping to Complex Anti-Bot Challenges, Scrapeless Has You Covered.\\n\\nFlexible Toolkit for Adapting to Diverse Data Extraction Needs.\\n\\n[Try for Free](https://app.scrapeless.com/passport/register)\\n\\n### Fully Compatible with Key Programming Languages and Tools\\n\\nSeamlessly integrate across all devices, OS, and languages. Worry-free compatibility ensures smooth data collection.\\n\\nGet all example codes on the dashboard after login\\n\\n\\n\\n## Enterprise-level Data Scraping Solution\\n\\nHigh-quality, tailored web scraping solutions and expert services designed for critical business projects.\\n\\n### Customized Data Scraping Solutions\\n\\nTailored web scraping services designed to address your\\xa0 unique business requirements and deliver actionable insights.\\n\\n### High Concurrency and High-Performance Scraping\\n\\nEfficiently gather massive volumes of data with unparalleled speed and reliability,\\xa0ensuring optimal performance even under heavy load.\\n\\n### Data Cleaning and Transformation\\n\\nEnhance data accuracy and usability through comprehensive\\xa0 cleaning and transformation processes, turning raw data into\\xa0 valuable information.\\n\\n### Real-Time Data Push and API Integration\\n\\nSeamlessly integrate and access live data streams with robust APIs,\\xa0ensuring your applications are always up-to-date with the latest information.\\n\\n### Data Security and Privacy Protection\\n\\nProtect your data with state-of-the-art security measures and strict\\xa0compliance standards, ensuring privacy and confidentiality at every step.\\n\\n### Enterprise-level SLA\\n\\nThe Service Level Agreement (SLA) serves as a safeguard for your project,\\xa0ensuring a contract for anticipated outcomes, automated oversight, prompt issue\\xa0resolution, and a personalized maintenance plan.\\n\\n## Why Scrapeless: Simplify Your Data Flow Effortlessly.\\n\\nAchieve all your data scraping tasks with more power, simplicity, and cost-effectiveness in less time.\\n\\n### Articles\\n\\nNews articles/Blog posts/Research papers\\n\\n### Organized Fresh Data\\n\\n### Prices\\n\\nProduct prices/Discount information/Market trend analysis\\n\\n### No need to hassle with browser maintenance\\n\\n### Reviews\\n\\nProduct reviews/User feedback/Social media reviews\\n\\n### Only pay for successful requests\\n\\n### Products\\n\\nProduct Launches/Tech Specs/Product Comparisons\\n\\n### Fully scalable\\n\\n## Unleash Your Competitive Edge in Data within the Industry\\n\\n## Regulate Compliance for All Users\\n\\nContact us\\n\\nWe are committed to using technology for the benefit of humanity and firmly oppose any illegal activities and misuse of our products. We support the collection of publicly available data to improve human life, while strongly opposing the collection of unauthorized or unapproved sensitive information. If you find anyone abusing our services, please provide us with feedback! To further enhance user confidence and control, we have established a dedicated Privacy Center aimed at empowering users with more capabilities and information rights.\\n\\n\\n\\n## Web Scraping Blog\\n\\nMost comprehensive guide, created for all Web Scraping developers.\\n\\n[View All Blogs](https://www.scrapeless.com/en/blog)\\n\\n[**Scrapeless MCP Server Is Officially Live! Build Your Ultimate AI-Web Connector** \\\\\\\\\\n\\\\\\\\\\nDiscover how the Scrapeless MCP Server gives LLMs real-time web browsing and scraping abilities. Learn how to build AI agents that search, extract, and interact with dynamic web content seamlessly.\\\\\\\\\\n\\\\\\\\\\nMichael Lee\\\\\\\\\\n\\\\\\\\\\n17-Jul-2025\\\\\\\\\\n\\\\\\\\\\n](https://www.scrapeless.com/en/blog/scrapeless-mcp-server) [**Product Updates \\\\| New Profile Feature** \\\\\\\\\\n\\\\\\\\\\nProduct Updates \\\\| Introducing the new Profile feature to enable persistent browser data storage, streamline cross-session workflows, and boost automation efficiency.\\\\\\\\\\n\\\\\\\\\\nEmily Chen\\\\\\\\\\n\\\\\\\\\\n17-Jul-2025\\\\\\\\\\n\\\\\\\\\\n](https://www.scrapeless.com/en/blog/scrapeelss-profile) [**How to Track Your Ranking on ChatGPT?** \\\\\\\\\\n\\\\\\\\\\nLearn why traditional SEO tools fall short and how Scrapeless helps you monitor and optimize your AI rankings effortlessly.\\\\\\\\\\n\\\\\\\\\\nMichael Lee\\\\\\\\\\n\\\\\\\\\\n01-Jul-2025\\\\\\\\\\n\\\\\\\\\\n](https://www.scrapeless.com/en/blog/chatgpt-scraper)\\n\\nContact our sales team\\n\\nMonday to Friday, 9:00 AM - 18:00 PMSingapore Standard Time (UTC+08:00)\\n\\nScrapeless offers AI-powered, robust, and scalable web scraping and automation services trusted by leading enterprises. Our enterprise-grade solutions are tailored to meet your project needs, with dedicated technical support throughout. With a strong technical team and flexible delivery times, we charge only for successful data, enabling efficient data extraction while bypassing limitations.\\n\\nContact us now to fuel your business growth.\\n\\n[**4.8**](https://www.g2.com/products/scrapeless/reviews) [**4.5**](https://www.trustpilot.com/review/scrapeless.com) [**4.8**](https://slashdot.org/software/p/Scrapeless/) [**8.5**](https://tekpon.com/software/scrapeless/reviews/)\\n\\nBook a demo\\n\\nProvide your contact details, and we'll promptly reach out to offer a product demo and introduction. We ensure your information remains confidential, complying with GDPR standards.\\n\\nGet a demo\\n\\nRegister and Claim Free Trial\\n\\nYour free trial is ready! Sign up for a Scrapeless account for free, and your trial will be instantly activated in your account.\\n\\n[Sign up](https://app.scrapeless.com/passport/register)\\n\\nWe value your privacy\\n\\nWe use cookies to analyze website usage and do not record any of your personal information. View [Privacy Policy](https://www.scrapeless.com/en/legal/privacy-policy)\\n\\nReject\\n\\nAccept\", 'metadata': {'language': 'en', 'description': 'Scrapeless is the best full-stack web scraping toolkit offering Scraping API, Scraping Browser, Universal Scraping API, Captcha Solver, and Proxies, designed to handle all your data collection needs with ease and reliability, empowering businesses and developers with efficient data extraction solutions.', 'google-site-verification': 'xj1xDpU8LpGG_h-2lIBVW_6GNW5Vtx0h5M3lz43HUXc', 'viewport': 'width=device-width, initial-scale=1', 'keywords': 'Scraping API, Scraping Browser, Universal Scraping API, Captcha Solver, and Proxies, web scraping, web scraper, web scraping api, Web scraper,data scraping, web crawler', 'next-size-adjust': '', 'favicon': 'https://www.scrapeless.com/favicon.ico', 'title': 'Effortless Web Scraping Toolkit - Scrapeless', 'scrapeId': 'c7189211-7034-4e86-9afd-89fa5268b013', 'sourceURL': 'https://www.scrapeless.com/en', 'url': 'https://www.scrapeless.com/en', 'statusCode': 200}}]}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessCrawlerScrapeTool\n",
+ "\n",
+ "tool = ScrapelessCrawlerScrapeTool()\n",
+ "\n",
+ "result = tool.invoke(\n",
+ " {\n",
+ " \"urls\": [\"https://exmaple.com\", \"https://www.scrapeless.com/en\"],\n",
+ " \"formats\": [\"markdown\"],\n",
+ " }\n",
+ ")\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "7dde00ff",
+ "metadata": {},
+ "source": [
+ "#### Use within an agent"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "f6ca1aff",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "================================\u001b[1m Human Message \u001b[0m=================================\n",
+ "\n",
+ "Use the scrapeless crawler scrape tool to get the website content of https://example.com and output the html content as a string.\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "Tool Calls:\n",
+ " scrapeless_crawler_scrape (call_qrPMGLjXmzb5QlVoIZgMuyPN)\n",
+ " Call ID: call_qrPMGLjXmzb5QlVoIZgMuyPN\n",
+ " Args:\n",
+ " urls: ['https://example.com']\n",
+ " formats: ['html']\n",
+ "=================================\u001b[1m Tool Message \u001b[0m=================================\n",
+ "Name: scrapeless_crawler_scrape\n",
+ "\n",
+ "{\"success\": true, \"status\": \"completed\", \"completed\": 1, \"total\": 1, \"data\": [{\"metadata\": {\"viewport\": \"width=device-width, initial-scale=1\", \"title\": \"Example Domain\", \"scrapeId\": \"63070ee5-ebef-4727-afe7-2b06466c6777\", \"sourceURL\": \"https://example.com\", \"url\": \"https://example.com\", \"statusCode\": 200}, \"html\": \"\\n\\n
\\n\\n
Example Domain
\\n
This domain is for use in illustrative examples in documents. You may use this\\n domain in literature without prior coordination or asking for permission.
\\n
More information...
\\n
\\n\\n\\n\"}]}\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "\n",
+ "The HTML content of the website \"https://example.com\" is as follows:\n",
+ "\n",
+ "```html\n",
+ "\n",
+ "\n",
+ "\n",
+ "
Example Domain
\n",
+ "
This domain is for use in illustrative examples in documents. You may use this\n",
+ " domain in literature without prior coordination or asking for permission.
\n",
+ "
More information...
\n",
+ "
\n",
+ "\n",
+ "\n",
+ "```\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_openai import ChatOpenAI\n",
+ "from langchain_scrapeless import ScrapelessCrawlerScrapeTool\n",
+ "from langgraph.prebuilt import create_react_agent\n",
+ "\n",
+ "llm = ChatOpenAI()\n",
+ "\n",
+ "tool = ScrapelessCrawlerScrapeTool()\n",
+ "\n",
+ "# Use the tool with an agent\n",
+ "tools = [tool]\n",
+ "agent = create_react_agent(llm, tools)\n",
+ "\n",
+ "for chunk in agent.stream(\n",
+ " {\n",
+ " \"messages\": [\n",
+ " (\n",
+ " \"human\",\n",
+ " \"Use the scrapeless crawler scrape tool to get the website content of https://example.com and output the html content as a string.\",\n",
+ " )\n",
+ " ]\n",
+ " },\n",
+ " stream_mode=\"values\",\n",
+ "):\n",
+ " chunk[\"messages\"][-1].pretty_print()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "4ac8146c",
+ "metadata": {},
+ "source": [
+ "## API reference\n",
+ "\n",
+ "- [Scrapeless Documentation](https://docs.scrapeless.com/en/crawl/quickstart/introduction/)\n",
+ "- [Scrapeless API Reference](https://apidocs.scrapeless.com/api-17509003)"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "langchain",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.12.11"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
diff --git a/docs/docs/integrations/tools/scrapeless_scraping_api.ipynb b/docs/docs/integrations/tools/scrapeless_scraping_api.ipynb
new file mode 100644
index 00000000000..701a7a4b39e
--- /dev/null
+++ b/docs/docs/integrations/tools/scrapeless_scraping_api.ipynb
@@ -0,0 +1,474 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "a6f91f20",
+ "metadata": {},
+ "source": [
+ "# Scrapeless\n",
+ "\n",
+ "**Scrapeless** offers flexible and feature-rich data acquisition services with extensive parameter customization and multi-format export support. These capabilities empower LangChain to integrate and leverage external data more effectively. The core functional modules include:\n",
+ "\n",
+ "**DeepSerp**\n",
+ "- **Google Search**: Enables comprehensive extraction of Google SERP data across all result types.\n",
+ " - Supports selection of localized Google domains (e.g., `google.com`, `google.ad`) to retrieve region-specific search results.\n",
+ " - Pagination supported for retrieving results beyond the first page.\n",
+ " - Supports a search result filtering toggle to control whether to exclude duplicate or similar content.\n",
+ "- **Google Trends**: Retrieves keyword trend data from Google, including popularity over time, regional interest, and related searches.\n",
+ " - Supports multi-keyword comparison.\n",
+ " - Supports multiple data types: `interest_over_time`, `interest_by_region`, `related_queries`, and `related_topics`.\n",
+ " - Allows filtering by specific Google properties (Web, YouTube, News, Shopping) for source-specific trend analysis.\n",
+ "\n",
+ "**Universal Scraping**\n",
+ "- Designed for modern, JavaScript-heavy websites, allowing dynamic content extraction.\n",
+ " - Global premium proxy support for bypassing geo-restrictions and improving reliability.\n",
+ "\n",
+ "**Crawler**\n",
+ "- **Crawl**: Recursively crawl a website and its linked pages to extract site-wide content.\n",
+ " - Supports configurable crawl depth and scoped URL targeting.\n",
+ "- **Scrape**: Extract content from a single webpage with high precision.\n",
+ " - Supports \"main content only\" extraction to exclude ads, footers, and other non-essential elements.\n",
+ " - Allows batch scraping of multiple standalone URLs.\n",
+ "\n",
+ "## Overview\n",
+ "\n",
+ "### Integration details\n",
+ "\n",
+ "| Class | Package | Serializable | JS support | Package latest |\n",
+ "| :--- | :--- | :---: | :---: | :---: |\n",
+ "| [ScrapelessDeepSerpGoogleSearchTool](https://pypi.org/project/langchain-scrapeless/) | [langchain-scrapeless](https://pypi.org/project/langchain-scrapeless/) | ✅ | ❌ |  |\n",
+ "| [ScrapelessDeepSerpGoogleTrendsTool](https://pypi.org/project/langchain-scrapeless/) | [langchain-scrapeless](https://pypi.org/project/langchain-scrapeless/) | ✅ | ❌ |  |\n",
+ "\n",
+ "### Tool features\n",
+ "\n",
+ "|Native async|Returns artifact|Return data|\n",
+ "|:-:|:-:|:-:|\n",
+ "|✅|❌|Search Results Based on Tool|\n",
+ "\n",
+ "\n",
+ "## Setup\n",
+ "\n",
+ "The integration lives in the `langchain-scrapeless` package."
+ ]
+ },
+ {
+ "cell_type": "raw",
+ "id": "ca676665",
+ "metadata": {
+ "vscode": {
+ "languageId": "raw"
+ }
+ },
+ "source": [
+ "!pip install langchain-scrapeless"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "b15e9266",
+ "metadata": {},
+ "source": [
+ "### Credentials\n",
+ "\n",
+ "You'll need a Scrapeless API key to use this tool. You can set it as an environment variable:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "e0b178a2-8816-40ca-b57c-ccdd86dde9c9",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import os\n",
+ "\n",
+ "os.environ[\"SCRAPELESS_API_KEY\"] = \"your-api-key\""
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "1c97218f-f366-479d-8bf7-fe9f2f6df73f",
+ "metadata": {},
+ "source": [
+ "## Instantiation\n",
+ "\n",
+ "### ScrapelessDeepSerpGoogleSearchTool\n",
+ "\n",
+ "Here we show how to instantiate an instance of the `ScrapelessDeepSerpGoogleSearchTool`. The universal Information Search Engine allows you to retrieve any data information.\n",
+ "- Retrieves any data information.\n",
+ "- Handles explanatory queries (e.g., \"why\", \"how\").\n",
+ "- Supports comparative analysis requests.\n",
+ "\n",
+ "The tool accepts the following parameters:\n",
+ "- `q`: (str) The search query string. Supports advanced Google syntax like `inurl:`, `site:`, `intitle:`, `as_eq`, etc.\n",
+ "- `hl`: (str) Language code for result content, e.g., `en`, `es`, `fr`. Default: `'en'`.\n",
+ "- `gl`: (str) Country code for geo-specific result targeting, e.g., `us`, `uk`, `de`. Default: `'us'`.\n",
+ "- `google_domain`: (str) Which Google domain to use (e.g., `'google.com'`, `'google.co.jp'`). Default: `'google.com'`.\n",
+ "- `start`: (int) Defines the result offset. It skips the given number of results. Used for pagination. Examples:\n",
+ " - `0` (default): the first page of results\n",
+ " - `10`: the second page\n",
+ " - `20`: the third page\n",
+ "- `num`: (int) Defines the maximum number of results to return. Examples:\n",
+ " - `10` (default): returns 10 results\n",
+ " - `40`: returns 40 results\n",
+ " - `100`: returns 100 results\n",
+ "- `ludocid`: (str) Defines the ID (CID) of the Google My Business listing you want to scrape. Also known as Google Place ID.\n",
+ "- `kgmid`: (str) Defines the ID (KGMID) of the Google Knowledge Graph listing you want to scrape. Also known as Google Knowledge Graph ID. Searches with the kgmid parameter will return results for the originally encrypted search parameters. For some searches, `kgmid` may override all other parameters except `start` and `num`.\n",
+ "- `ibp`: (str) Responsible for rendering layouts and expansions for some elements. Example: gwp;0,7 to expand searches with ludocid for expanded knowledge graph.\n",
+ "- `cr`: (str) Defines one or multiple countries to limit the search to. Uses format `country{two-letter country code}`, separated by `|`. Example:\n",
+ " - `countryFR|countryDE` only searches French and German pages.\n",
+ "- `lr`: (str) Defines one or multiple languages to limit the search to. Uses format `lang_{two-letter language code}`, separated by `|`. Example:\n",
+ " - `lang_fr|lang_de` only searches French and German pages.\n",
+ "- `tbs`: (str) Defines advanced search parameters not possible in the regular query field. Examples include advanced search for:\n",
+ " - `patents`\n",
+ " - `dates`\n",
+ " - `news`\n",
+ " - `videos`\n",
+ " - `images`\n",
+ " - `apps`\n",
+ " - `text` contents\n",
+ "- `safe`: (str) Defines the level of filtering for adult content. Values:\n",
+ " - `active`: blur explicit content\n",
+ " - `off`: no filtering\n",
+ "- `nfpr`: (str) Defines exclusion of results from auto-corrected queries when the original query is misspelled. Values:\n",
+ " - `1`: exclude these results\n",
+ " - `0` (default): include them\n",
+ " - Note: This may not prevent Google from returning auto-corrected results if no other results are available.\n",
+ "- `filter`: (str) Defines if `'Similar Results'` and `'Omitted Results'` filters are on or off. Values:\n",
+ " - `1` (default): enable filters\n",
+ " - `0`: disable filters\n",
+ "- `tbm`: (str) Defines the type of search to perform. Values:\n",
+ " - `none`: regular Google Search\n",
+ " - `isch`: Google Images\n",
+ " - `lcl`: Google Local\n",
+ " - `vid`: Google Videos\n",
+ " - `nws`: Google News\n",
+ " - `shop`: Google Shopping\n",
+ " - `pts`: Google Patents\n",
+ " - `jobs`: Google Jobs\n",
+ "\n",
+ "\n",
+ "### ScrapelessDeepSerpGoogleTrendsTool\n",
+ "\n",
+ "Here we show how to instantiate an instance of the `ScrapelessDeepSerpGoogleTrendsTool`. This tool allows you to query real-time or historical trend data from Google Trends with fine control over locale, category, and result type, using the Scrapeless API.\n",
+ "\n",
+ "The tool accepts the following parameters:\n",
+ "- `q` (required, str): Parameter defines the query or queries you want to search. You can use anything that you would use in a regular Google Trends search. The maximum number of queries per search is **5**. (This only applies to `interest_over_time` and `compared_breakdown_by_region` data types.) Other types of data will only accept **1 query** per search.\n",
+ "- `data_type` (optional, str): The type of data to retrieve. Default is `'interest_over_time'`. Options include:\n",
+ " - `autocomplete`\n",
+ " - `interest_over_time`\n",
+ " - `compared_breakdown_by_region`\n",
+ " - `interest_by_subregion`\n",
+ " - `related_queries`\n",
+ " - `related_topics`\n",
+ "- `date` (optional, str): Defines the date range to fetch data for. Default is `'today 1-m'`. Supported formats:\n",
+ " - Relative: `'now 1-H'`, `'now 7-d'`, `'today 12-m'`, `'today 5-y'`, `'all'`\n",
+ " - Custom date ranges: `'2023-01-01 2023-12-31'`\n",
+ " - With hours: `'2023-07-01T10 2023-07-03T22'`\n",
+ "- `hl` (optional, str): Language code to use in the search. Default is `'en'`. Examples:\n",
+ " - `'es'` (Spanish)\n",
+ " - `'fr'` (French)\n",
+ "- `tz` (optional, str): Time zone offset. Default is `'420'` (PST).\n",
+ "- `geo` (optional, str): Two-letter country code to define the geographic origin of the search. Examples include:\n",
+ " - `'US'` (United States)\n",
+ " - `'GB'` (United Kingdom)\n",
+ " - `'JP'` (Japan)\n",
+ " - Leave empty or `None` for worldwide search.\n",
+ "- `cat` (optional, `CategoryEnum`): Category ID to narrow down the search context. Default is `'all_categories'` (0). Categories can include:\n",
+ " - `'0'` – All categories\n",
+ " - Others like `'3'` – News, `'29'` – Sports, etc."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "74147a1a",
+ "metadata": {},
+ "source": [
+ "## Invocation\n",
+ "\n",
+ "### ScrapelessDeepSerpGoogleSearchTool\n",
+ "\n",
+ "#### Basic Usage"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "65310a8b-eb0c-4d9e-a618-4f4abe2414fc",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'inline_images': [{'position': 1, 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcTyTSMoVoK_U4eIL_sta9g-jpa0WkrHb8g4Ww&s', 'related_content_id': 'W25Uu31q8mmB1M,IdsKzFwOrIHamM', 'related_content_link': 'https://www.google.com/search/about-this-image?img=H4sIAAAAAAAA_wEXAOj_ChUIobar4MzLg9aBARDb3NHa28-a-WmbHuN1FwAAAA%3D%3D&q=https://www.parsehub.com/blog/web-scraping-examples/&ctx=iv&hl=en-US', 'source': 'www.parsehub.com', 'source_logo': '', 'title': 'Web Scraping Examples: How are Businesses Using Web Scraping ...', 'link': 'https://www.parsehub.com/blog/web-scraping-examples/', 'original': 'https://www.parsehub.com/blog/content/images/2019/10/web-scraping-examples.jpg', 'original_width': 800, 'original_height': 400, 'in_stock': False, 'is_product': False}], 'inline_videos': [{'position': 1, 'title': 'How This AI Tool Makes Web Scraping Cheaper & 10x ...', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc', 'channel': 'SkillCurb', 'duration': '7:44', 'platform': 'YouTube', 'key_moments': [{'time': '00:00', 'title': 'Introduction', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=0', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQa-iXYOifpWDZZJLq8V45waX8C_mRTvar2rFgKASqipA&s'}, {'time': '00:20', 'title': 'Overview', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=20', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQ2I1-L6_cnRoz_nbhUYcLsG8KOjJL8aM2tMZAez_zigg&s'}, {'time': '01:20', 'title': 'How to Use SERP API', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=80', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRettKVpkN7yFSzZBeYPNhmhTdhzmjSJ3p4vTP9oY5VmA&s'}, {'time': '03:30', 'title': 'Additional Features', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=210', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRBSPraPiDau9kIyxZbygn_csUKH3q8Eop_E8jzYTyBOg&s'}, {'time': '07:05', 'title': 'Final Thoughts', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=425', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSiSXSUSq3hJH7XOukFzMfXWj7iVcb-F2b9Lwug0KjXHQ&s'}]}, {'position': 2, 'title': 'Finally! Web Scraping Without Code That ACTUALLY Works ...', 'link': 'https://www.youtube.com/watch?v=bQTxINH9GbI', 'channel': 'Execute Automation', 'duration': '15:44', 'platform': 'YouTube'}, {'position': 3, 'title': 'Web Scraping 101: How To Scrape 99% of Sites', 'link': 'https://www.youtube.com/watch?v=WYp0dmZOHXM&pp=0gcJCdgAo7VqN5tD', 'channel': 'Dorian Develops', 'duration': '18:58', 'platform': 'YouTube'}], 'metadata': {'engine': 'google.search', 'rawUrl': 'https://api.scrapeless.com/storage/scrapeless.scraper.google.search/806ff29bc59caf4b65eaaca9aea80740/b4b20ee690f50282a5919181c2d72ae5_1754948945.html'}, 'organic_results': [{'position': 1, 'title': 'Scrapeless: Effortless Web Scraping Toolkit', 'link': 'https://www.scrapeless.com/', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.scrapeless.com/&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECB0QAQ', 'favicon': '', 'snippet': 'Scrapeless offers AI-powered, robust, and scalable web scraping and automation services trusted by leading enterprises. Our enterprise-grade solutions are\\xa0...', 'snippet_highlighted_words': ['Scrapeless offers AI-powered, robust, and scalable web scraping and automation services'], 'site_links': {'inline': [{'title': 'Scrapeless', 'link': 'https://app.scrapeless.com/'}, {'title': 'Sign up', 'link': 'https://app.scrapeless.com/passport/register'}, {'title': 'Pricing', 'link': 'https://www.scrapeless.com/en/pricing'}, {'title': 'Scraping API', 'link': 'https://www.scrapeless.com/en/product/scraping-api'}]}, 'source': 'Scrapeless'}, {'position': 2, 'title': 'Scrapeless FAQs and Solutions | Quick Help Center', 'link': 'https://www.scrapeless.com/en/faq', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.scrapeless.com/en/faq&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECBkQAQ', 'favicon': '', 'snippet': 'Scrapeless offers AI-powered, robust, and scalable web scraping and automation services trusted by leading enterprises.', 'snippet_highlighted_words': ['Scrapeless'], 'source': 'Scrapeless'}, {'position': 3, 'title': 'Scrapeless', 'link': 'https://github.com/scrapeless-ai', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://github.com/scrapeless-ai&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCAQAQ', 'favicon': '', 'snippet': 'Scrapeless.com offers an enterprise-grade, AI-driven web scraping toolkit designed to help businesses efficiently access public web data.', 'snippet_highlighted_words': ['enterprise-grade, AI-driven web scraping toolkit'], 'source': 'GitHub'}, {'position': 4, 'title': 'Read Customer Service Reviews of scrapeless.com', 'link': 'https://www.trustpilot.com/review/scrapeless.com', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.trustpilot.com/review/scrapeless.com&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCIQAQ', 'favicon': '', 'snippet': \"Do you agree with Scrapeless's 4-star rating? Check out what 34 people have written so far, and share your own experience.\", 'snippet_highlighted_words': [\"Scrapeless's 4-star rating\"], 'source': 'Trustpilot'}, {'position': 5, 'title': 'Scrapeless Review: Hands-on Testing and My Opinion', 'link': 'https://geekflare.com/proxy/scrapeless-review/', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://geekflare.com/proxy/scrapeless-review/&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCQQAQ', 'favicon': '', 'snippet': 'Apr 24, 2025 — Apr 24, 2025Scrapeless is a full-stack web scraping solution. It offers browser-based and API-based scraping, AI-powered automation, rotating proxies,\\xa0...', 'snippet_highlighted_words': ['Scrapeless is a full-stack web scraping solution'], 'source': 'Geekflare'}, {'position': 6, 'title': 'Scrapeless Review 2025', 'link': 'https://affmaven.com/scrapeless-review/', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://affmaven.com/scrapeless-review/&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCMQAQ', 'favicon': '', 'snippet': \"My favorite discovery was Scrapeless browser's ability to handle JavaScript-heavy sites. CAPTCHA solving worked flawlessly – I never had to solve one manually.\", 'snippet_highlighted_words': [\"Scrapeless browser's ability to handle JavaScript-heavy sites\"], 'source': 'AffMaven'}, {'position': 7, 'title': 'Meet Scrapeless: The Most Cost-effective No-Code Web ...', 'link': 'https://www.reddit.com/r/AIinBusinessNews/comments/1hylunb/meet_scrapeless_the_most_costeffective_nocode_web/', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.reddit.com/r/AIinBusinessNews/comments/1hylunb/meet_scrapeless_the_most_costeffective_nocode_web/&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCYQAQ', 'favicon': '', 'snippet': 'This AI web scraping toolkit can handle small-scale projects and millions of requests daily. Scrapeless can be used by businesses, developers,\\xa0...', 'snippet_highlighted_words': ['can handle small-scale projects'], 'source': 'Reddit\\xa0·\\xa0r/AIinBusinessNews'}, {'position': 8, 'title': 'Scrapeless - AI Agent', 'link': 'https://aiagentstore.ai/ai-agent/scrapeless', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://aiagentstore.ai/ai-agent/scrapeless&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCUQAQ', 'snippet': 'Feb 6, 2025 — Feb 6, 2025Scrapeless is an AI-driven web scraping toolkit designed to provide enterprises with efficient access to public web data.', 'snippet_highlighted_words': ['Scrapeless'], 'source': 'AI Agent Store'}, {'position': 9, 'title': 'Scrapeless Reviews 2025: Details, Pricing, & Features', 'link': 'https://www.g2.com/products/scrapeless/reviews', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.g2.com/products/scrapeless/reviews&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4QFnoECCEQAQ', 'favicon': '', 'snippet': 'Scrapeless is a powerful and flexible web scraping solution that helps businesses of all sizes access critical public web data with ease.', 'snippet_highlighted_words': ['Scrapeless is a powerful and flexible web scraping solution'], 'source': 'G2'}], 'pagination': {'current': 1, 'next': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=10&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8NMDegQIChAW', 'other_pages': {'10': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=90&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAU', '2': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=10&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAE', '3': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=20&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAG', '4': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=30&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAI', '5': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=40&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAK', '6': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=50&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAM', '7': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=60&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAO', '8': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=70&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAQ', '9': 'https://www.google.com/search?q=I+want+to+know+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=UWWaaMvfCsaJptQPyO2H8AM&start=80&sa=N&sstk=Ac65TH4S2Efk7veeqCXMSTw9cckfRJtP6_2AWF4wscNLO3ZE5TqDrBT8EHxsHOWi1jbiCPF8cF_COPsIAQ0Ud4fn5jmzYdfcqH6CQg&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q8tMDegQIChAS'}}, 'related_questions': [{'question': 'How much does simplescraper cost?', 'snippet': 'Simplescraper Pricing Plans Free: $0/month \\\\u2013 100 cloud scrape credits per month. Unlimited local (in-browser) scrapes, but limited cloud automation. Plus: $39/month \\\\u2013 ~6,000 cloud scrape credits. Adds recipe storage and priority email support.', 'date': 'May 20, 2025'}, {'question': 'Can websites detect scrapers?', 'snippet': \"The number one way sites detect web scrapers is by examining their IP address and tracking how it's behaving. If the server finds a pattern, strange behaviors, or an impossible request frequency (to name a few) for a real user, the server can block the IP address from accessing the site again.\", 'date': 'Jan 23, 2025'}, {'question': 'What is an example of scraping?', 'snippet': 'Web scraping refers to the extraction of web data on to a format that is more useful for the user. For example, you might scrape product information from an ecommerce website onto an excel spreadsheet. Although web scraping can be done manually, in most cases, you might be better off using an automated tool.', 'date': 'Feb 1, 2022'}, {'question': 'How do I scrape 99% of websites?', 'snippet': 'Bypass Anti-Bot Protections Websites often block scraping attempts using TLS fingerprinting or IP monitoring. To avoid these roadblocks: Use rotating proxies to mimic real users. Set headers, such as a custom User-Agent , to make requests look like they come from a browser.', 'date': 'Jan 15, 2025'}], 'related_searches': [{'block_position': '3', 'query': 'I want to know scrapeless javascript', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=I+want+to+know+scrapeless+javascript&sa=X&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q1QJ6BAg-EAE'}, {'block_position': '3', 'query': 'I want to know scrapeless github', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=I+want+to+know+scrapeless+github&sa=X&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q1QJ6BAg_EAE'}, {'block_position': '3', 'query': 'I want to know scrapeless html', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=I+want+to+know+scrapeless+html&sa=X&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q1QJ6BAg8EAE'}, {'block_position': '3', 'query': 'Zenrow', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Zenrow&sa=X&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q1QJ6BAg7EAE'}, {'block_position': '3', 'query': 'ZenRows tutorial', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=ZenRows+tutorial&sa=X&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q1QJ6BAg3EAE'}, {'block_position': '3', 'query': 'Zenrows screenshot', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Zenrows+screenshot&sa=X&ved=2ahUKEwiLzOej3oOPAxXGhIkEHcj2AT4Q1QJ6BAgrEAE'}], 'search_information': {'organic_results_state': 'Results for exact spelling', 'query_displayed': 'I want to know Scrapeless', 'total_results': 0, 'time_taken_displayed': ''}}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleSearchTool\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleSearchTool()\n",
+ "\n",
+ "# Basic usage\n",
+ "result = tool.invoke(\"I want to know Scrapeless\")\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "d6e73897",
+ "metadata": {},
+ "source": [
+ "#### Advanced Usage with Parameters"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "f90e33a7",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'inline_videos': [{'position': 1, 'title': 'How This AI Tool Makes Web Scraping Cheaper & 10x ...', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc', 'channel': 'SkillCurb', 'duration': '7:44', 'platform': 'YouTube', 'key_moments': [{'time': '00:00', 'title': 'Introduction', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=0', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQa-iXYOifpWDZZJLq8V45waX8C_mRTvar2rFgKASqipA&s'}, {'time': '00:20', 'title': 'Overview', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=20', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQ2I1-L6_cnRoz_nbhUYcLsG8KOjJL8aM2tMZAez_zigg&s'}, {'time': '01:20', 'title': 'How to Use SERP API', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=80', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRettKVpkN7yFSzZBeYPNhmhTdhzmjSJ3p4vTP9oY5VmA&s'}, {'time': '03:30', 'title': 'Additional Features', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=210', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRBSPraPiDau9kIyxZbygn_csUKH3q8Eop_E8jzYTyBOg&s'}, {'time': '07:05', 'title': 'Final Thoughts', 'link': 'https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=425', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSiSXSUSq3hJH7XOukFzMfXWj7iVcb-F2b9Lwug0KjXHQ&s'}]}, {'position': 2, 'title': '[100% DONE] How to Bypass Cloudflare | Fast & Secure ...', 'link': 'https://www.youtube.com/watch?v=5brhS7FpcuE', 'channel': 'Daniel | Tech & Data', 'duration': '10:10', 'platform': 'YouTube', 'key_moments': [{'time': '00:00', 'title': 'Scrapeless Review', 'link': 'https://www.youtube.com/watch?v=5brhS7FpcuE&t=0', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcT5fwopDijht5tfX5x2M2vdZ3EmXYZdSSUq-1aVO_f1uw&s'}, {'time': '00:41', 'title': 'What Is Scrapeless?', 'link': 'https://www.youtube.com/watch?v=5brhS7FpcuE&t=41', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRym_G7tzv4Oykpw1pJliD4-eTwdbS22UqSooTUJcGYGQ&s'}, {'time': '02:38', 'title': 'How to Bypass Cloudflare When Web Scraping Using Scrapeless Scraping Browser', 'link': 'https://www.youtube.com/watch?v=5brhS7FpcuE&t=158', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSG0OEEi89Kvgcje0HSledZVZG1XGF4VevxM0C_z1MPsw&s'}, {'time': '08:57', 'title': 'Final Thoughts', 'link': 'https://www.youtube.com/watch?v=5brhS7FpcuE&t=537', 'thumbnail': 'https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQaQKuz16Q7C5xPWWanLZYCeEo2VPAlj3FpKWXQ6lSbEg&s'}]}, {'position': 3, 'title': 'Scrapeless + N8N + Cline,Roo,Kilo : This CRAZY DEEP ...', 'link': 'https://www.youtube.com/watch?v=mDhdJWyo4uY', 'channel': 'AICodeKing', 'duration': '8:04', 'platform': 'YouTube'}], 'metadata': {'engine': 'google.search', 'rawUrl': 'https://api.scrapeless.com/storage/scrapeless.scraper.google.search/a68966fbee93aebd47a3cbac944eaab5/510b01c4cc6c1a898a06ceb6c8994451_1754948948.html'}, 'organic_results': [{'position': 1, 'snippet': 'Scrapeless offers AI-powered, robust, and scalable web scraping and automation services trusted by leading enterprises. Our enterprise-grade solutions are\\xa0...', 'site_links': {'expanded': [{'title': 'Pricing', 'link': 'https://www.scrapeless.com/en/pricing', 'snippet': 'Scrapeless offers AI-powered, robust, and scalable web ...'}, {'title': 'Scraping Browser', 'link': 'https://www.scrapeless.com/en/product/scraping-browser', 'snippet': 'Scrapeless offers AI-powered, robust, and scalable web ...'}, {'title': 'Web Scraping Services', 'link': 'https://www.scrapeless.com/en/product', 'snippet': 'Experience AI-driven web scraping with Scrapeless! Try Scraping ...'}, {'title': 'Sign up', 'link': 'https://app.scrapeless.com/passport/register', 'snippet': 'Scrapeless is the best full-stack web scraping toolkit offering ...'}, {'title': 'Scraping API', 'link': 'https://www.scrapeless.com/en/product/scraping-api', 'snippet': 'Effortlessly extract structured data at scale from popular websites ...'}, {'title': 'More results from scrapeless.com\\xa0»', 'link': '/search?q=Scrapeless+site:scrapeless.com&sca_esv=81a4eb9d017737df&gl=us&hl=en&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QrAN6BAgdEAE'}]}, 'source': 'ScrapelessScrapeless'}, {'position': 2, 'title': 'Scrapeless', 'link': 'https://github.com/scrapeless-ai', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://github.com/scrapeless-ai&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QFnoECBgQAQ', 'favicon': '', 'snippet': 'Scrapeless.com offers an enterprise-grade, AI-driven web scraping toolkit designed to help businesses efficiently access public web data.', 'snippet_highlighted_words': ['enterprise-grade, AI-driven web scraping toolkit'], 'source': 'GitHub'}, {'position': 3, 'title': 'Scrapeless', 'link': 'https://www.linkedin.com/company/scrapeless', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.linkedin.com/company/scrapeless&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QFnoECEkQAQ', 'favicon': '', 'snippet': 'Scrapeless has developed a powerful and flexible web scraping toolkit specifically designed for enterprises, enabling them to easily and efficiently access\\xa0...', 'snippet_highlighted_words': ['Scrapeless'], 'source': 'LinkedIn\\xa0·\\xa0Scrapeless'}, {'position': 4, 'title': 'Scrapeless - Crunchbase Company Profile & Funding', 'link': 'https://www.crunchbase.com/organization/scrapeless', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.crunchbase.com/organization/scrapeless&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QFnoECEoQAQ', 'favicon': '', 'snippet': 'Scrapeless is an innovative web scraping and data extraction company specializing in providing powerful, scalable, and flexible solutions for enterprises.', 'snippet_highlighted_words': ['an innovative web scraping and data extraction company'], 'source': 'Crunchbase'}, {'position': 5, 'title': 'Scrapeless Review: Hands-on Testing and My Opinion', 'link': 'https://geekflare.com/proxy/scrapeless-review/', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://geekflare.com/proxy/scrapeless-review/&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QFnoECE4QAQ', 'favicon': '', 'snippet': 'Apr 24, 2025 — Apr 24, 2025Scrapeless is an excellent choice for businesses, developers, and data professionals who need a powerful, AI-driven web scraping solution.', 'snippet_highlighted_words': ['Scrapeless'], 'source': 'Geekflare'}, {'position': 6, 'title': 'Read Customer Service Reviews of scrapeless.com', 'link': 'https://www.trustpilot.com/review/scrapeless.com', 'redirect_link': 'https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.trustpilot.com/review/scrapeless.com&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QFnoECEwQAQ', 'favicon': '', 'snippet': \"Do you agree with Scrapeless's 4-star rating? Check out what 34 people have written so far, and share your own experience.\", 'snippet_highlighted_words': [\"Scrapeless's 4-star rating\"], 'source': 'Trustpilot'}], 'pagination': {'current': 1, 'next': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=10&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8NMDegQICxAS', 'other_pages': {'2': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=10&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAE', '3': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=20&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAG', '4': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=30&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAI', '5': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=40&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAK', '6': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=50&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAM', '7': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=60&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAO', '8': 'https://www.google.com/search?q=Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=U2WaaKKoDKzc1sQPjs7p-Ak&start=70&sa=N&sstk=Ac65TH5XpLz_rkhp583Of7nAh8bK-zcYHvYSNYVUJ_2S4NE58P8BKBgFGrJXmXbv5UCifNmvb1_ZzdOwJ0hVniny2xj6gAe8z5cK_w&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q8tMDegQICxAQ'}}, 'related_questions': [{'question': 'What is scraping?', 'snippet': 'Scraping, in the context of computers, refers to the automated extraction of data from websites or other sources.'}, {'question': 'How much do web scrapers get paid?', 'snippet': 'Web scrapers can earn varying amounts depending on their experience, location, and the specific nature of their work (freelance vs. full-time, etc.).'}, {'question': 'Can sites detect web scraping?', 'snippet': \"The number one way sites detect web scrapers is by examining their IP address and tracking how it's behaving. If the server finds a pattern, strange behaviors, or an impossible request frequency (to name a few) for a real user, the server can block the IP address from accessing the site again.\"}, {'question': 'Can AutoGPT do web scraping?', 'snippet': 'Web Scraping: AutoGPT can extract data from websites through web scraping. It can gather the necessary information based on the specific task you want to complete.', 'date': 'Nov 29, 2023'}], 'related_searches': [{'block_position': '3', 'query': 'Scrapeless Shopee', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+Shopee&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAhNEAE'}, {'block_position': '3', 'query': 'Scrapeless pricing', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+pricing&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAhIEAE'}, {'block_position': '3', 'query': 'Scrapeless app', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+app&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAhEEAE'}, {'block_position': '3', 'query': 'Scrapeless software', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+software&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAhBEAE'}, {'block_position': '3', 'query': 'Scrapeless mcp server', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+mcp+server&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAg_EAE'}, {'block_position': '3', 'query': 'ScrapeGraphAI', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=ScrapeGraphAI&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAg8EAE'}, {'block_position': '3', 'query': 'Octoparse', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Octoparse&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAg6EAE'}, {'block_position': '3', 'query': 'N8n', 'link': 'htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=N8n&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8Q1QJ6BAg4EAE'}], 'search_information': {'organic_results_state': 'Results for exact spelling', 'query_displayed': 'Scrapeless', 'total_results': 0, 'time_taken_displayed': ''}, 'things_to_know': {'buttons': [{'text': 'Legal Status', 'subtitle': 'How legal is web scraping?', 'displayed_link': 'https://', 'search_link': '/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=how legal is web scraping&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QrooIegQIHBAJ'}, {'text': 'Instructions', 'subtitle': 'how to web scraping', 'displayed_link': 'https://'}, {'text': 'Benefits', 'subtitle': 'web scraping benefits', 'displayed_link': 'https://', 'search_link': '/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=web scraping benefits&sa=X&ved=2ahUKEwjineOk3oOPAxUsrpUCHQ5nGp8QrooIegQIHBAZ'}]}}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleSearchTool\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleSearchTool()\n",
+ "\n",
+ "# Advanced usage\n",
+ "result = tool.invoke({\"q\": \"Scrapeless\", \"hl\": \"en\", \"google_domain\": \"google.com\"})\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "659f9fbd-6fcf-445f-aa8c-72d8e60154bd",
+ "metadata": {},
+ "source": [
+ "#### Use within an agent"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "af3123ad-7a02-40e5-b58e-7d56e23e5830",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "================================\u001b[1m Human Message \u001b[0m=================================\n",
+ "\n",
+ "I want to what is Scrapeless\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "Tool Calls:\n",
+ " scrapeless_deepserp_google_search (call_wqdCO6YiU1bkxpSuqXaht87f)\n",
+ " Call ID: call_wqdCO6YiU1bkxpSuqXaht87f\n",
+ " Args:\n",
+ " q: What is Scrapeless\n",
+ "=================================\u001b[1m Tool Message \u001b[0m=================================\n",
+ "Name: scrapeless_deepserp_google_search\n",
+ "\n",
+ "{\"inline_videos\": [{\"position\": 1, \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc\", \"key_moments\": [{\"time\": \"00:00\", \"title\": \"Introduction\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=0\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQa-iXYOifpWDZZJLq8V45waX8C_mRTvar2rFgKASqipA&s\"}, {\"time\": \"00:20\", \"title\": \"Overview\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=20\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQ2I1-L6_cnRoz_nbhUYcLsG8KOjJL8aM2tMZAez_zigg&s\"}, {\"time\": \"01:20\", \"title\": \"How to Use SERP API\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=80\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRettKVpkN7yFSzZBeYPNhmhTdhzmjSJ3p4vTP9oY5VmA&s\"}, {\"time\": \"03:30\", \"title\": \"Additional Features\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=210\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRBSPraPiDau9kIyxZbygn_csUKH3q8Eop_E8jzYTyBOg&s\"}, {\"time\": \"07:05\", \"title\": \"Final Thoughts\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=425\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSiSXSUSq3hJH7XOukFzMfXWj7iVcb-F2b9Lwug0KjXHQ&s\"}]}], \"knowledge_graph\": {\"web_results\": [{\"link\": \"https://www.google.comhttps://en.wikipedia.org/wiki/Web_scraping\"}], \"source\": {}}, \"metadata\": {\"engine\": \"google.search\", \"rawUrl\": \"https://api.scrapeless.com/storage/scrapeless.scraper.google.search/49b6a0d866d5d0f0250ba8276a0c0661/44192a678eb7f46a8768a7da6f05870d_1754948950.html\"}, \"organic_results\": [{\"position\": 1, \"title\": \"Scrapeless: Effortless Web Scraping Toolkit\", \"link\": \"https://www.scrapeless.com/\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.scrapeless.com/&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECBgQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless offers AI-powered, robust, and scalable web scraping and automation services trusted by leading enterprises. Our enterprise-grade solutions are ...\", \"snippet_highlighted_words\": [\"AI-powered, robust, and scalable web scraping and automation services\"], \"site_links\": {\"inline\": [{\"title\": \"Scrapeless\", \"link\": \"https://app.scrapeless.com/\"}, {\"title\": \"Sign up\", \"link\": \"https://app.scrapeless.com/passport/register\"}, {\"title\": \"Pricing\", \"link\": \"https://www.scrapeless.com/en/pricing\"}, {\"title\": \"Scraping API\", \"link\": \"https://www.scrapeless.com/en/product/scraping-api\"}]}, \"source\": \"Scrapeless\"}, {\"position\": 2, \"title\": \"Scrapeless — Unlock a New Era of Data Scraping!\", \"link\": \"https://www.scrapeless.com/en/blog/scrapeless-web-scraping-toolkit\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.scrapeless.com/en/blog/scrapeless-web-scraping-toolkit&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECBkQAQ\", \"favicon\": \"\", \"snippet\": \"Jan 6, 2025 — Jan 6, 2025Scrapeless is an AI-powered web scraping toolkit designed for efficient and seamless extraction of publicly available web data.\", \"snippet_highlighted_words\": [\"an AI-powered web scraping toolkit\"], \"source\": \"Scrapeless\"}, {\"position\": 3, \"title\": \"Scrapeless\", \"link\": \"https://www.futuretools.io/tools/scrapeless\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.futuretools.io/tools/scrapeless&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEMQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless is an AI-powered web scraping toolkit that helps businesses extract data from websites efficiently, even from those with complex features or ...\", \"snippet_highlighted_words\": [\"Scrapeless\"], \"source\": \"Future Tools\"}, {\"position\": 4, \"title\": \"Scrapeless - Crunchbase Company Profile & Funding\", \"link\": \"https://www.crunchbase.com/organization/scrapeless\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.crunchbase.com/organization/scrapeless&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEIQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless is an innovative web scraping and data extraction company specializing in providing powerful, scalable, and flexible solutions for enterprises.\", \"snippet_highlighted_words\": [\"an innovative web scraping and data extraction company\"], \"source\": \"Crunchbase\"}, {\"position\": 5, \"title\": \"Meet Scrapeless: The Most Cost-effective No-Code Web ...\", \"link\": \"https://www.reddit.com/r/AIinBusinessNews/comments/1hylunb/meet_scrapeless_the_most_costeffective_nocode_web/\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.reddit.com/r/AIinBusinessNews/comments/1hylunb/meet_scrapeless_the_most_costeffective_nocode_web/&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEYQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless is a no-code, cost-effective, all-in-one AI web scraping toolkit that simplifies data extraction for developers and businesses of all ...\", \"snippet_highlighted_words\": [\"Scrapeless is a no-code, cost-effective, all-in-one AI web scraping toolkit\"], \"source\": \"Reddit · r/AIinBusinessNews\"}, {\"position\": 6, \"title\": \"Scrapeless - Ai Tool Details & Features\", \"link\": \"https://airespo.com/ai-tools/scrapeless/\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://airespo.com/ai-tools/scrapeless/&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEEQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless is an AI scraping platform that extracts web data at scale using smart APIs, real browser tech, and full compliance.\", \"snippet_highlighted_words\": [\"an AI scraping platform\"], \"source\": \"AIRespo\"}, {\"position\": 7, \"title\": \"Scrapeless Review: Hands-on Testing and My Opinion\", \"link\": \"https://geekflare.com/proxy/scrapeless-review/\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://geekflare.com/proxy/scrapeless-review/&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEQQAQ\", \"favicon\": \"\", \"snippet\": \"Apr 24, 2025 — Apr 24, 2025Scrapeless has a built-in headless browser that mimics human behavior when scraping data. It renders JavaScript to load dynamic content, ...\", \"snippet_highlighted_words\": [\"Scrapeless has a built-in headless browser that mimics human behavior\"], \"source\": \"Geekflare\"}, {\"position\": 8, \"title\": \"Scrapeless\", \"link\": \"https://github.com/scrapeless-ai\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://github.com/scrapeless-ai&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEcQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless.com offers an enterprise-grade, AI-driven web scraping toolkit designed to help businesses efficiently access public web data.\", \"snippet_highlighted_words\": [\"enterprise-grade, AI-driven web scraping toolkit\"], \"site_links\": {\"inline\": [{\"title\": \"What\", \"link\": \"https://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=%22What%22+is+Scrapeless&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ5t4CegQIWRAB\"}]}, \"source\": \"GitHub\", \"missing\": [\"What\"]}, {\"position\": 9, \"title\": \"Scrapeless\", \"link\": \"https://www.linkedin.com/company/scrapeless\", \"redirect_link\": \"https://www.google.com/url?sa=t&source=web&rct=j&opi=89978449&url=https://www.linkedin.com/company/scrapeless&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQFnoECEUQAQ\", \"favicon\": \"\", \"snippet\": \"Scrapeless has developed a powerful and flexible web scraping toolkit specifically designed for enterprises, enabling them to easily and efficiently access ...\", \"snippet_highlighted_words\": [\"Scrapeless\"], \"source\": \"LinkedIn · Scrapeless\"}], \"pagination\": {\"current\": 1, \"next\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=10&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8NMDegQIDRAU\", \"other_pages\": {\"2\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=10&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAE\", \"3\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=20&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAG\", \"4\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=30&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAI\", \"5\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=40&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAK\", \"6\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=50&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAM\", \"7\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=60&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAO\", \"8\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=70&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAQ\", \"9\": \"https://www.google.com/search?q=What+is+Scrapeless&sca_esv=81a4eb9d017737df&gl=us&hl=en&ei=VmWaaPbQD9ewhbIP-6WWiQE&start=80&sa=N&sstk=Ac65TH4zqLfkUk3LtzFgh4CMrmKB7zPojfFEfSbrmRj4OoUbfBgGNJl2QVq60dWKPnpU61FCErTmf6dEJJPY7nqFuY_nRMI0jBmjYg&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ8tMDegQIDRAS\"}}, \"related_questions\": [{\"question\": \"What is the purpose of scraping?\", \"snippet\": \"Scraping is a specialized form of manual therapy used in physical therapy to treat soft tissue injuries and dysfunctions. Scraping (aka Instrument-Assisted Soft Tissue Mobilization (IASTM) and the Graston Technique) is a form of manual therapy used in physical therapy to treat soft tissue injuries and dysfunctions.\", \"date\": \"Sep 5, 2024\"}, {\"question\": \"Is scraping a site illegal?\", \"snippet\": \"There are no specific laws prohibiting web scraping, and many companies employ it in legitimate ways to gain data-driven insights. However, there can be situations where other laws or regulations may come into play and make web scraping illegal.\", \"date\": \"Jan 7, 2025\"}, {\"question\": \"What is scraping in simple words?\", \"snippet\": \"the act of removing the surface from something using a sharp edge or something rough : Use techniques that generate less dust, such as wet sanding or scraping.\", \"date\": \"6 days ago\"}, {\"question\": \"What is an example of scraping?\", \"snippet\": \"Web scraping refers to the extraction of web data on to a format that is more useful for the user. For example, you might scrape product information from an ecommerce website onto an excel spreadsheet. Although web scraping can be done manually, in most cases, you might be better off using an automated tool.\", \"date\": \"Feb 1, 2022\"}], \"related_searches\": [{\"block_position\": \"2\", \"query\": \"What is scrapeless tools\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=What+is+scrapeless+tools&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAhaEAE\"}, {\"block_position\": \"2\", \"query\": \"What is scrapeless app\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=What+is+scrapeless+app&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAhREAE\"}, {\"block_position\": \"2\", \"query\": \"What is scrapeless api\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=What+is+scrapeless+api&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAhKEAE\"}, {\"block_position\": \"2\", \"query\": \"Scrapeless github\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+github&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAhJEAE\"}, {\"block_position\": \"2\", \"query\": \"Scrapeless mcp server\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Scrapeless+mcp+server&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAhIEAE\"}, {\"block_position\": \"2\", \"query\": \"ZenRows vs Selenium\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=ZenRows+vs+Selenium&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAhAEAE\"}, {\"block_position\": \"2\", \"query\": \"ZenRows documentation\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=ZenRows+documentation&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAg_EAE\"}, {\"block_position\": \"2\", \"query\": \"Python ZenRows\", \"link\": \"htts://www.google.com/search?sca_esv=81a4eb9d017737df&gl=us&hl=en&q=Python+ZenRows&sa=X&ved=2ahUKEwj2052m3oOPAxVXWEEAHfuSJREQ1QJ6BAg-EAE\"}], \"search_information\": {\"organic_results_state\": \"Results for exact spelling\", \"query_displayed\": \"What is Scrapeless\", \"total_results\": 0, \"time_taken_displayed\": \"\"}, \"video_results\": [{\"key_moments\": [{\"time\": \"00:00\", \"title\": \"Introduction\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=0\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQa-iXYOifpWDZZJLq8V45waX8C_mRTvar2rFgKASqipA&s\"}, {\"time\": \"00:20\", \"title\": \"Overview\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=20\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQ2I1-L6_cnRoz_nbhUYcLsG8KOjJL8aM2tMZAez_zigg&s\"}, {\"time\": \"01:20\", \"title\": \"How to Use SERP API\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=80\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRettKVpkN7yFSzZBeYPNhmhTdhzmjSJ3p4vTP9oY5VmA&s\"}, {\"time\": \"03:30\", \"title\": \"Additional Features\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=210\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcRBSPraPiDau9kIyxZbygn_csUKH3q8Eop_E8jzYTyBOg&s\"}, {\"time\": \"07:05\", \"title\": \"Final Thoughts\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc&t=425\", \"thumbnail\": \"https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcSiSXSUSq3hJH7XOukFzMfXWj7iVcb-F2b9Lwug0KjXHQ&s\"}], \"position\": 1, \"title\": \"How This AI Tool Makes Web Scraping Cheaper & 10x ...\", \"link\": \"https://www.youtube.com/watch?v=4xQ1UVxW0Pc\", \"snippet\": \"scrapeless is an all-in-one data extraction platform offering a scraping browser which is a headless browser for bypassing anti-bolt mayors.\", \"duration\": \"7:44\", \"rich_snippet\": {\"top\": {\"detected_extensions\": {}, \"extensions\": []}}, \"video_link\": \"https://encrypted-vtbn0.gstatic.com/video?q=tbn:ANd9GcT8EfwcqzUeELPUtULSJ5LC7wWLZgMQA-oN2Q\"}]}\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "\n",
+ "Scrapeless is an AI-powered web scraping toolkit designed for efficient and seamless extraction of publicly available web data. It offers enterprise-grade solutions that are AI-powered, robust, and scalable for web scraping and automation services. You can learn more about Scrapeless on their [official website](https://www.scrapeless.com/). Additionally, you can watch a video that provides an overview of Scrapeless and its features [here](https://www.youtube.com/watch?v=4xQ1UVxW0Pc).\n",
+ "\n",
+ "If you need more detailed information or have specific questions about Scrapeless, feel free to explore the provided links for a deeper understanding.\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_openai import ChatOpenAI\n",
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleSearchTool\n",
+ "from langgraph.prebuilt import create_react_agent\n",
+ "\n",
+ "llm = ChatOpenAI()\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleSearchTool()\n",
+ "\n",
+ "# Use the tool with an agent\n",
+ "tools = [tool]\n",
+ "agent = create_react_agent(llm, tools)\n",
+ "\n",
+ "for chunk in agent.stream(\n",
+ " {\"messages\": [(\"human\", \"I want to what is Scrapeless\")]}, stream_mode=\"values\"\n",
+ "):\n",
+ " chunk[\"messages\"][-1].pretty_print()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "769b2246",
+ "metadata": {},
+ "source": [
+ "### ScrapelessDeepSerpGoogleTrendsTool\n",
+ "\n",
+ "#### Basic Usage"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": 5,
+ "id": "ca993de7",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'parameters': {'engine': 'google.trends.search', 'hl': 'en', 'data_type': 'INTEREST_OVER_TIME', 'tz': '0', 'cat': '0', 'date': 'today 1-m', 'q': 'Funny 2048,negamon monster trainer'}, 'interest_over_time': {'timeline_data': [{'date': 'Jul 11, 2025', 'timestamp': '1752192000', 'value': [0, 0]}, {'date': 'Jul 12, 2025', 'timestamp': '1752278400', 'value': [0, 0]}, {'date': 'Jul 13, 2025', 'timestamp': '1752364800', 'value': [0, 0]}, {'date': 'Jul 14, 2025', 'timestamp': '1752451200', 'value': [0, 0]}, {'date': 'Jul 15, 2025', 'timestamp': '1752537600', 'value': [0, 0]}, {'date': 'Jul 16, 2025', 'timestamp': '1752624000', 'value': [0, 0]}, {'date': 'Jul 17, 2025', 'timestamp': '1752710400', 'value': [0, 0]}, {'date': 'Jul 18, 2025', 'timestamp': '1752796800', 'value': [0, 0]}, {'date': 'Jul 19, 2025', 'timestamp': '1752883200', 'value': [0, 0]}, {'date': 'Jul 20, 2025', 'timestamp': '1752969600', 'value': [0, 0]}, {'date': 'Jul 21, 2025', 'timestamp': '1753056000', 'value': [0, 0]}, {'date': 'Jul 22, 2025', 'timestamp': '1753142400', 'value': [0, 0]}, {'date': 'Jul 23, 2025', 'timestamp': '1753228800', 'value': [0, 0]}, {'date': 'Jul 24, 2025', 'timestamp': '1753315200', 'value': [0, 0]}, {'date': 'Jul 25, 2025', 'timestamp': '1753401600', 'value': [0, 0]}, {'date': 'Jul 26, 2025', 'timestamp': '1753488000', 'value': [0, 0]}, {'date': 'Jul 27, 2025', 'timestamp': '1753574400', 'value': [0, 0]}, {'date': 'Jul 28, 2025', 'timestamp': '1753660800', 'value': [0, 0]}, {'date': 'Jul 29, 2025', 'timestamp': '1753747200', 'value': [0, 0]}, {'date': 'Jul 30, 2025', 'timestamp': '1753833600', 'value': [0, 0]}, {'date': 'Jul 31, 2025', 'timestamp': '1753920000', 'value': [0, 0]}, {'date': 'Aug 1, 2025', 'timestamp': '1754006400', 'value': [0, 0]}, {'date': 'Aug 2, 2025', 'timestamp': '1754092800', 'value': [0, 0]}, {'date': 'Aug 3, 2025', 'timestamp': '1754179200', 'value': [0, 0]}, {'date': 'Aug 4, 2025', 'timestamp': '1754265600', 'value': [0, 0]}, {'date': 'Aug 5, 2025', 'timestamp': '1754352000', 'value': [0, 0]}, {'date': 'Aug 6, 2025', 'timestamp': '1754438400', 'value': [0, 0]}, {'date': 'Aug 7, 2025', 'timestamp': '1754524800', 'value': [0, 0]}, {'date': 'Aug 8, 2025', 'timestamp': '1754611200', 'value': [0, 0]}, {'date': 'Aug 9, 2025', 'timestamp': '1754697600', 'value': [0, 0]}, {'date': 'Aug 10, 2025', 'timestamp': '1754784000', 'value': [0, 100]}, {'date': 'Aug 11, 2025', 'timestamp': '1754870400', 'value': [0, 0]}], 'averages': [{'value': 0}, {'value': 3}], 'isPartial': True}}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleTrendsTool\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleTrendsTool()\n",
+ "\n",
+ "# Basic usage\n",
+ "result = tool.invoke(\"Funny 2048, negamon monster trainer\")\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "1a4db36f",
+ "metadata": {},
+ "source": [
+ "#### Advanced Usage with Parameters"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "42c83c46",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "{'parameters': {'engine': 'google.trends.search', 'hl': 'en', 'data_type': 'RELATED_TOPICS', 'tz': '0', 'cat': '0', 'date': 'today 1-m', 'q': 'Scrapeless'}, 'related_topics': {}}\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleTrendsTool\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleTrendsTool()\n",
+ "\n",
+ "# Advanced usage\n",
+ "result = tool.invoke({\"q\": \"Scrapeless\", \"data_type\": \"related_topics\", \"hl\": \"en\"})\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "7dde00ff",
+ "metadata": {},
+ "source": [
+ "#### Use within an agent"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "f6ca1aff",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "================================\u001b[1m Human Message \u001b[0m=================================\n",
+ "\n",
+ "I want to know the iPhone keyword trends\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "Tool Calls:\n",
+ " scrapeless_deepserp_google_trends (call_i7vhiCvbfFjjZTJXZMvD5iQM)\n",
+ " Call ID: call_i7vhiCvbfFjjZTJXZMvD5iQM\n",
+ " Args:\n",
+ " q: iPhone\n",
+ " data_type: interest_over_time\n",
+ " date: today 1-m\n",
+ " hl: en\n",
+ "=================================\u001b[1m Tool Message \u001b[0m=================================\n",
+ "Name: scrapeless_deepserp_google_trends\n",
+ "\n",
+ "{\"parameters\": {\"engine\": \"google.trends.search\", \"hl\": \"en\", \"data_type\": \"INTEREST_OVER_TIME\", \"tz\": \"0\", \"cat\": \"0\", \"date\": \"today 1-m\", \"q\": \"iPhone\"}, \"interest_over_time\": {\"timeline_data\": [{\"date\": \"Jul 11, 2025\", \"timestamp\": \"1752192000\", \"value\": [89]}, {\"date\": \"Jul 12, 2025\", \"timestamp\": \"1752278400\", \"value\": [93]}, {\"date\": \"Jul 13, 2025\", \"timestamp\": \"1752364800\", \"value\": [91]}, {\"date\": \"Jul 14, 2025\", \"timestamp\": \"1752451200\", \"value\": [87]}, {\"date\": \"Jul 15, 2025\", \"timestamp\": \"1752537600\", \"value\": [85]}, {\"date\": \"Jul 16, 2025\", \"timestamp\": \"1752624000\", \"value\": [81]}, {\"date\": \"Jul 17, 2025\", \"timestamp\": \"1752710400\", \"value\": [86]}, {\"date\": \"Jul 18, 2025\", \"timestamp\": \"1752796800\", \"value\": [86]}, {\"date\": \"Jul 19, 2025\", \"timestamp\": \"1752883200\", \"value\": [91]}, {\"date\": \"Jul 20, 2025\", \"timestamp\": \"1752969600\", \"value\": [99]}, {\"date\": \"Jul 21, 2025\", \"timestamp\": \"1753056000\", \"value\": [87]}, {\"date\": \"Jul 22, 2025\", \"timestamp\": \"1753142400\", \"value\": [87]}, {\"date\": \"Jul 23, 2025\", \"timestamp\": \"1753228800\", \"value\": [85]}, {\"date\": \"Jul 24, 2025\", \"timestamp\": \"1753315200\", \"value\": [84]}, {\"date\": \"Jul 25, 2025\", \"timestamp\": \"1753401600\", \"value\": [87]}, {\"date\": \"Jul 26, 2025\", \"timestamp\": \"1753488000\", \"value\": [90]}, {\"date\": \"Jul 27, 2025\", \"timestamp\": \"1753574400\", \"value\": [92]}, {\"date\": \"Jul 28, 2025\", \"timestamp\": \"1753660800\", \"value\": [86]}, {\"date\": \"Jul 29, 2025\", \"timestamp\": \"1753747200\", \"value\": [92]}, {\"date\": \"Jul 30, 2025\", \"timestamp\": \"1753833600\", \"value\": [94]}, {\"date\": \"Jul 31, 2025\", \"timestamp\": \"1753920000\", \"value\": [90]}, {\"date\": \"Aug 1, 2025\", \"timestamp\": \"1754006400\", \"value\": [89]}, {\"date\": \"Aug 2, 2025\", \"timestamp\": \"1754092800\", \"value\": [92]}, {\"date\": \"Aug 3, 2025\", \"timestamp\": \"1754179200\", \"value\": [92]}, {\"date\": \"Aug 4, 2025\", \"timestamp\": \"1754265600\", \"value\": [96]}, {\"date\": \"Aug 5, 2025\", \"timestamp\": \"1754352000\", \"value\": [87]}, {\"date\": \"Aug 6, 2025\", \"timestamp\": \"1754438400\", \"value\": [90]}, {\"date\": \"Aug 7, 2025\", \"timestamp\": \"1754524800\", \"value\": [94]}, {\"date\": \"Aug 8, 2025\", \"timestamp\": \"1754611200\", \"value\": [95]}, {\"date\": \"Aug 9, 2025\", \"timestamp\": \"1754697600\", \"value\": [98]}, {\"date\": \"Aug 10, 2025\", \"timestamp\": \"1754784000\", \"value\": [100]}, {\"date\": \"Aug 11, 2025\", \"timestamp\": \"1754870400\", \"value\": [89]}], \"isPartial\": true}}\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "\n",
+ "The trend for the keyword \"iPhone\" over the past month shows a fluctuating interest level. Here are some data points:\n",
+ "\n",
+ "- July 11, 2025: Interest level 89\n",
+ "- July 20, 2025: Interest level 99\n",
+ "- July 31, 2025: Interest level 90\n",
+ "- August 10, 2025: Interest level 100\n",
+ "\n",
+ "The interest seems to have peaked on August 10, 2025, with an interest level of 100.\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_openai import ChatOpenAI\n",
+ "from langchain_scrapeless import ScrapelessDeepSerpGoogleTrendsTool\n",
+ "from langgraph.prebuilt import create_react_agent\n",
+ "\n",
+ "llm = ChatOpenAI()\n",
+ "\n",
+ "tool = ScrapelessDeepSerpGoogleTrendsTool()\n",
+ "\n",
+ "# Use the tool with an agent\n",
+ "tools = [tool]\n",
+ "agent = create_react_agent(llm, tools)\n",
+ "\n",
+ "for chunk in agent.stream(\n",
+ " {\"messages\": [(\"human\", \"I want to know the iPhone keyword trends\")]},\n",
+ " stream_mode=\"values\",\n",
+ "):\n",
+ " chunk[\"messages\"][-1].pretty_print()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "4ac8146c",
+ "metadata": {},
+ "source": [
+ "## API reference\n",
+ "\n",
+ "- [Scrapeless Documentation](https://docs.scrapeless.com/en/deep-serp-api/quickstart/introduction/)\n",
+ "- [Scrapeless API Reference](https://apidocs.scrapeless.com/doc-800321)"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "langchain",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.12.11"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
diff --git a/docs/docs/integrations/tools/scrapeless_universal_scraping.ipynb b/docs/docs/integrations/tools/scrapeless_universal_scraping.ipynb
new file mode 100644
index 00000000000..ec19d4b02b2
--- /dev/null
+++ b/docs/docs/integrations/tools/scrapeless_universal_scraping.ipynb
@@ -0,0 +1,339 @@
+{
+ "cells": [
+ {
+ "cell_type": "markdown",
+ "id": "a6f91f20",
+ "metadata": {},
+ "source": [
+ "# Scrapeless\n",
+ "\n",
+ "**Scrapeless** offers flexible and feature-rich data acquisition services with extensive parameter customization and multi-format export support. These capabilities empower LangChain to integrate and leverage external data more effectively. The core functional modules include:\n",
+ "\n",
+ "**DeepSerp**\n",
+ "- **Google Search**: Enables comprehensive extraction of Google SERP data across all result types.\n",
+ " - Supports selection of localized Google domains (e.g., `google.com`, `google.ad`) to retrieve region-specific search results.\n",
+ " - Pagination supported for retrieving results beyond the first page.\n",
+ " - Supports a search result filtering toggle to control whether to exclude duplicate or similar content.\n",
+ "- **Google Trends**: Retrieves keyword trend data from Google, including popularity over time, regional interest, and related searches.\n",
+ " - Supports multi-keyword comparison.\n",
+ " - Supports multiple data types: `interest_over_time`, `interest_by_region`, `related_queries`, and `related_topics`.\n",
+ " - Allows filtering by specific Google properties (Web, YouTube, News, Shopping) for source-specific trend analysis.\n",
+ "\n",
+ "**Universal Scraping**\n",
+ "- Designed for modern, JavaScript-heavy websites, allowing dynamic content extraction.\n",
+ " - Global premium proxy support for bypassing geo-restrictions and improving reliability.\n",
+ "\n",
+ "**Crawler**\n",
+ "- **Crawl**: Recursively crawl a website and its linked pages to extract site-wide content.\n",
+ " - Supports configurable crawl depth and scoped URL targeting.\n",
+ "- **Scrape**: Extract content from a single webpage with high precision.\n",
+ " - Supports \"main content only\" extraction to exclude ads, footers, and other non-essential elements.\n",
+ " - Allows batch scraping of multiple standalone URLs.\n",
+ "\n",
+ "## Overview\n",
+ "\n",
+ "### Integration details\n",
+ "\n",
+ "| Class | Package | Serializable | JS support | Package latest |\n",
+ "| :--- | :--- | :---: | :---: | :---: |\n",
+ "| [ScrapelessUniversalScrapingTool](https://pypi.org/project/langchain-scrapeless/) | [langchain-scrapeless](https://pypi.org/project/langchain-scrapeless/) | ✅ | ❌ |  |\n",
+ "\n",
+ "### Tool features\n",
+ "\n",
+ "|Native async|Returns artifact|Return data|\n",
+ "|:-:|:-:|:-:|\n",
+ "|✅|✅|html, markdown, links, metadata, structured content|\n",
+ "\n",
+ "\n",
+ "## Setup\n",
+ "\n",
+ "The integration lives in the `langchain-scrapeless` package."
+ ]
+ },
+ {
+ "cell_type": "raw",
+ "id": "ca676665",
+ "metadata": {
+ "vscode": {
+ "languageId": "raw"
+ }
+ },
+ "source": [
+ "!pip install langchain-scrapeless"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "b15e9266",
+ "metadata": {},
+ "source": [
+ "### Credentials\n",
+ "\n",
+ "You'll need a Scrapeless API key to use this tool. You can set it as an environment variable:"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "e0b178a2-8816-40ca-b57c-ccdd86dde9c9",
+ "metadata": {},
+ "outputs": [],
+ "source": [
+ "import os\n",
+ "\n",
+ "os.environ[\"SCRAPELESS_API_KEY\"] = \"your-api-key\""
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "1c97218f-f366-479d-8bf7-fe9f2f6df73f",
+ "metadata": {},
+ "source": [
+ "## Instantiation\n",
+ "\n",
+ "Here we show how to instantiate an instance of the Scrapeless Universal Scraping Tool. This tool allows you to scrape any website using a headless browser with JavaScript rendering capabilities, customizable output types, and geo-specific proxy support.\n",
+ "\n",
+ "The tool accepts the following parameters during instantiation:\n",
+ "- `url` (required, str): The URL of the website to scrape.\n",
+ "- `headless` (optional, bool): Whether to use a headless browser. Default is True.\n",
+ "- `js_render` (optional, bool): Whether to enable JavaScript rendering. Default is True.\n",
+ "- `js_wait_until` (optional, str): Defines when to consider the JavaScript-rendered page ready. Default is `'domcontentloaded'`. Options include:\n",
+ " - `load`: Wait until the page is fully loaded.\n",
+ " - `domcontentloaded`: Wait until the DOM is fully loaded.\n",
+ " - `networkidle0`: Wait until the network is idle.\n",
+ " - `networkidle2`: Wait until the network is idle for 2 seconds.\n",
+ "- `outputs` (optional, str): The specific type of data to extract from the page. Options include:\n",
+ " - `phone_numbers`\n",
+ " - `headings`\n",
+ " - `images`\n",
+ " - `audios`\n",
+ " - `videos`\n",
+ " - `links`\n",
+ " - `menus`\n",
+ " - `hashtags`\n",
+ " - `emails`\n",
+ " - `metadata`\n",
+ " - `tables`\n",
+ " - `favicon`\n",
+ "- `response_type` (optional, str): Defines the format of the response. Default is `'html'`. Options include:\n",
+ " - `html`: Return the raw HTML of the page.\n",
+ " - `plaintext`: Return the plain text content.\n",
+ " - `markdown`: Return a Markdown version of the page.\n",
+ " - `png`: Return a PNG screenshot.\n",
+ " - `jpeg`: Return a JPEG screenshot.\n",
+ "- `response_image_full_page` (optional, bool): Whether to capture and return a full-page image when using screenshot output (png or jpeg). Default is False.\n",
+ "- `selector` (optional, str): A specific CSS selector to scope scraping within a part of the page. Default is `None`.\n",
+ "- `proxy_country` (optional, str): Two-letter country code for geo-specific proxy access (e.g., `'us'`, `'gb'`, `'de'`, `'jp'`). Default is `'ANY'`."
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "74147a1a",
+ "metadata": {},
+ "source": [
+ "## Invocation\n",
+ "\n",
+ "### Basic Usage"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "65310a8b-eb0c-4d9e-a618-4f4abe2414fc",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "\n",
+ " Example Domain\n",
+ "\n",
+ " \n",
+ " \n",
+ " \n",
+ " \n",
+ "\n",
+ "\n",
+ "\n",
+ "\n",
+ "
Example Domain
\n",
+ "
This domain is for use in illustrative examples in documents. You may use this\n",
+ " domain in literature without prior coordination or asking for permission.
\n",
+ "
More information...
\n",
+ "
\n",
+ "\n",
+ "\n",
+ "\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessUniversalScrapingTool\n",
+ "\n",
+ "tool = ScrapelessUniversalScrapingTool()\n",
+ "\n",
+ "# Basic usage\n",
+ "result = tool.invoke(\"https://example.com\")\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "d6e73897",
+ "metadata": {},
+ "source": [
+ "### Advanced Usage with Parameters"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "f90e33a7",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "# Well hello there.\n",
+ "\n",
+ "Welcome to exmaple.com.\n",
+ "Chances are you got here by mistake (example.com, anyone?)\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_scrapeless import ScrapelessUniversalScrapingTool\n",
+ "\n",
+ "tool = ScrapelessUniversalScrapingTool()\n",
+ "\n",
+ "result = tool.invoke({\"url\": \"https://exmaple.com\", \"response_type\": \"markdown\"})\n",
+ "print(result)"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "659f9fbd-6fcf-445f-aa8c-72d8e60154bd",
+ "metadata": {},
+ "source": [
+ "### Use within an agent"
+ ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "af3123ad-7a02-40e5-b58e-7d56e23e5830",
+ "metadata": {},
+ "outputs": [
+ {
+ "name": "stdout",
+ "output_type": "stream",
+ "text": [
+ "================================\u001b[1m Human Message \u001b[0m=================================\n",
+ "\n",
+ "Use the scrapeless scraping tool to fetch https://www.scrapeless.com/en and extract the h1 tag.\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "Tool Calls:\n",
+ " scrapeless_universal_scraping (call_jBrvMVL2ixhvf6gklhi7Gqtb)\n",
+ " Call ID: call_jBrvMVL2ixhvf6gklhi7Gqtb\n",
+ " Args:\n",
+ " url: https://www.scrapeless.com/en\n",
+ " outputs: headings\n",
+ "=================================\u001b[1m Tool Message \u001b[0m=================================\n",
+ "Name: scrapeless_universal_scraping\n",
+ "\n",
+ "{\"headings\":[\"Effortless Web Scraping Toolkitfor Business and Developers\",\"4.8\",\"4.5\",\"8.5\",\"A Flexible Toolkit for Accessing Public Web Data\",\"Deep SerpApi\",\"Scraping Browser\",\"Universal Scraping API\",\"Customized Services\",\"From Simple Data Scraping to Complex Anti-Bot Challenges, Scrapeless Has You Covered.\",\"Fully Compatible with Key Programming Languages and Tools\",\"Enterprise-level Data Scraping Solution\",\"Customized Data Scraping Solutions\",\"High Concurrency and High-Performance Scraping\",\"Data Cleaning and Transformation\",\"Real-Time Data Push and API Integration\",\"Data Security and Privacy Protection\",\"Enterprise-level SLA\",\"Why Scrapeless: Simplify Your Data Flow Effortlessly.\",\"Articles\",\"Organized Fresh Data\",\"Prices\",\"No need to hassle with browser maintenance\",\"Reviews\",\"Only pay for successful requests\",\"Products\",\"Fully scalable\",\"Unleash Your Competitive Edgein Data within the Industry\",\"Regulate Compliance for All Users\",\"Web Scraping Blog\",\"Scrapeless MCP Server Is Officially Live! Build Your Ultimate AI-Web Connector\",\"Product Updates | New Profile Feature\",\"How to Track Your Ranking on ChatGPT?\",\"For Scraping\",\"For Data\",\"For AI\",\"Top Scraper API\",\"Learning Center\",\"Legal\"]}\n",
+ "==================================\u001b[1m Ai Message \u001b[0m==================================\n",
+ "\n",
+ "The h1 tag extracted from the website https://www.scrapeless.com/en is \"Effortless Web Scraping Toolkit for Business and Developers\".\n"
+ ]
+ }
+ ],
+ "source": [
+ "from langchain_openai import ChatOpenAI\n",
+ "from langchain_scrapeless import ScrapelessUniversalScrapingTool\n",
+ "from langgraph.prebuilt import create_react_agent\n",
+ "\n",
+ "llm = ChatOpenAI()\n",
+ "\n",
+ "tool = ScrapelessUniversalScrapingTool()\n",
+ "\n",
+ "# Use the tool with an agent\n",
+ "tools = [tool]\n",
+ "agent = create_react_agent(llm, tools)\n",
+ "\n",
+ "for chunk in agent.stream(\n",
+ " {\n",
+ " \"messages\": [\n",
+ " (\n",
+ " \"human\",\n",
+ " \"Use the scrapeless scraping tool to fetch https://www.scrapeless.com/en and extract the h1 tag.\",\n",
+ " )\n",
+ " ]\n",
+ " },\n",
+ " stream_mode=\"values\",\n",
+ "):\n",
+ " chunk[\"messages\"][-1].pretty_print()"
+ ]
+ },
+ {
+ "cell_type": "markdown",
+ "id": "4ac8146c",
+ "metadata": {},
+ "source": [
+ "## API reference\n",
+ "\n",
+ "- [Scrapeless Documentation](https://docs.scrapeless.com/en/universal-scraping-api/quickstart/introduction/)\n",
+ "- [Scrapeless API Reference](https://apidocs.scrapeless.com/api-12948840)"
+ ]
+ }
+ ],
+ "metadata": {
+ "kernelspec": {
+ "display_name": "langchain",
+ "language": "python",
+ "name": "python3"
+ },
+ "language_info": {
+ "codemirror_mode": {
+ "name": "ipython",
+ "version": 3
+ },
+ "file_extension": ".py",
+ "mimetype": "text/x-python",
+ "name": "python",
+ "nbconvert_exporter": "python",
+ "pygments_lexer": "ipython3",
+ "version": "3.12.11"
+ }
+ },
+ "nbformat": 4,
+ "nbformat_minor": 5
+}
diff --git a/libs/packages.yml b/libs/packages.yml
index e1d24b1a9b3..312d9efae6f 100644
--- a/libs/packages.yml
+++ b/libs/packages.yml
@@ -716,4 +716,7 @@ packages:
- name: toolbox-langchain
repo: googleapis/mcp-toolbox-sdk-python
path: packages/toolbox-langchain
+- name: langchain-scrapeless
+ repo: scrapeless-ai/langchain-scrapeless
+ path: .