Added deeplake use case examples of the new features (#6528)

1. Added use cases of the new features
2. Did some code refactoring

Co-authored-by: Ivo Stranic <istranic@gmail.com>
2025-10-02 02:38:36 +00:00 · 2023-07-10 20:04:29 +06:00
parent 9b615022e2
commit 5debd5043e
8 changed files with 345 additions and 1286 deletions
--- a/docs/extras/use_cases/code/twitter-the-algorithm-analysis-deeplake.ipynb
+++ b/docs/extras/use_cases/code/twitter-the-algorithm-analysis-deeplake.ipynb
@@ -5,8 +5,8 @@
   "cell_type": "markdown",
   "metadata": {},
   "source": [
-    "# Analysis of Twitter the-algorithm source code with LangChain, GPT4 and Deep Lake\n",
-    "In this tutorial, we are going to use Langchain + Deep Lake with GPT4 to analyze the code base of the twitter algorithm. "
+    "# Analysis of Twitter the-algorithm source code with LangChain, GPT4 and Activeloop's Deep Lake\n",
+    "In this tutorial, we are going to use LangChain + Activeloop's Deep Lake with GPT4 to analyze the code base of the Twitter algorithm. "
   ]
  },
  {
@@ -15,7 +15,7 @@
   "metadata": {},
   "outputs": [],
   "source": [
-    "!python3 -m pip install --upgrade langchain deeplake openai tiktoken"
+    "!python3 -m pip install --upgrade langchain 'deeplake[enterprise]' openai tiktoken"
   ]
  },
  {
@@ -41,7 +41,8 @@
    "from langchain.vectorstores import DeepLake\n",
    "\n",
    "os.environ[\"OPENAI_API_KEY\"] = getpass.getpass(\"OpenAI API Key:\")\n",
-    "os.environ[\"ACTIVELOOP_TOKEN\"] = getpass.getpass(\"Activeloop Token:\")"
+    "activeloop_token = getpass.getpass(\"Activeloop Token:\")\n",
+    "os.environ[\"ACTIVELOOP_TOKEN\"] = activeloop_token"
   ]
  },
  {
@@ -149,6 +150,29 @@
    "db.add_documents(texts)"
   ]
  },
+  {
+   "attachments": {},
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "`Optional`: You can also use Deep Lake's Managed Tensor Database as a hosting service and run queries there. To do so, specify the runtime parameter as {'tensor_db': True} when creating the vector store. Queries then execute on the Managed Tensor Database rather than on the client side. This functionality does not apply to datasets stored locally or in-memory. If a vector store has already been created outside of the Managed Tensor Database, it can be transferred to the Managed Tensor Database by following the prescribed steps."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": null,
+   "metadata": {},
+   "outputs": [],
+   "source": [
+    "# username = \"davitbun\"  # replace with your username from app.activeloop.ai\n",
+    "# db = DeepLake(\n",
+    "#     dataset_path=f\"hub://{username}/twitter-algorithm\",\n",
+    "#     embedding_function=embeddings,\n",
+    "#     runtime={\"tensor_db\": True}\n",
+    "# )\n",
+    "# db.add_documents(texts)"
+   ]
+  },
  {
   "attachments": {},
   "cell_type": "markdown",
@@ -176,6 +200,7 @@
    "    dataset_path=\"hub://davitbun/twitter-algorithm\",\n",
    "    read_only=True,\n",
    "    embedding_function=embeddings,\n",
+    "    \n",
    ")"
   ]
  },