feat(rag): Support RAG SDK (#1322)

This commit is contained in:
Fangyin Cheng
2024-03-22 15:36:57 +08:00
committed by GitHub
parent e65732d6e4
commit 8a17099dd2
69 changed files with 1332 additions and 558 deletions

View File

@@ -34,4 +34,10 @@ API_KEYS - The list of API keys that are allowed to access the API. Each of the
API_KEYS=dbgpt
```
## Installation
If you use Python, you should install the official DB-GPT Client package from PyPI:
```bash
pip install "dbgpt[client]>=0.5.2"
```
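After installing, you can create a client that talks to the API server. A minimal sketch, assuming a default local deployment and the `dbgpt` key configured in `API_KEYS` above; the `api_base` address and the exact `Client` constructor arguments are assumptions and may differ in your setup:
```python
from dbgpt.client import Client

# Hypothetical local API server address and key; adjust both to your deployment.
client = Client(api_base="http://localhost:5670/api/v2", api_key="dbgpt")
```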

View File

@@ -40,7 +40,7 @@ awel-tutorial
## Adding DB-GPT Dependency
```bash
poetry add "dbgpt>=0.5.1rc0"
poetry add "dbgpt>=0.5.1"
```
## First Hello World

View File

@@ -0,0 +1,358 @@
# RAG With AWEL
In this example, we will show how to use the AWEL library to create a RAG program: we will load knowledge from a URL, store it in a vector store, and then use it to answer questions with an LLM.
Now, let us create a Python file named `first_rag_with_awel.py`.
### Install Dependencies
First, you need to install the `dbgpt` library.
```bash
pip install "dbgpt[rag]>=0.5.2"
```
### Prepare Embedding Model
To store the knowledge in a vector store, we need an embedding model. DB-GPT supports
many embedding models; here are some of them:
import Tabs from '@theme/Tabs';
import TabItem from '@theme/TabItem';
<Tabs
  defaultValue="openai"
  values={[
    {label: 'OpenAI(API)', value: 'openai'},
    {label: 'text2vec(local)', value: 'text2vec'},
    {label: 'Embedding API Server(cluster)', value: 'remote_embedding'},
  ]}>
<TabItem value="openai">
```python
from dbgpt.rag.embedding import DefaultEmbeddingFactory
embeddings = DefaultEmbeddingFactory.openai()
```
</TabItem>
<TabItem value="text2vec">
```python
from dbgpt.rag.embedding import DefaultEmbeddingFactory
embeddings = DefaultEmbeddingFactory.default("/data/models/text2vec-large-chinese")
```
</TabItem>
<TabItem value="remote_embedding">
If you have deployed a [DB-GPT cluster](/docs/installation/model_service/cluster) and the
[API server](/docs/installation/advanced_usage/OpenAI_SDK_call), you can connect to the
API server to get the embeddings.
```python
from dbgpt.rag.embedding import DefaultEmbeddingFactory
embeddings = DefaultEmbeddingFactory.remote(
    api_url="http://localhost:8100/api/v1/embeddings",
    api_key="{your_api_key}",
    model_name="text2vec",
)
```
</TabItem>
</Tabs>
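Whichever backend you choose, it can help to sanity-check the embedding function before wiring it into a DAG. A minimal sketch, assuming the returned object exposes the usual `embed_query` method:
```python
# Embed a short query and inspect the vector size as a quick smoke test.
vector = embeddings.embed_query("What is AWEL?")
print(f"Embedding dimension: {len(vector)}")
```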
### Load Knowledge And Store In Vector Store
Then we can create a DAG which loads the knowledge from a URL and stores it in a vector
store.
```python
import asyncio
import shutil
from dbgpt.core.awel import DAG
from dbgpt.rag import ChunkParameters
from dbgpt.rag.knowledge import KnowledgeType
from dbgpt.rag.operators import EmbeddingAssemblerOperator, KnowledgeOperator
from dbgpt.storage.vector_store.chroma_store import ChromaVectorConfig
from dbgpt.storage.vector_store.connector import VectorStoreConnector
# Delete the old vector store directory (/tmp/awel_rag_test_vector_store)
shutil.rmtree("/tmp/awel_rag_test_vector_store", ignore_errors=True)

vector_connector = VectorStoreConnector.from_default(
    "Chroma",
    vector_store_config=ChromaVectorConfig(
        name="test_vstore",
        persist_path="/tmp/awel_rag_test_vector_store",
    ),
    embedding_fn=embeddings,
)

with DAG("load_knowledge_dag") as knowledge_dag:
    # Load knowledge from a URL
    knowledge_task = KnowledgeOperator(knowledge_type=KnowledgeType.URL.name)
    # Split the documents into chunks, embed them, and store them in the vector store
    assembler_task = EmbeddingAssemblerOperator(
        vector_store_connector=vector_connector,
        chunk_parameters=ChunkParameters(chunk_strategy="CHUNK_BY_SIZE"),
    )
    knowledge_task >> assembler_task

chunks = asyncio.run(assembler_task.call("https://docs.dbgpt.site/docs/latest/awel/"))
print(f"Chunk length: {len(chunks)}")
```
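The assembler returns the chunks it persisted, so you can peek at the first one to verify the split looks reasonable (a minimal sketch; `content` is the same attribute the retrieval step below relies on):
```python
# Inspect the first stored chunk to check the document was split as expected.
if chunks:
    print(chunks[0].content[:200])
```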
### Retrieve Knowledge From Vector Store
Then you can retrieve the knowledge from the vector store.
```python
from dbgpt.core.awel import MapOperator
from dbgpt.rag.operators import EmbeddingRetrieverOperator
with DAG("retriever_dag") as retriever_dag:
retriever_task = EmbeddingRetrieverOperator(
top_k=3,
vector_store_connector=vector_connector,
)
content_task = MapOperator(lambda cks: "\n".join(c.content for c in cks))
retriever_task >> content_task
chunks = asyncio.run(content_task.call("What is the AWEL?"))
print(chunks)
```
### Prepare LLM
To build a RAG program, we need an LLM. Here are some of the LLMs that DB-GPT supports:
<Tabs
  defaultValue="openai"
  values={[
    {label: 'OpenAI(API)', value: 'openai'},
    {label: 'YI(API)', value: 'yi_proxy'},
    {label: 'API Server(cluster)', value: 'model_service'},
  ]}>
<TabItem value="openai">
First, you should install the `openai` library.
```bash
pip install openai
```
Then set your API key in the environment variable `OPENAI_API_KEY`.
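If you are working in a notebook, you can also set it from Python before creating the client (a minimal sketch; the placeholder is not a real key):
```python
import os

# Hypothetical placeholder; replace with your real OpenAI API key.
os.environ["OPENAI_API_KEY"] = "{your_openai_api_key}"
```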
```python
from dbgpt.model.proxy import OpenAILLMClient
llm_client = OpenAILLMClient()
```
</TabItem>
<TabItem value="yi_proxy">
You should have a YI account and get the API key from the YI official website.
First, you should install the `openai` library.
```bash
pip install openai
```
Then set your API key in the environment variable `YI_API_KEY`.
```python
from dbgpt.model.proxy import YiLLMClient
llm_client = YiLLMClient()
```
</TabItem>
<TabItem value="model_service">
If you have deployed a [DB-GPT cluster](/docs/installation/model_service/cluster) and the
[API server](/docs/installation/advanced_usage/OpenAI_SDK_call), you can connect to the
API server to use the LLM. The API is compatible with the OpenAI API, so you can use the
`OpenAILLMClient` to connect to it.
First, you should install the `openai` library.
```bash
pip install openai
```
```python
from dbgpt.model.proxy import OpenAILLMClient
llm_client = OpenAILLMClient(api_base="http://localhost:8100/api/v1/", api_key="{your_api_key}")
```
</TabItem>
</Tabs>
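Whichever client you choose, you can smoke-test it before building the DAG. A minimal sketch, assuming the client implements the standard `LLMClient` interface with an async `models()` method:
```python
import asyncio

# List the models the client can serve as a quick connectivity check.
print(asyncio.run(llm_client.models()))
```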
### Create RAG Program
Lastly, we can create a RAG program with the retrieved knowledge.
```python
from dbgpt.core.awel import InputOperator, JoinOperator, InputSource
from dbgpt.core.operators import PromptBuilderOperator, RequestBuilderOperator
from dbgpt.model.operators import LLMOperator
prompt = """Based on the known information below, provide users with professional and concise answers to their questions.
If the answer cannot be obtained from the provided content, please say:
"The information provided in the knowledge base is not sufficient to answer this question.".
It is forbidden to make up information randomly. When answering, it is best to summarize according to points 1.2.3.
known information:
{context}
question:
{question}
"""
with DAG("llm_rag_dag") as rag_dag:
input_task = InputOperator(input_source=InputSource.from_callable())
retriever_task = EmbeddingRetrieverOperator(
top_k=3,
vector_store_connector=vector_connector,
)
content_task = MapOperator(lambda cks: "\n".join(c.content for c in cks))
merge_task = JoinOperator(lambda context, question: {"context": context, "question": question})
prompt_task = PromptBuilderOperator(prompt)
# The model is gpt-3.5-turbo, you can replace it with other models.
req_build_task = RequestBuilderOperator(model="gpt-3.5-turbo")
llm_task = LLMOperator(llm_client=llm_client)
result_task = MapOperator(lambda r: r.text)
input_task >> retriever_task >> content_task >> merge_task
input_task >> merge_task
merge_task >> prompt_task >> req_build_task >> llm_task >> result_task
print(asyncio.run(result_task.call("What is the AWEL?")))
```
The output will be:
```bash
AWEL stands for Agentic Workflow Expression Language, which is a set of intelligent agent workflow expression language designed for large model application development. It simplifies the process by providing functionality and flexibility through its layered API design architecture, including the operator layer, AgentFrame layer, and DSL layer. Its goal is to allow developers to focus on business logic for LLMs applications without having to deal with intricate model and environment details.
```
Congratulations! You have created a RAG program with AWEL.
### Full Code
Now let's look at the full code of `first_rag_with_awel.py`:
```python
import asyncio
import shutil
from dbgpt.core.awel import DAG, MapOperator, InputOperator, JoinOperator, InputSource
from dbgpt.core.operators import PromptBuilderOperator, RequestBuilderOperator
from dbgpt.rag import ChunkParameters
from dbgpt.rag.knowledge import KnowledgeType
from dbgpt.rag.operators import EmbeddingAssemblerOperator, KnowledgeOperator, EmbeddingRetrieverOperator
from dbgpt.rag.embedding import DefaultEmbeddingFactory
from dbgpt.storage.vector_store.chroma_store import ChromaVectorConfig
from dbgpt.storage.vector_store.connector import VectorStoreConnector
from dbgpt.model.operators import LLMOperator
from dbgpt.model.proxy import OpenAILLMClient
# Here we use the openai embedding model, if you want to use other models, you can
# replace it according to the previous example.
embeddings = DefaultEmbeddingFactory.openai()
# Here we use the openai LLM model, if you want to use other models, you can replace
# it according to the previous example.
llm_client = OpenAILLMClient()
# Delete the old vector store directory (/tmp/awel_rag_test_vector_store)
shutil.rmtree("/tmp/awel_rag_test_vector_store", ignore_errors=True)

vector_connector = VectorStoreConnector.from_default(
    "Chroma",
    vector_store_config=ChromaVectorConfig(
        name="test_vstore",
        persist_path="/tmp/awel_rag_test_vector_store",
    ),
    embedding_fn=embeddings,
)

with DAG("load_knowledge_dag") as knowledge_dag:
    # Load knowledge from URL
    knowledge_task = KnowledgeOperator(knowledge_type=KnowledgeType.URL.name)
    assembler_task = EmbeddingAssemblerOperator(
        vector_store_connector=vector_connector,
        chunk_parameters=ChunkParameters(chunk_strategy="CHUNK_BY_SIZE"),
    )
    knowledge_task >> assembler_task

chunks = asyncio.run(assembler_task.call("https://docs.dbgpt.site/docs/latest/awel/"))
print(f"Chunk length: {len(chunks)}\n")
prompt = """Based on the known information below, provide users with professional and concise answers to their questions.
If the answer cannot be obtained from the provided content, please say:
"The information provided in the knowledge base is not sufficient to answer this question.".
It is forbidden to make up information randomly. When answering, it is best to summarize according to points 1.2.3.
known information:
{context}
question:
{question}
"""
with DAG("llm_rag_dag") as rag_dag:
input_task = InputOperator(input_source=InputSource.from_callable())
retriever_task = EmbeddingRetrieverOperator(
top_k=3,
vector_store_connector=vector_connector,
)
content_task = MapOperator(lambda cks: "\n".join(c.content for c in cks))
merge_task = JoinOperator(lambda context, question: {"context": context, "question": question})
prompt_task = PromptBuilderOperator(prompt)
# The model is gpt-3.5-turbo, you can replace it with other models.
req_build_task = RequestBuilderOperator(model="gpt-3.5-turbo")
llm_task = LLMOperator(llm_client=llm_client)
result_task = MapOperator(lambda r: r.text)
input_task >> retriever_task >> content_task >> merge_task
input_task >> merge_task
merge_task >> prompt_task >> req_build_task >> llm_task >> result_task
print(asyncio.run(result_task.call("What is the AWEL?")))
```
### Visualize DAGs
We can visualize the DAGs with the following code:
```python
knowledge_dag.visualize_dag()
rag_dag.visualize_dag()
```
If you execute the code in a Jupyter notebook, you can see the DAGs rendered inline.
```python
display(knowledge_dag.show())
display(rag_dag.show())
```
The graph of the `knowledge_dag` is:
<p align="left">
<img src={'/img/awel/cookbook/first_rag_knowledge_dag.png'} width="1000px"/>
</p>
And the graph of the `rag_dag` is:
<p align="left">
<img src={'/img/awel/cookbook/first_rag_rag_dag.png'} width="1000px"/>
</p>

View File

@@ -68,6 +68,10 @@ const sidebars = {
type: "doc",
id: "awel/cookbook/multi_round_chat_withllm"
},
{
type:"doc",
id: "awel/cookbook/first_rag_with_awel"
}
],
link: {
type: 'generated-index',

Binary file not shown.

After

Width:  |  Height:  |  Size: 86 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 185 KiB