mirror of https://github.com/csunny/DB-GPT.git synced 2025-09-12 12:37:14 +00:00

Go to file

BurnCloud.com d22023c148 ✨ feat: add BurnCloud as new AI model provider (#2890 )

Co-authored-by: Claude <noreply@anthropic.com>

2025-09-10 18:58:25 +08:00

.devcontainer

Bugfix(RAG):handle exceptions in aload_document_with_limit results (#2712 )

2025-05-22 17:00:03 +08:00

.github

ci(SDK): Add 0.7.0 workflow (#2493 )

2025-03-20 15:27:44 +08:00

assets

docs: update readme (#2760 )

2025-06-10 16:07:51 +08:00

configs

✨ feat: add BurnCloud as new AI model provider (#2890 )

2025-09-10 18:58:25 +08:00

docker

feat(model): AI/ML API integration (#2844 )

2025-07-15 13:17:14 +08:00

docs

chore: update 0.7.3 version (#2860 )

2025-07-25 11:21:15 +08:00

examples

fix(RAG): fix url document rag mode (#2874 )

2025-08-05 21:52:39 +08:00

i18n

feat(model): Support model icon

2025-03-19 11:51:50 +08:00

packages

✨ feat: add BurnCloud as new AI model provider (#2890 )

2025-09-10 18:58:25 +08:00

pilot/meta_data

Native data AI application framework based on AWEL+AGENT (#1152 )

2024-02-07 17:43:27 +08:00

requirements

fix(ChatKnowledge): Fix chat knowledge error (#1753 )

2024-07-29 11:13:57 +08:00

scripts

feat(model): Proxy multimodal supports (#2641 )

2025-04-21 19:36:29 +08:00

tests

fix(datasource): fix doris DB connection use mysql protocol (#2875 )

2025-08-07 10:34:14 +08:00

web

fix(web): escape document separator (#2870 )

2025-08-01 17:47:29 +08:00

.devcontainer.json

feat(KnowledgeBase):Add Word97-2003 (.doc) Binary File parsing module (#2544 )

2025-03-29 11:05:06 +08:00

.dockerignore

feat(build): Support docker install

2025-03-12 10:24:22 +08:00

.flake8

feat(model): Support yi proxy LLM (#1303 )

2024-03-15 22:15:37 +08:00

.gitignore

feat(KnowledgeBase):Add Word97-2003 (.doc) Binary File parsing module (#2544 )

2025-03-29 11:05:06 +08:00

.isort.cfg

chore: Add pylint for DB-GPT core lib (#1076 )

2024-01-16 17:36:26 +08:00

.mypy.ini

chore: Fix pylint error (#1915 )

2024-08-29 16:37:31 +08:00

.pre-commit-config.yaml

fix(model): Fix apiserver error (#2605 )

2025-04-10 10:23:49 +08:00

.python-version

feat(model): Support reasoning model (#2375 )

2025-02-28 14:32:47 +08:00

CODE_OF_CONDUCT

Added CODE_OF_CONDUCT file

2023-10-30 23:16:52 +05:30

CONTRIBUTING.md

docs: update CONTRIBUTING.md to use uv package manager (#2855 )

2025-07-19 00:19:33 +08:00

DISCKAIMER.md

chore: add disckaimer (#2274 )

2025-01-03 20:51:25 +08:00

docker-compose.yml

feat(build): Support docker install

2025-03-12 10:24:22 +08:00

install_help.py

feat(agent): More general ReAct Agent (#2556 )

2025-03-31 09:38:31 +08:00

LICENSE

Initial commit

2023-04-13 22:52:44 +08:00

Makefile

feat(model): Proxy multimodal supports (#2641 )

2025-04-21 19:36:29 +08:00

MANIFEST.in

docs: SMMF introduction and usage (#878 )

2023-12-01 12:37:29 +08:00

pyproject.toml

chore: update 0.7.3 version (#2860 )

2025-07-25 11:21:15 +08:00

README.ja.md

docs: update readme style (#2767 )

2025-06-12 19:55:27 +08:00

README.md

docs: update readme style (#2767 )

2025-06-12 19:55:27 +08:00

README.zh.md

docs: update readme style (#2767 )

2025-06-12 19:55:27 +08:00

uv.lock

feat(model): Support glm4.5 models (#2867 )

2025-07-29 16:21:14 +08:00

README.md

DB-GPT: AI Native Data App Development framework with AWEL and Agents

Documents | Contact Us | Community | Paper

What is DB-GPT?

🤖 DB-GPT is an open source AI native data app development framework with AWEL(Agentic Workflow Expression Language) and agents.

The purpose is to build infrastructure in the field of large models, through the development of multiple technical capabilities such as multi-model management (SMMF), Text2SQL effect optimization, RAG framework and optimization, Multi-Agents framework collaboration, AWEL (agent workflow orchestration), etc. Which makes large model applications with data simpler and more convenient.

🚀 In the Data 3.0 era, based on models and databases, enterprises and developers can build their own bespoke applications with less code.

Introduction

The architecture of DB-GPT is shown in the following figure:

The core capabilities include the following parts:

RAG (Retrieval Augmented Generation): RAG is currently the most practically implemented and urgently needed domain. DB-GPT has already implemented a framework based on RAG, allowing users to build knowledge-based applications using the RAG capabilities of DB-GPT.
GBI (Generative Business Intelligence): Generative BI is one of the core capabilities of the DB-GPT project, providing the foundational data intelligence technology to build enterprise report analysis and business insights.
Fine-tuning Framework: Model fine-tuning is an indispensable capability for any enterprise to implement in vertical and niche domains. DB-GPT provides a complete fine-tuning framework that integrates seamlessly with the DB-GPT project. In recent fine-tuning efforts, an accuracy rate based on the Spider dataset has been achieved at 82.5%.
Data-Driven Multi-Agents Framework: DB-GPT offers a data-driven self-evolving multi-agents framework, aiming to continuously make decisions and execute based on data.
Data Factory: The Data Factory is mainly about cleaning and processing trustworthy knowledge and data in the era of large models.
Data Sources: Integrating various data sources to seamlessly connect production business data to the core capabilities of DB-GPT.

SubModule

DB-GPT-Hub Text-to-SQL workflow with high performance by applying Supervised Fine-Tuning (SFT) on Large Language Models (LLMs).
dbgpts dbgpts is the official repository which contains some data apps、AWEL operators、AWEL workflow templates and agents which build upon DB-GPT.

Text2SQL Finetune

LLM	Supported
LLaMA	✅
LLaMA-2	✅
BLOOM	✅
BLOOMZ	✅
Falcon	✅
Baichuan	✅
Baichuan2	✅
InternLM	✅
Qwen	✅
XVERSE	✅
ChatGLM2	✅

More Information about Text2SQL finetune

DB-GPT-Plugins DB-GPT Plugins that can run Auto-GPT plugin directly
GPT-Vis Visualization protocol

AI-Native Data App

🔥🔥🔥 Released V0.7.0 | A set of significant upgrades

Installation / Quick Start

Usage Tutorial

Features

At present, we have introduced several key features to showcase our current capabilities:

Private Domain Q&A & Data Processing

The DB-GPT project offers a range of functionalities designed to improve knowledge base construction and enable efficient storage and retrieval of both structured and unstructured data. These functionalities include built-in support for uploading multiple file formats, the ability to integrate custom data extraction plug-ins, and unified vector storage and retrieval capabilities for effectively managing large volumes of information.
Multi-Data Source & GBI(Generative Business intelligence)

The DB-GPT project facilitates seamless natural language interaction with diverse data sources, including Excel, databases, and data warehouses. It simplifies the process of querying and retrieving information from these sources, empowering users to engage in intuitive conversations and gain insights. Moreover, DB-GPT supports the generation of analytical reports, providing users with valuable data summaries and interpretations.
Multi-Agents&Plugins

It offers support for custom plug-ins to perform various tasks and natively integrates the Auto-GPT plug-in model. The Agents protocol adheres to the Agent Protocol standard.

Automated Fine-tuning text2SQL

We've also developed an automated fine-tuning lightweight framework centred on large language models (LLMs), Text2SQL datasets, LoRA/QLoRA/Pturning, and other fine-tuning methods. This framework simplifies Text-to-SQL fine-tuning, making it as straightforward as an assembly line process. DB-GPT-Hub

SMMF(Service-oriented Multi-model Management Framework)

We offer extensive model support, including dozens of large language models (LLMs) from both open-source and API agents, such as LLaMA/LLaMA2, Baichuan, ChatGLM, Wenxin, Tongyi, Zhipu, and many more.

News

Provider	Supported	Models
DeepSeek	✅	🔥🔥🔥 DeepSeek-R1-0528 🔥🔥🔥 DeepSeek-V3-0324 🔥🔥🔥 DeepSeek-R1 🔥🔥🔥 DeepSeek-V3 🔥🔥🔥 DeepSeek-R1-Distill-Llama-70B 🔥🔥🔥 DeepSeek-R1-Distill-Qwen-32B 🔥🔥🔥 DeepSeek-Coder-V2-Instruct
Qwen	✅	🔥🔥🔥 Qwen3-235B-A22B 🔥🔥🔥 Qwen3-30B-A3B 🔥🔥🔥 Qwen3-32B 🔥🔥🔥 QwQ-32B 🔥🔥🔥 Qwen2.5-Coder-32B-Instruct 🔥🔥🔥 Qwen2.5-Coder-14B-Instruct 🔥🔥🔥 Qwen2.5-72B-Instruct 🔥🔥🔥 Qwen2.5-32B-Instruct
GLM	✅	🔥🔥🔥 GLM-Z1-32B-0414 🔥🔥🔥 GLM-4-32B-0414 🔥🔥🔥 Glm-4-9b-chat
Llama	✅	🔥🔥🔥 Meta-Llama-3.1-405B-Instruct 🔥🔥🔥 Meta-Llama-3.1-70B-Instruct 🔥🔥🔥 Meta-Llama-3.1-8B-Instruct 🔥🔥🔥 Meta-Llama-3-70B-Instruct 🔥🔥🔥 Meta-Llama-3-8B-Instruct
Gemma	✅	🔥🔥🔥 gemma-2-27b-it 🔥🔥🔥 gemma-2-9b-it 🔥🔥🔥 gemma-7b-it 🔥🔥🔥 gemma-2b-it
Yi	✅	🔥🔥🔥 Yi-1.5-34B-Chat 🔥🔥🔥 Yi-1.5-9B-Chat 🔥🔥🔥 Yi-1.5-6B-Chat 🔥🔥🔥 Yi-34B-Chat
Starling	✅	🔥🔥🔥 Starling-LM-7B-beta
SOLAR	✅	🔥🔥🔥 SOLAR-10.7B
Mixtral	✅	🔥🔥🔥 Mixtral-8x7B
Phi	✅	🔥🔥🔥 Phi-3

More Supported LLMs

Privacy and Security

We ensure the privacy and security of data through the implementation of various technologies, including privatized large models and proxy desensitization.
Support Datasources
- Datasources

Image

🌐 AutoDL Image

Contribution

To check detailed guidelines for new contributions, please refer how to contribute

Contributors Wall

Licence

The MIT License (MIT)

DISCKAIMER

disckaimer

Citation

If you want to understand the overall architecture of DB-GPT, please cite Paper and Paper

If you want to learn about using DB-GPT for Agent development, please cite the Paper

@article{xue2023dbgpt,
      title={DB-GPT: Empowering Database Interactions with Private Large Language Models}, 
      author={Siqiao Xue and Caigao Jiang and Wenhui Shi and Fangyin Cheng and Keting Chen and Hongjun Yang and Zhiping Zhang and Jianshan He and Hongyang Zhang and Ganglin Wei and Wang Zhao and Fan Zhou and Danrui Qi and Hong Yi and Shaodong Liu and Faqiang Chen},
      year={2023},
      journal={arXiv preprint arXiv:2312.17449},
      url={https://arxiv.org/abs/2312.17449}
}
@misc{huang2024romasrolebasedmultiagentdatabase,
      title={ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planning}, 
      author={Yi Huang and Fangyin Cheng and Fan Zhou and Jiahui Li and Jian Gong and Hongjun Yang and Zhidong Fan and Caigao Jiang and Siqiao Xue and Faqiang Chen},
      year={2024},
      eprint={2412.13520},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2412.13520}, 
}
@inproceedings{xue2024demonstration,
      title={Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models}, 
      author={Siqiao Xue and Danrui Qi and Caigao Jiang and Wenhui Shi and Fangyin Cheng and Keting Chen and Hongjun Yang and Zhiping Zhang and Jianshan He and Hongyang Zhang and Ganglin Wei and Wang Zhao and Fan Zhou and Hong Yi and Shaodong Liu and Hongjun Yang and Faqiang Chen},
      year={2024},
      booktitle = "Proceedings of the VLDB Endowment",
      url={https://arxiv.org/abs/2404.10209}
}

Contact Information

Thanks to everyone who has contributed to DB-GPT! Your ideas, code, comments, and even sharing them at events and on social platforms can make DB-GPT better. We are working on building a community, if you have any ideas for building the community, feel free to contact us.

Github Issues ⭐️：For questions about using GB-DPT, see the CONTRIBUTING.
Github Discussions ⭐️：Share your experience or unique apps.
Twitter ⭐️：Please feel free to talk to us.

README.md Unescape Escape

DB-GPT: AI Native Data App Development framework with AWEL and Agents

What is DB-GPT?

Introduction

SubModule

Text2SQL Finetune

AI-Native Data App

Installation / Quick Start

Features

Image

Contribution

Contributors Wall

Licence

DISCKAIMER

Citation

Contact Information

README.md