Interact with your data and environment using a local GPT, with no data leaks: 100% private, 100% secure

DB-GPT: AI Native Data App Development Framework with AWEL (Agentic Workflow Expression Language) and Agents

What is DB-GPT?

🤖 DB-GPT is an open-source, AI-native data app development framework with AWEL (Agentic Workflow Expression Language) and agents.

Its purpose is to build infrastructure for the large-model field by developing multiple technical capabilities, including multi-model management (SMMF), Text2SQL optimization, a RAG framework and its optimization, multi-agent collaboration, and AWEL (agent workflow orchestration). Together, these make it simpler and more convenient to build large-model applications on top of data.

🚀 In the Data 3.0 era, based on models and databases, enterprises and developers can build their own bespoke applications with less code.

DISCLAIMER

AI-Native Data App



[Screenshots: app_chat_v0, app_manage_chat_data_v0, chat_dashboard_display_v0, agent_prompt_awel_v0]

Introduction

The architecture of DB-GPT is shown in the following figure:

The core capabilities include the following parts:

  • RAG (Retrieval Augmented Generation): RAG is currently the most widely deployed and most urgently needed capability in practice. DB-GPT implements a RAG-based framework that lets users build knowledge-based applications on its RAG capabilities.

  • GBI (Generative Business Intelligence): Generative BI is one of the core capabilities of the DB-GPT project, providing the foundational data intelligence technology to build enterprise report analysis and business insights.

  • Fine-tuning Framework: Model fine-tuning is an indispensable capability for any enterprise working in vertical and niche domains. DB-GPT provides a complete fine-tuning framework that integrates seamlessly with the DB-GPT project. Recent fine-tuning efforts achieved 82.5% accuracy on the Spider dataset.

  • Data-Driven Multi-Agents Framework: DB-GPT offers a data-driven, self-evolving multi-agent framework that aims to continuously make and execute decisions based on data.

  • Data Factory: The Data Factory is mainly about cleaning and processing trustworthy knowledge and data in the era of large models.

  • Data Sources: Integrating various data sources to seamlessly connect production business data to the core capabilities of DB-GPT.
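
To make the RAG capability listed above more concrete, the following is a minimal, self-contained sketch of the general retrieve-then-prompt pattern. It does not use DB-GPT's own API: the function names (`tokenize`, `retrieve`, `build_prompt`) and the word-overlap scoring are assumptions chosen for illustration, standing in for the embedding-based retrieval a real RAG framework would normally use.

```python
import re
from collections import Counter


def tokenize(text: str) -> Counter:
    """Lowercase the text and count its alphanumeric tokens."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question: str, documents: list[str], top_k: int = 1) -> list[str]:
    """Return up to top_k documents that share the most tokens with the question.

    Hypothetical toy scorer: real systems typically rank by vector similarity.
    """
    q_tokens = tokenize(question)
    # Score each document by the size of its token overlap with the question.
    scored = [(sum((q_tokens & tokenize(doc)).values()), doc) for doc in documents]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    # Drop documents with no overlap at all.
    return [doc for score, doc in scored[:top_k] if score > 0]


def build_prompt(question: str, context: list[str]) -> str:
    """Assemble a prompt that places the retrieved context before the question."""
    context_block = "\n".join(f"- {doc}" for doc in context)
    return f"Context:\n{context_block}\n\nQuestion: {question}\nAnswer:"


if __name__ == "__main__":
    docs = [
        "Orders are stored in the orders table with a created_at column.",
        "The office cafeteria opens at noon.",
    ]
    question = "Which table stores orders?"
    print(build_prompt(question, retrieve(question, docs)))
```

The resulting prompt would then be sent to a language model; that step is left out here because it depends on the model deployment in use.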

SubModule

  • DB-GPT-Hub: A high-performance Text-to-SQL workflow that applies Supervised Fine-Tuning (SFT) to Large Language Models (LLMs).

  • dbgpts: The official repository of data apps, AWEL operators, AWEL workflow templates, and agents built on DB-GPT.

Text2SQL Finetune

  • Supported LLMs
    • LLaMA
    • LLaMA-2
    • BLOOM
    • BLOOMZ
    • Falcon
    • Baichuan
    • Baichuan2
    • InternLM
    • Qwen
    • XVERSE
    • ChatGLM2

More Information about Text2SQL finetune

Install

Docker Linux macOS Windows

Usage Tutorial

Features

At present, we have introduced several key features to showcase our current capabilities:

Image

🌐 AutoDL Image

Language Switching

In the .env configuration file, modify the LANGUAGE parameter to switch to different languages. The default is English (Chinese: zh, English: en, other languages to be added later).
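
As an illustration of the setting described above, here is a minimal sketch of how a `LANGUAGE` entry in `.env`-style text could be read, falling back to English when it is absent. The helper name `read_language` and the parsing rules are assumptions made for this example; this is not DB-GPT's actual configuration loader.

```python
def read_language(env_text: str, default: str = "en") -> str:
    """Return the LANGUAGE value from .env-style text, or the default.

    Hypothetical helper for illustration only.
    """
    for raw_line in env_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines, comments, and lines that are not KEY=VALUE pairs.
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        if key.strip() == "LANGUAGE":
            # Strip surrounding whitespace and optional quotes.
            return value.strip().strip("'\"") or default
    return default


if __name__ == "__main__":
    sample_env = "# example .env content\nLANGUAGE=zh\n"
    print(read_language(sample_env))
```

With this sketch, text containing `LANGUAGE=zh` selects Chinese, and text without a `LANGUAGE` line falls back to `en`, matching the default described above.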

Contribution

Contributors Wall

Licence

The MIT License (MIT)

Citation

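If you want to understand the overall architecture of DB-GPT, please cite the DB-GPT paper (xue2023dbgpt) and the demonstration paper (xue2024demonstration).

If you want to learn about using DB-GPT for agent development, please cite the ROMAS paper (huang2024romasrolebasedmultiagentdatabase).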

@article{xue2023dbgpt,
      title={DB-GPT: Empowering Database Interactions with Private Large Language Models}, 
      author={Siqiao Xue and Caigao Jiang and Wenhui Shi and Fangyin Cheng and Keting Chen and Hongjun Yang and Zhiping Zhang and Jianshan He and Hongyang Zhang and Ganglin Wei and Wang Zhao and Fan Zhou and Danrui Qi and Hong Yi and Shaodong Liu and Faqiang Chen},
      year={2023},
      journal={arXiv preprint arXiv:2312.17449},
      url={https://arxiv.org/abs/2312.17449}
}
@misc{huang2024romasrolebasedmultiagentdatabase,
      title={ROMAS: A Role-Based Multi-Agent System for Database monitoring and Planning}, 
      author={Yi Huang and Fangyin Cheng and Fan Zhou and Jiahui Li and Jian Gong and Hongjun Yang and Zhidong Fan and Caigao Jiang and Siqiao Xue and Faqiang Chen},
      year={2024},
      eprint={2412.13520},
      archivePrefix={arXiv},
      primaryClass={cs.AI},
      url={https://arxiv.org/abs/2412.13520}, 
}
@inproceedings{xue2024demonstration,
      title={Demonstration of DB-GPT: Next Generation Data Interaction System Empowered by Large Language Models}, 
      author={Siqiao Xue and Danrui Qi and Caigao Jiang and Wenhui Shi and Fangyin Cheng and Keting Chen and Hongjun Yang and Zhiping Zhang and Jianshan He and Hongyang Zhang and Ganglin Wei and Wang Zhao and Fan Zhou and Hong Yi and Shaodong Liu and Hongjun Yang and Faqiang Chen},
      year={2024},
      booktitle={Proceedings of the VLDB Endowment},
      url={https://arxiv.org/abs/2404.10209}
}

Contact Information

We are working on building a community. If you have any ideas for building it, feel free to contact us.

Star History Chart