privateGPT

mirror of https://github.com/imartinez/privateGPT.git synced 2025-09-22 19:47:16 +00:00

Author	SHA1	Message	Date
Pablo Orgaz	347be643f7	fix(llm): special tokens and leading space (#1831 )	2024-04-04 14:37:29 +02:00
Robin Boone	b3b0140e24	feat(llm): Ollama LLM-Embeddings decouple + longer keep_alive settings (#1800 )	2024-04-02 16:23:10 +02:00
Iván Martínez	6f6c785dac	feat(llm): Ollama timeout setting (#1773 ) * added request_timeout to ollama, default set to 30.0 in settings.yaml and settings-ollama.yaml * Update settings-ollama.yaml * Update settings.yaml * updated settings.py and tidied up settings-ollama-yaml * feat(UI): Faster startup and document listing (#1763) * fix(ingest): update script label (#1770) huggingface -> Hugging Face * Fix lint errors --------- Co-authored-by: Stephen Gresham <steve@gresham.id.au> Co-authored-by: Ikko Eltociear Ashimine <eltociear@gmail.com>	2024-03-20 21:33:46 +01:00
Brett England	134fc54d7d	feat(ingest): Created a faster ingestion mode - pipeline (#1750 ) * Unify pgvector and postgres connection settings * Remove local changes * Update file pgvector->postgres * postgresql should be postgres * Adding pipeline ingestion mode * disable hugging face parallelism. Continue on file to doc transform failure * Semaphore to limit docq async workers. ETA reporting	2024-03-19 21:24:46 +01:00
Otto L	1efac6a3fe	feat(llm - embed): Add support for Azure OpenAI (#1698 ) * Add support for Azure OpenAI * fix: wrong default api_version Should be dashes instead of underscores. see: https://learn.microsoft.com/en-us/azure/ai-services/openai/reference * fix: code styling applied "make check" changes * refactor: extend documentation * mention azopenai as available option and extras * add recommended section * include settings-azopenai.yaml configuration file * fix: documentation	2024-03-15 16:49:50 +01:00
Brett England	63de7e4930	feat: unify settings for vector and nodestore connections to PostgreSQL (#1730 ) * Unify pgvector and postgres connection settings * Remove local changes * Update file pgvector->postgres	2024-03-15 09:55:17 +01:00
Brett England	68b3a34b03	feat(nodestore): add Postgres for the doc and index store (#1706 ) * Adding Postgres for the doc and index store * Adding documentation. Rename postgres database local->simple. Postgres storage dependencies * Update documentation for postgres storage * Renaming feature to nodestore * update docstore -> nodestore in doc * missed some docstore changes in doc * Updated poetry.lock * Formatting updates to pass ruff/black checks * Correction to unreachable code! * Format adjustment to pass black test * Adjust extra inclusion name for vector pg * extra dep change for pg vector * storage-postgres -> storage-nodestore-postgres * Hash change on poetry lock	2024-03-14 17:12:33 +01:00
icsy7867	02dc83e8e9	feat(llm): adds serveral settings for llamacpp and ollama (#1703 )	2024-03-11 22:51:05 +01:00
Iván Martínez	45f05711eb	feat: Upgrade to LlamaIndex to 0.10 (#1663 ) * Extract optional dependencies * Separate local mode into llms-llama-cpp and embeddings-huggingface for clarity * Support Ollama embeddings * Upgrade to llamaindex 0.10.14. Remove legacy use of ServiceContext in ContextChatEngine * Fix vector retriever filters	2024-03-06 17:51:30 +01:00
TQ	cd40e3982b	feat(Vector): support pgvector (#1624 )	2024-02-20 15:29:26 +01:00
Ygal Blum	6bbec79583	feat(llm): Add support for Ollama LLM (#1526 )	2024-02-09 15:50:50 +01:00
Naveen Kannan	869233f0e4	fix: Adding an LLM param to fix broken generator from llamacpp (#1519 )	2024-01-17 18:10:45 +01:00
CognitiveTech	e326126d0d	feat: add mistral + chatml prompts (#1426 )	2024-01-16 22:51:14 +01:00
Matthew Hill	2d27a9f956	feat(llm): Add openailike llm mode (#1447 ) This mode behaves the same as the openai mode, except that it allows setting custom models not supported by OpenAI. It can be used with any tool that serves models from an OpenAI compatible API. Implements #1424	2023-12-26 10:26:08 +01:00
Iván Martínez	4780540870	feat(settings): Configurable context_window and tokenizer (#1437 )	2023-12-21 14:49:35 +01:00
3ly-13	a072a40a7c	Allow setting OpenAI model in settings (#1386 ) feat(settings): Allow setting openai model to be used. Default to GPT 3.5	2023-12-09 20:13:00 +01:00
Louis Melchior	a3ed14c58f	feat(llm): drop default_system_prompt (#1385 ) As discussed on Discord, the decision has been made to remove the system prompts by default, to better segregate the API and the UI usages. A concurrent PR (#1353) is enabling the dynamic setting of a system prompt in the UI. Therefore, if UI users want to use a custom system prompt, they can specify one directly in the UI. If the API users want to use a custom prompt, they can pass it directly into their messages that they are passing to the API. In the highlight of the two use case above, it becomes clear that default system_prompt does not need to exist.	2023-12-08 23:13:51 +01:00
lopagela	56af625d71	Fix the parallel ingestion mode, and make it available through conf (#1336 ) * Fix the parallel ingestion mode, and make it available through conf Also updated the documentation to show how to configure the ingest mode. * PR feedback: redirect to documentation	2023-11-30 11:41:55 +01:00
Gianni Acquisto	9c192ddd73	Added max_new_tokens as a config option to llm yaml block (#1317 ) * added max_new_tokens as a configuration option to the llm block in settings * Update fern/docs/pages/manual/settings.mdx Co-authored-by: lopagela <lpglm@orange.fr> * Update private_gpt/settings/settings.py Add default value for max_new_tokens = 256 Co-authored-by: lopagela <lpglm@orange.fr> * Addressed location of docs comment * reformatting from running 'make check' * remove default config value from settings.yaml --------- Co-authored-by: lopagela <lpglm@orange.fr>	2023-11-26 19:17:29 +01:00
lopagela	bafdd3baf1	Ingestion Speedup Multiple strategy (#1309 )	2023-11-25 20:12:09 +01:00
Iván Martínez	944c43bfa8	Multi language support - fern debug (#1307 ) --------- Co-authored-by: Louis <lpglm@orange.fr> Co-authored-by: LeMoussel <cnhx27@gmail.com>	2023-11-25 14:34:23 +01:00
Iván Martínez	510caa576b	Make qdrant the default vector db (#1285 ) * Make qdrant the default vector db --------- Co-authored-by: Pablo Orgaz <pabloogc@gmail.com> Co-authored-by: lopagela <lpglm@orange.fr>	2023-11-20 16:19:22 +01:00
Anush	03d1ae6d70	feat: Qdrant support (#1228 ) * feat: Qdrant support * Update private_gpt/components/vector_store/vector_store_component.py	2023-11-13 21:23:26 +01:00
Pablo Orgaz	022bd718e3	fix: Remove global state (#1216 ) * Remove all global settings state * chore: remove autogenerated class * chore: cleanup * chore: merge conflicts	2023-11-12 22:20:36 +01:00
lopagela	a579c9bdc5	Update poetry lock (#1209 ) * Update the version of llama_index used to fix transient openai errors * Update poetry.lock file * Make `local` mode the default mode by default	2023-11-11 22:44:19 +01:00
lopagela	0c40cfb115	Endpoint to delete documents ingested (#1163 ) A file that is ingested will be transformed into several documents (that are organized into nodes). This endpoint is deleting documents (bits of a file). These bits can be retrieved thanks to the endpoint to list all the documents.	2023-11-06 15:47:42 +01:00
Iván Martínez	ad512e3c42	Feature/sagemaker embedding (#1161 ) * Sagemaker deployed embedding model support --------- Co-authored-by: Pablo Orgaz <pabloogc@gmail.com>	2023-11-05 16:16:49 +01:00
Pierre Marais	f29df84301	Disable chromaDB anonymous information collection (#1144 ) See https://docs.trychroma.com/telemetry	2023-11-02 12:45:48 +01:00
Pablo Orgaz	a517a588c4	fix: sagemaker config and chat methods (#1142 )	2023-10-30 21:54:41 +01:00
lopagela	64c5ae214a	feat: Drop loguru and use builtin `logging` (#1133 ) * Configure simple builtin logging Changed the 2 existing `print` in the `private_gpt` code base into actual python logging, stop using loguru (dependency will be dropped in a later commit). Try to use the `key=value` logging convention in logs (to indicate what dynamic values represents, and what is dynamic vs not). Using `%s` log style, so that the string formatting is pushed inside the logger, giving the ability to the logger to determine if the string need to be formatted or not (i.e. strings from debug logs might not be formatted if the log level is not debug) The (basic) builtin log configuration have been placed in `private_gpt/__init__.py` in order to initialize the logging system even before we start to launch any python code in `private_gpt` package (ensuring we get any initialization log formatted as we want to) Disabled `uvicorn` custom logging format, resulting in having uvicorn logs being outputted in our formatted. Some more concise format could be used if we want to, especially: ``` COMPACT_LOG_FORMAT = '%(asctime)s.%(msecs)03d [%(levelname)s] %(name)s - %(message)s' ``` Python documentation and cookbook on logging for reference: * https://docs.python.org/3/library/logging.html * https://docs.python.org/3/howto/logging.html * Removing loguru from the dependencies Result of `poetry remove loguru` * PR feedback: using `logger` variable name instead of `log` --------- Co-authored-by: Louis Melchior <louis@jaris.io>	2023-10-29 19:11:02 +01:00
Pablo Orgaz	895588b82a	fix: Docker and sagemaker setup (#1118 ) * fix: docker copying extra files * feat: allow configuring mode through env vars * feat: Attempt to build and tag a docker image * fix: run docker on release * fix: typing in prompt transformation * chore: remove tutorial comments	2023-10-27 13:29:29 +02:00
Iván Martínez	78546524d0	Use OpenAI for embeddings when openai mode is selected (#1096 )	2023-10-23 10:50:42 +02:00
Iván Martínez	f5a9bf4e37	fix: chromadb max batch size (#1087 )	2023-10-20 18:24:56 +02:00
Pablo Orgaz	51cc638758	Next version of PrivateGPT (#1077 ) * Dockerize private-gpt * Use port 8001 for local development * Add setup script * Add CUDA Dockerfile * Create README.md * Make the API use OpenAI response format * Truncate prompt * refactor: add models and __pycache__ to .gitignore * Better naming * Update readme * Move models ignore to it's folder * Add scaffolding * Apply formatting * Fix tests * Working sagemaker custom llm * Fix linting * Fix linting * Enable streaming * Allow all 3.11 python versions * Use llama 2 prompt format and fix completion * Restructure (#3) Co-authored-by: Pablo Orgaz <pablo@Pablos-MacBook-Pro.local> * Fix Dockerfile * Use a specific build stage * Cleanup * Add FastAPI skeleton * Cleanup openai package * Fix DI and tests * Split tests and tests with coverage * Remove old scaffolding * Add settings logic (#4) * Add settings logic * Add settings for sagemaker --------- Co-authored-by: Pablo Orgaz <pablo@Pablos-MacBook-Pro.local> * Local LLM (#5) * Add settings logic * Add settings for sagemaker * Add settings-local-example.yaml * Delete terraform files * Refactor tests to use fixtures * Join deltas * Add local model support --------- Co-authored-by: Pablo Orgaz <pablo@Pablos-MacBook-Pro.local> * Update README.md * Fix tests * Version bump * Enable simple llamaindex observability (#6) * Enable simple llamaindex observability * Improve code through linting * Update README.md * Move to async (#7) * Migrate implementation to use asyncio * Formatting * Cleanup * Linting --------- Co-authored-by: Pablo Orgaz <pablo@Pablos-MacBook-Pro.local> * Query Docs and gradio UI * Remove unnecessary files * Git ignore chromadb folder * Async migration + DI Cleanup * Fix tests * Add integration test * Use fastapi responses * Retrieval service with partial implementation * Cleanup * Run formatter * Fix types * Fetch nodes asynchronously * Install local dependencies in tests * Install ui dependencies in tests * Install dependencies for llama-cpp * Fix sudo * Attempt to fix cuda issues * Attempt to fix cuda issues * Try to reclaim some space from ubuntu machine * Retrieval with context * Fix lint and imports * Fix mypy * Make retrieval API a POST * Make Completions body a dataclass * Fix LLM chat message order * Add Query Chunks to Gradio UI * Improve rag query prompt * Rollback CI Changes * Move to sync code * Using Llamaindex abstraction for query retrieval * Fix types * Default to CONDENSED chat mode for contextualized chat * Rename route function * Add Chat endpoint * Remove webhooks * Add IntelliJ run config to gitignore * .gitignore applied * Sync chat completion * Refactor total * Typo in context_files.py * Add embeddings component and service * Remove wrong dataclass from IngestService * Filter by context file id implementation * Fix typing * Implement context_filter and separate from the bool use_context in the API * Change chunks api to avoid conceptual class of the context concept * Deprecate completions and fix tests * Remove remaining dataclasses * Use embedding component in ingest service * Fix ingestion to have multipart and local upload * Fix ingestion API * Add chunk tests * Add configurable paths * Cleaning up * Add more docs * IngestResponse includes a list of IngestedDocs * Use IngestedDoc in the Chunk document reference * Rename ingest routes to ingest_router.py * Fix test working directory for intellij * Set testpaths for pytest * Remove unused as_chat_engine * Add .fleet ide to gitignore * Make LLM and Embedding model configurable * Fix imports and checks * Let local_data folder exist empty in the repository * Don't use certain metadata in LLM * Remove long lines * Fix windows installation * Typos * Update poetry.lock * Add TODO for linux * Script and first version of docs * No jekill build * Fix relative url to openapi json * Change default docs values * Move chromadb dependency to the general group * Fix tests to use separate local_data * Create CNAME * Update CNAME * Fix openapi.json relative path * PrivateGPT logo * WIP OpenAPI documentation metadata * Add ingest script (#11) * Add ingest script * Fix broken name refactor * Add ingest docs and Makefile script * Linting * Move transformers to main dependency * Move torch to main dependencies * Don't load HuggingFaceEmbedding in tests * Fix lint --------- Co-authored-by: Pablo Orgaz <pablo@Pablos-MacBook-Pro.local> * Rename file to camel_case * Commit settings-local.yaml * Move documentation to public docs * Fix docker image for linux * Installation and Running the Server documentation * Move back to docs folder, as it is the only supported by github pages * Delete CNAME * Create CNAME * Delete CNAME * Create CNAME * Improved API documentation * Fix lint * Completions documentation * Updated openapi scheme * Ingestion API doc * Minor doc changes * Updated openapi scheme * Chunks API documentation * Embeddings and Health API, and homogeneous responses * Revamp README with new skeleton of content * More docs * PrivateGPT logo * Improve UI * Update ingestion docu * Update README with new sections * Use context window in the retriever * Gradio Documentation * Add logo to UI * Include Contributing and Community sections to README * Update links to resources in the README * Small README.md updates * Wrap lines of README.md * Don't put health under /v1 * Add copy button to Chat * Architecture documentation * Updated openapi.json * Updated openapi.json * Updated openapi.json * Change UI label * Update documentation * Add releases link to README.md * Gradio avatar and stop debug * Readme update * Clean old files * Remove unused terraform checks * Update twitter link. * Disable minimum coverage * Clean install message in README.md --------- Co-authored-by: Pablo Orgaz <pablo@Pablos-MacBook-Pro.local> Co-authored-by: Iván Martínez <ivanmartit@gmail.com> Co-authored-by: RubenGuerrero <ruben.guerrero@boopos.com> Co-authored-by: Daniel Gallego Vico <daniel.gallego@bq.com>	2023-10-19 16:04:35 +02:00

34 Commits