langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-06-01 12:38:45 +00:00

Author	SHA1	Message	Date
Erick Friis	7bc100fd43	docs: integration package pip installs (#15762 ) More than 300 files - will fail check_diff. Will merge after Vercel deploy succeeds Still occurrences that need changing - will update more later	2024-01-09 11:13:10 -08:00
Harrison Chase	38ae4df3a1	update ragatouille integration (#15658 )	2024-01-07 10:51:34 -08:00
Erick Friis	b1fa726377	docs: langchain-openai (#15513 ) Updates docs and cookbooks to import ChatOpenAI, OpenAI, and OpenAI Embeddings from `langchain_openai` There are likely more --------- Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-06 15:54:48 -08:00
Bagatur	c5226d7a18	docs: update cohere chat integration (#15562 ) Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-01-05 16:33:29 -08:00
Harrison Chase	fd5fbb507d	fix links (#15566 ) there are still a few broken ones: - some in the chains docs, which I will delete soon :) - some pointing to a sqlite tool, which we should add	2024-01-04 21:57:30 -08:00
Harrison Chase	14966581df	add ragatouille (#15561 )	2024-01-04 13:45:20 -08:00
Bagatur	baeac236b6	langchain[patch], experimental[patch]: update utilities imports (#15438 )	2024-01-03 02:18:15 -05:00
Bagatur	fa5d49f2c1	docs, experimental[patch], langchain[patch], community[patch]: update storage imports (#15429 ) ran ```bash g grep -l "langchain.vectorstores" \| xargs -L 1 sed -i '' "s/langchain\.vectorstores/langchain_community.vectorstores/g" g grep -l "langchain.document_loaders" \| xargs -L 1 sed -i '' "s/langchain\.document_loaders/langchain_community.document_loaders/g" g grep -l "langchain.chat_loaders" \| xargs -L 1 sed -i '' "s/langchain\.chat_loaders/langchain_community.chat_loaders/g" g grep -l "langchain.document_transformers" \| xargs -L 1 sed -i '' "s/langchain\.document_transformers/langchain_community.document_transformers/g" g grep -l "langchain\.graphs" \| xargs -L 1 sed -i '' "s/langchain\.graphs/langchain_community.graphs/g" g grep -l "langchain\.memory\.chat_message_histories" \| xargs -L 1 sed -i '' "s/langchain\.memory\.chat_message_histories/langchain_community.chat_message_histories/g" gco master libs/langchain/tests/unit_tests//test_imports.py gco master libs/langchain/tests/unit_tests/*/test_public_api.py ```	2024-01-02 16:47:11 -05:00
Bagatur	480626dc99	docs, community[patch], experimental[patch], langchain[patch], cli[pa… (#15412 ) …tch]: import models from community ran ```bash git grep -l 'from langchain\.chat_models' \| xargs -L 1 sed -i '' "s/from\ langchain\.chat_models/from\ langchain_community.chat_models/g" git grep -l 'from langchain\.llms' \| xargs -L 1 sed -i '' "s/from\ langchain\.llms/from\ langchain_community.llms/g" git grep -l 'from langchain\.embeddings' \| xargs -L 1 sed -i '' "s/from\ langchain\.embeddings/from\ langchain_community.embeddings/g" git checkout master libs/langchain/tests/unit_tests/llms git checkout master libs/langchain/tests/unit_tests/chat_models git checkout master libs/langchain/tests/unit_tests/embeddings/test_imports.py make format cd libs/langchain; make format cd ../experimental; make format cd ../core; make format ```	2024-01-02 15:32:16 -05:00
Bagatur	8e0d5813c2	langchain[patch], experimental[patch]: replace langchain.schema imports (#15410 ) Import from core instead. Ran: ```bash git grep -l 'from langchain.schema\.output_parser' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.output_parser/from\ langchain_core.output_parsers/g" git grep -l 'from langchain.schema\.messages' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.messages/from\ langchain_core.messages/g" git grep -l 'from langchain.schema\.document' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.document/from\ langchain_core.documents/g" git grep -l 'from langchain.schema\.runnable' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.runnable/from\ langchain_core.runnables/g" git grep -l 'from langchain.schema\.vectorstore' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.vectorstore/from\ langchain_core.vectorstores/g" git grep -l 'from langchain.schema\.language_model' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.language_model/from\ langchain_core.language_models/g" git grep -l 'from langchain.schema\.embeddings' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.embeddings/from\ langchain_core.embeddings/g" git grep -l 'from langchain.schema\.storage' \| xargs -L 1 sed -i '' "s/from\ langchain\.schema\.storage/from\ langchain_core.stores/g" git checkout master libs/langchain/tests/unit_tests/schema/ make format cd libs/experimental make format cd ../langchain make format ```	2024-01-02 15:09:45 -05:00
Ofer Mendelevitch	11accf8366	Community: Newlines before bullets in IPYNB files (Vectara) (#15330 ) - Description: updated all Vectara IPYNB files so that bullets look okay in docs (added newline) - Twitter handle: @ofermend	2023-12-30 14:04:04 -08:00
Erick Friis	75ba22793f	community: Vectara summarization (#14970 ) Description: Adding Summarization to Vectara, to reflect it provides not only vector-store type functionality but also can return a summary. Also added: MMR capability (in the Vectara platform side) Updated templates Updated documentation and IPYNB examples Tag maintainer: @baskaryan Twitter handle: @ofermend --------- Co-authored-by: Ofer Mendelevitch <ofermend@gmail.com>	2023-12-20 11:51:33 -08:00
Anush	60c70effe9	community[minor]: Qdrant sparse vector retriever (#14814 ) ## Description This PR intends to add support for Qdrant's new [sparse vector retrieval](https://qdrant.tech/articles/sparse-vectors/) by introducing a new retriever class, `QdrantSparseVectorRetriever`. Necessary usage docs and integration tests have been added for the retriever. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-20 02:22:19 -05:00
JaguarDB	992b04e475	community[minor]: added jaguar vector store (#14838 ) Description: A new vector store Jaguar is being added. Class, test scripts, and documentation is added. Issue: None -- This is the first PR contributing to LangChain Dependencies: This depends on "pip install -U jaguardb-http-client" client http package Tag maintainer: @baskaryan, @eyurtsev, @hwchase1 Twitter handle: @workbot --------- Co-authored-by: JY <jyjy@jaguardb> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-12-19 10:40:18 -05:00
Erick Friis	ab94119a53	docs[patch]: fix bullet points (#14684 ) - docs fixes - escape - bullets	2023-12-13 14:35:19 -08:00
standby24x7	d31ff30df6	docs[patch] Fix some typos in merger_retriever.ipynb (#14502 ) This patch fixes some typos. <!-- Thank you for contributing to LangChain! Replace this entire comment with: - Description: a description of the change, - Issue: the issue # it fixes (if applicable), - Dependencies: any dependencies required for this change, - Tag maintainer: for a quicker response, tag the relevant maintainer (see below), - Twitter handle: we announce bigger features on Twitter. If your PR gets announced, and you'd like a mention, we'll gladly shout you out! Please make sure your PR is passing linting and testing before submitting. Run `make format`, `make lint` and `make test` to check this locally. See contribution guidelines for more information on how to write/run tests, lint, etc: https://github.com/langchain-ai/langchain/blob/master/.github/CONTRIBUTING.md If you're adding a new integration, please include: 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/extras` directory. If no one reviews your PR within a few days, please @-mention one of @baskaryan, @eyurtsev, @hwchase17. --> Signed-off-by: Masanari Iida <standby24x7@gmail.com>	2023-12-12 17:02:45 -08:00
Bagatur	9ffca3b92a	docs[patch], templates[patch]: Import from core (#14575 ) Update imports to use core for the low-hanging fruit changes. Ran following ```bash git grep -l 'langchain.schema.runnable' {docs,templates,cookbook} \| xargs sed -i '' 's/langchain\.schema\.runnable/langchain_core.runnables/g' git grep -l 'langchain.schema.output_parser' {docs,templates,cookbook} \| xargs sed -i '' 's/langchain\.schema\.output_parser/langchain_core.output_parsers/g' git grep -l 'langchain.schema.messages' {docs,templates,cookbook} \| xargs sed -i '' 's/langchain\.schema\.messages/langchain_core.messages/g' git grep -l 'langchain.schema.chat_histry' {docs,templates,cookbook} \| xargs sed -i '' 's/langchain\.schema\.chat_history/langchain_core.chat_history/g' git grep -l 'langchain.schema.prompt_template' {docs,templates,cookbook} \| xargs sed -i '' 's/langchain\.schema\.prompt_template/langchain_core.prompts/g' git grep -l 'from langchain.pydantic_v1' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.pydantic_v1/from langchain_core.pydantic_v1/g' git grep -l 'from langchain.tools.base' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.tools\.base/from langchain_core.tools/g' git grep -l 'from langchain.chat_models.base' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.chat_models.base/from langchain_core.language_models.chat_models/g' git grep -l 'from langchain.llms.base' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.llms\.base\ /from langchain_core.language_models.llms\ /g' git grep -l 'from langchain.embeddings.base' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.embeddings\.base/from langchain_core.embeddings/g' git grep -l 'from langchain.vectorstores.base' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.vectorstores\.base/from langchain_core.vectorstores/g' git grep -l 'from langchain.agents.tools' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.agents\.tools/from langchain_core.tools/g' git grep -l 'from langchain.schema.output' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.schema\.output\ /from langchain_core.outputs\ /g' git grep -l 'from langchain.schema.embeddings' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.schema\.embeddings/from langchain_core.embeddings/g' git grep -l 'from langchain.schema.document' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.schema\.document/from langchain_core.documents/g' git grep -l 'from langchain.schema.agent' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.schema\.agent/from langchain_core.agents/g' git grep -l 'from langchain.schema.prompt ' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.schema\.prompt\ /from langchain_core.prompt_values /g' git grep -l 'from langchain.schema.language_model' {docs,templates,cookbook} \| xargs sed -i '' 's/from langchain\.schema\.language_model/from langchain_core.language_models/g' ```	2023-12-11 16:49:10 -08:00
Leonid Ganeline	18aba7fdef	docs: notebook linting (#14366 ) Many jupyter notebooks didn't pass linting. List of these files are presented in the [tool.ruff.lint.per-file-ignores] section of the pyproject.toml . Addressed these bugs: - fixed bugs; added missed imports; updated pyproject.toml Only the `document_loaders/tensorflow_datasets.ipyn`, `cookbook/gymnasium_agent_simulation.ipynb` are not completely fixed. I'm not sure about imports. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2023-12-07 15:47:48 -08:00
Leonid Ganeline	94bf733dae	docs[patch]: `AWS` platform page update (#14160 ) The `AWS` platform page has many missed integrations. - added missed integration references to the `AWS` platform page - added/updated descriptions and links in the referenced notebooks - renamed two notebook files. They have file names != page Title, which generate unordered ToC. - reroute the URLs for renamed files - fixed `amazon_textract` notebook: removed failed cell outputs	2023-12-03 15:42:52 -08:00
AthulVincent	67c55cb5b0	Implemented MongoDB Atlas Self-Query Retriever (#13321 ) # Description This PR implements Self-Query Retriever for MongoDB Atlas vector store. I've implemented the comparators and operators that are supported by MongoDB Atlas vector store according to the section titled "Atlas Vector Search Pre-Filter" from https://www.mongodb.com/docs/atlas/atlas-vector-search/vector-search-stage/. Namely: ``` allowed_comparators = [ Comparator.EQ, Comparator.NE, Comparator.GT, Comparator.GTE, Comparator.LT, Comparator.LTE, Comparator.IN, Comparator.NIN, ] """Subset of allowed logical operators.""" allowed_operators = [ Operator.AND, Operator.OR ] ``` Translations from comparators/operators to MongoDB Atlas filter operators(you can find the syntax in the "Atlas Vector Search Pre-Filter" section from the previous link) are done using the following dictionary: ``` map_dict = { Operator.AND: "$and", Operator.OR: "$or", Comparator.EQ: "$eq", Comparator.NE: "$ne", Comparator.GTE: "$gte", Comparator.LTE: "$lte", Comparator.LT: "$lt", Comparator.GT: "$gt", Comparator.IN: "$in", Comparator.NIN: "$nin", } ``` In visit_structured_query() the filters are passed as "pre_filter" and not "filter" as in the MongoDB link above since langchain's implementation of MongoDB atlas vector store(libs\langchain\langchain\vectorstores\mongodb_atlas.py) in _similarity_search_with_score() sets the "filter" key to have the value of the "pre_filter" argument. ``` params["filter"] = pre_filter ``` Test cases and documentation have also been added. # Issue #11616 # Dependencies No new dependencies have been added. # Documentation I have created the notebook mongodb_atlas_self_query.ipynb outlining the steps to get the self-query mechanism working. I worked closely with [@Farhan-Faisal](https://github.com/Farhan-Faisal) on this PR. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-29 22:05:06 -05:00
Bagatur	14799b139a	infra[patch]: add base deps and fix docs lint (#13998 )	2023-11-28 17:27:37 -08:00
david qiu	9fb6805be4	langchain[minor]: Add retriever for Knowledge Bases for Amazon Bedrock (#13980 ) - Description: Adds a retriever implementation for [Knowledge Bases for Amazon Bedrock](https://aws.amazon.com/bedrock/knowledge-bases/), a new service announced at AWS re:Invent, shortly before this PR was opened. This depends on the `bedrock-agent-runtime` service, which will be included in a future version of `boto3` and of `botocore`. We will open a follow-up PR documenting the minimum required versions of `boto3` and `botocore` after that information is available. - Issue: N/A - Dependencies: `boto3>=1.33.2, botocore>=1.33.2` - Tag maintainer: @baskaryan - Twitter handles: `@pjain7` `@dead_letter_q` This PR includes a documentation notebook under `docs/docs/integrations/retrievers`, which I (@dlqqq) have verified independently. EDIT: `bedrock-agent-runtime` service is now included in `boto3>=1.33.2`: `5cf793f493` --------- Co-authored-by: Piyush Jain <piyushjain@duck.com> Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-28 14:10:23 -08:00
Yusuf Khan	935f78c944	FEATURE: Add retriever for Outline (#13889 ) - Description: Added a retriever for the Outline API to ask questions on knowledge base - Issue: resolves #11814 - Dependencies: None - Tag maintainer: @baskaryan	2023-11-26 18:56:12 -08:00
Taranjeet Singh	47451764a7	Add embedchain retriever (#13553 ) Description: This commit adds embedchain retriever along with tests and docs. Embedchain is a RAG framework to create data pipelines. Twitter handle: - [Taranjeet's twitter](https://twitter.com/taranjeetio) and [Embedchain's twitter](https://twitter.com/embedchain) Reviewer @hwchase17 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-11-19 17:35:03 -08:00
Leonid Ganeline	17c2007e0c	DOCS updated `Activeloop DeepMemory` notebook (#13428 ) - Fixed the title of the notebook. It created an ugly ToC element as `Activeloop DeepLake's DeepMemory + LangChain + ragas or how to get +27% on RAG recall.` - Added Activeloop description - improved consistency in text - fixed ToC (it was using HTML tagas that break left-side in-page ToC). Now in-page ToC works	2023-11-16 09:56:28 -08:00
Bagatur	3596be5210	DOCS: format notebooks (#13371 )	2023-11-14 14:17:44 -08:00
Predrag Gruevski	2ebd167dba	Lint Python notebooks with ruff. (#12677 ) The new ruff version fixed the blocking bugs, and I was able to fairly easily us to a passing state: ruff fixed some issues on its own, I fixed a handful by hand, and I added a list of narrowly-targeted exclusions for files that are currently failing ruff rules that we probably should look into eventually. I went pretty lenient on the docs / cookbooks rules, allowing dead code and such things. Perhaps in the future we may want to tighten the rules further, but this is already a good set of checks that found real issues and will prevent them going forward.	2023-11-14 15:58:22 -05:00
Andrew Zhou	1a1a1a883f	fleet_context docs update (#13221 ) - Description: Changed the fleet_context documentation to use `context.download_embeddings()` from the latest release from our package. More details here: https://github.com/fleet-ai/context/tree/main#api - Issue: n/a - Dependencies: n/a - Tag maintainer: @baskaryan - Twitter handle: @andrewthezhou	2023-11-10 14:53:57 -08:00
Bagatur	850336bcf1	Update model i/o docs (#13160 )	2023-11-09 20:35:55 -08:00
Holt Skinner	fceae456b9	fix: Updates to formatting in Google Drive Retriever docs (#13015 ) - Minor updates to formatting to make easier to read	2023-11-09 16:15:55 -08:00
Bagatur	8b2a82b5ce	Bagatur/docs smith context (#13139 )	2023-11-09 10:22:49 -08:00
Bagatur	1f27104626	Fleet context (#13038 ) cc @adrwz	2023-11-07 18:57:09 -08:00
Daniel Chalef	cc3d3920e3	Zep: Summary Search and Example (#12686 ) Zep now has the ability to search over chat history summaries. This PR adds support for doing so. More here: https://blog.getzep.com/zep-v0-17/ @baskaryan @eyurtsev	2023-11-02 16:31:11 -07:00
Daniel Chalef	d966e4d13a	zep: Update Zep docs and messaging (#12764 ) Update Zep documentation with messaging, more details. @baskaryan, @eyurtsev	2023-11-02 13:39:17 -07:00
Adilkhan Sarsen	6e702b9c36	Deep memory support in LangChain (#12268 ) - Description: adding support to Activeloop's DeepMemory feature that boosts recall up to 25%. Added Jupyter notebook showcasing the feature and also made index params explicit. - Twitter handle: will really appreciate if we could announce this on twitter. --------- Co-authored-by: adolkhan <adilkhan.sarsen@alumni.nu.edu.kz>	2023-10-30 12:16:14 -07:00
Bagatur	2424fff3f1	notebook fmt (#12498 )	2023-10-29 15:50:09 -07:00
Bagatur	87af2360df	mv old integration docs (#12217 )	2023-10-24 12:38:16 -07:00
Palau	720ecacb1c	Add notebook for kay.ai press release data (#11575 ) - Description: Adding a notebook for Press Release data from Kay.ai, as discussed offline - Tag maintainer: @baskaryan @hwchase17 - Twitter handle: https://twitter.com/kaydotai https://twitter.com/vishalrohra_ --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2023-10-19 08:06:56 -07:00
volodymyr-memsql	4adabd33ac	Add example of retriever usage with SingleStoreDB vector store (#12021 ) Added a notebook with examples of the creation of a retriever from the SingleStoreDB vector store, and further usage. Co-authored-by: Volodymyr Tkachuk <vtkachuk-ua@singlestore.com>	2023-10-19 09:48:35 -04:00
Joe McElroy	c9f1768cb9	Elasticsearch Query Retriever: Use match + fuzziness for LIKE (#12023 ) Updated the elasticsearch self query retriever to use the match clause for LIKE operator instead of the non-analyzed fuzzy search clause. Other small updates include: - fixing the stack inference integration test where the index's default pipeline didn't use the inference pipeline created - adding a user-agent to the old implementation to track usage - improved the documentation for ElasticsearchStore filters	2023-10-19 09:47:21 -04:00
Holt Skinner	2661dc94f3	feat: Google Vertex AI Search Retriever - Add support for Website Data Stores (#11736 ) - Only works for Data stores with Advanced Website Indexing - https://cloud.google.com/generative-ai-app-builder/docs/about-advanced-features - Minor restructuring - Follow up to #10513 - Remove outdated docs (readded in https://github.com/langchain-ai/langchain/pull/11620) - Move legacy class into new py file to clean up the directory - Shouldn't cause backwards compatibility issues as the import works the same way for users	2023-10-18 23:41:48 -07:00
Daniel Chalef	2beb767ae5	zep: Memory Retriever MMR Support & Docs Updates (#11954 ) - Update Zep Memory and Retriever docstrings - Zep Memory Retriever: Add support for native MMR - Add MMR example to existing ZepRetriever Notebook @baskaryan	2023-10-17 16:35:11 -07:00
刘方瑞	0a24ac7388	Revised notebook and add delete to MyScale vector store (#11848 ) - Description: - Add `.delete` to myscale vector store. - Revised vector store notebooks - Tag maintainer: @baskaryan - Twitter handle: @myscaledb @mpsk_liu	2023-10-17 11:42:21 -07:00
Leonid Kuligin	d269dd2e2f	added a multiturn search based on Vertex AI Search (#11885 ) Replace this entire comment with: - Description: Added a retriever based on multi-turn Vertex AI Search - Twitter handle: lkuligin	2023-10-16 17:05:12 -07:00
Bagatur	8e6fa5f1d7	mv self-query docs to integrations (#11744 )	2023-10-12 22:36:07 -07:00
Johnny Deuss	bb2ed4615c	Fix typos (#11663 )	2023-10-12 11:44:03 -04:00
Shreyas S	3cd0827785	Update kay.ipynb (#11676 ) Fixed title display	2023-10-11 14:02:11 -07:00
Bagatur	eedfddac2d	Restructure docs (#11620 )	2023-10-10 12:55:19 -07:00

48 Commits