langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-06-05 06:33:20 +00:00

Author	SHA1	Message	Date
Aditya	ee2c55ca09	docs: Added documentation on Anthropic models on vertex (#21070 ) Description:Added documentation on Anthropic models on Vertex @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-05-02 13:12:01 -04:00
Erick Friis	cd4c54282a	infra: cleanup docs build (#21134 ) Refactors the docs build in order to: - run the same `make build` command in both vercel and local build - incrementally build artifacts in 2 distinct steps, instead of building all docs in-place (in vercel) or in a _dist dir (locally) Highlights: - introduces `make build` in order to build the docs - collects and generates all files for the build in `docs/build/intermediate` - renders those jupyter notebook + markdown files into `docs/build/outputs` And now the outputs to host are in `docs/build/outputs`, which will need a vercel settings change. Todo: - [ ] figure out how to point the right directory (right now deleting and moving docs dir in vercel_build.sh isn't great)	2024-05-01 17:34:05 -07:00
Bagatur	8b4b75e543	docs: standardize vertexai params (#20167 ) Related to #20085 Requires https://github.com/langchain-ai/langchain-google/pull/121	2024-05-01 11:42:18 -04:00
Jacob Lee	bd38073d76	👥 Update LangChain people data (#21143 ) 👥 Update LangChain people data Co-authored-by: github-actions <github-actions@github.com>	2024-05-01 11:01:43 -04:00
East Agile	2a6f78a53f	community[minor]: Rememberizer retriever (#20052 ) Description: This pull request introduces a new feature for LangChain: the integration with the Rememberizer API through a custom retriever. This enables LangChain applications to allow users to load and sync their data from Dropbox, Google Drive, Slack, their hard drive into a vector database that LangChain can query. Queries involve sending text chunks generated within LangChain and retrieving a collection of semantically relevant user data for inclusion in LLM prompts. User knowledge dramatically improved AI applications. The Rememberizer integration will also allow users to access general purpose vectorized data such as Reddit channel discussions and US patents. Issue: N/A Dependencies: N/A Twitter handle: https://twitter.com/Rememberizer	2024-05-01 10:41:44 -04:00
Abhishek Bhagwat	86fe484e24	docs: Docs (sample notebook) for Vertex DIY RAG Ranking API (#21054 ) Vertex DIY RAG APIs helps to build complex RAG systems and provide more granular control, and are suited for custom use cases. The Ranking API takes in a list of documents and reranks those documents based on how relevant the documents are to a given query. Compared to embeddings that look purely at the semantic similarity of a document and a query, the ranking API can give you a more precise score for how well a document answers a given query. [Reference](https://cloud.google.com/generative-ai-app-builder/docs/ranking) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-05-01 05:39:39 +00:00
Ismail Hossain Polas	1fdf63fa6c	community[patch]: update package name to bagelML (#19948 ) Description This pull request updates the Bagel Network package name from "betabageldb" to "bagelML" to align with the latest changes made by the Bagel Network team. The following modifications have been made: - Updated all references to the old package name ("betabageldb") with the new package name ("bagelML") throughout the codebase. - Modified the documentation, and any relevant scripts to reflect the package name change. - Tested the changes to ensure that the functionality remains intact and no breaking changes were introduced. By merging this pull request, our project will stay up to date with the latest Bagel Network package naming convention, ensuring compatibility and smooth integration with their updated library. Please review the changes and provide any feedback or suggestions. Thank you!	2024-05-01 01:17:33 -04:00
Erick Friis	67e6744e0f	docs: fix some notebook formatting (#21136 )	2024-04-30 21:39:03 -07:00
Leonid Kuligin	a36935b520	docs: updated docs on langchain_google_community (#21064 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: updated docs on langchain_google_community" - [ ] PR message: - Description: updated docs on langchain_google_community	2024-04-30 20:20:49 -04:00
junkeon	8d2909ee25	upstage[minor]: Update few codes and add upstage loader in pdf section (#21085 ) Description: Update UpstageLayoutAnalysisParser and Loader and add upstage loader example in pdf section Dependencies: langchain_community Twitter handle: [@upstageai](https://twitter.com/upstageai) - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-30 20:15:49 -04:00
MacanPN	0f7f448603	community[patch]: add delete() method to AzureSearch vector store (#21127 ) Issue: Currently `AzureSearch` vector store does not implement `delete` method. This PR implements it. This also makes it compatible with LangChain indexer. Dependencies: None Twitter handle: @martintriska1 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:46:18 +00:00
Jorge Piedrahita Ortiz	3441a11b21	docs: minor changes in sambanova community integration docs (#21129 ) - Description: minor changes in sambanova community integration notebook docs --------- Co-authored-by: Renate Kempf <165940384+renate-snova@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 23:44:26 +00:00
Bagatur	6d3e9eaf84	docs: format (#21132 )	2024-04-30 23:32:41 +00:00
Christophe Bornet	d6e9bd3011	docs: Bump cassio min version in docs (#21081 ) Cassio 0.6+ is recommended for async vector store (not blocking on getting the embedding dimension) and for hybrid search support.	2024-04-30 10:25:37 -04:00
Kuro Denjiro	fa4124b821	community[minor]: add mintbase loader to langchain (#20089 ) - [x] Add Near NFT loader: "community: Load NFT near block chain using mintbase graph API" - [x] PR message: - Description: a description of the change - Twitter handle:Kurodenjiro --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-30 04:11:56 +00:00
Rahul Triptahi	c172611647	community[patch]: Add classifier_url argument in PebbloSafeLoader and documentation update. (#21030 ) Description: Add classifier_url argument in PebbloSafeLoader. Documentation: Updated PebbloSafeLoader documentation with above change and new links for pebblo github pages. --------- Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-29 17:41:09 -04:00
Rodrigo Nogueira	90f19028e5	community[patch]: Add maritalk streaming (sync and async) (#19203 ) Co-authored-by: RosevalJr <rdmalajr@gmail.com> Co-authored-by: Roseval Donisete Malaquias Junior <roseval@maritaca.ai>	2024-04-29 21:31:14 +00:00
Cahid Arda Öz	cc6191cb90	community[minor]: Add support for Upstash Vector (#20824 ) ## Description Adding `UpstashVectorStore` to utilize [Upstash Vector](https://upstash.com/docs/vector/overall/getstarted)! #17012 was opened to add Upstash Vector to langchain but was closed to wait for filtering. Now filtering is added to Upstash vector and we open a new PR. Additionally, [embedding feature](https://upstash.com/docs/vector/features/embeddingmodels) was added and we add this to our vectorstore aswell. ## Dependencies [upstash-vector](https://pypi.org/project/upstash-vector/) should be installed to use `UpstashVectorStore`. Didn't update dependencies because of [this comment in the previous PR](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1876522450). ## Tests Tests are added and they pass. Tests are naturally network bound since Upstash Vector is offered through an API. There was [a discussion in the previous PR about mocking the unittests](https://github.com/langchain-ai/langchain/pull/17012#pullrequestreview-1891820567). We didn't make changes to this end yet. We can update the tests if you can explain how the tests should be mocked. --------- Co-authored-by: ytkimirti <yusuftaha9@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 17:25:01 -04:00
chyroc	3e241956d3	community[minor]: add coze chat model (#20770 ) add coze chat model, to call coze.com apis	2024-04-29 12:26:16 -04:00
Patrick McFadin	3331865f6b	community[minor]: add Cassandra Database Toolkit (#20246 ) Description: ToolKit and Tools for accessing data in a Cassandra Database primarily for Agent integration. Initially, this includes the following tools: - `cassandra_db_schema` Gathers all schema information for the connected database or a specific schema. Critical for the agent when determining actions. - `cassandra_db_select_table_data` Selects data from a specific keyspace and table. The agent can pass paramaters for a predicate and limits on the number of returned records. - `cassandra_db_query` Expiriemental alternative to `cassandra_db_select_table_data` which takes a query string completely formed by the agent instead of parameters. May be removed in future versions. Includes unit test and two notebooks to demonstrate usage. Dependencies: cassio Twitter handle: @PatrickMcFadin --------- Co-authored-by: Phil Miesle <phil.miesle@datastax.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 15:51:43 +00:00
Igor Brai	b3e74f2b98	community[minor]: add mojeek search util (#20922 ) Description: This pull request introduces a new feature to community tools, enhancing its search capabilities by integrating the Mojeek search engine Dependencies: None --------- Co-authored-by: Igor Brai <igor@mojeek.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-04-29 15:49:53 +00:00
Rahul Triptahi	a64a1943fd	docs: Document update for load_extended_matadata in GoogleDriveLoader (#20950 ) Document: Updated google_drive,ipynb for loading following extended metadata. - full_path - Full path of the file/s in google drive. - owner - owner of the file/s. - size - size of the file/s. Code changes: [langchain-google/pull/179.](https://github.com/langchain-ai/langchain-google/pull/179) Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-29 11:41:57 -04:00
Tomaz Bratanic	67428c4052	community[patch]: Neo4j enhanced schema (#20983 ) Scan the database for example values and provide them to an LLM for better inference of Text2cypher	2024-04-29 10:45:55 -04:00
Leonid Kuligin	dc70c23a11	docs: switched GCSLoaders docs to langchain-google-community (#20985 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: switched GCSLoaders docs to langchain-google-community" - [ ] PR message: *Delete this entire checklist* and replace with - Description: switched GCSLoaders docs to langchain-google-community	2024-04-29 10:45:11 -04:00
Tomaz Bratanic	d36332476c	docs: Add neo4j relationship vector index docs (#20990 ) Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 14:36:47 +00:00
Karim Lalani	2ddac9a7c3	experimental[minor]: Add bind_tools and with_structured_output functions to OllamaFunctions (#20881 ) Implemented bind_tools for OllamaFunctions. Made OllamaFunctions sub class of ChatOllama. Implemented with_structured_output for OllamaFunctions. integration unit test has been updated. notebook has been updated. --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-29 14:13:33 +00:00
Vadym Barda	5e0b6b3e75	docs: update langserve link in LCEL docs (#20992 )	2024-04-29 09:06:10 -04:00
Aditya	07ce39bfe7	docs: updated tutorials for Image generation and Vector Search (#21000 ) Description: docs: updated tutorials for Image generation and Vector Search @lkuligin for review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-04-29 09:04:11 -04:00
Aditya	17bbb7d2a5	docs: updated tutorial for Gemini versions, included safety attribute updates (#21006 ) Description:updated tutorial for Gemini versions, included safety attribute updates @lkuligin For review --------- Co-authored-by: adityarane@google.com <adityarane@google.com>	2024-04-29 09:01:54 -04:00
WilliamEspegren	804390ba4b	community: Spider integration (#20937 ) Added the [Spider.cloud](https://spider.cloud) document loader. [Spider](https://github.com/spider-rs/spider) is the [fastest](https://github.com/spider-rs/spider/blob/main/benches/BENCHMARKS.md) and cheapest crawler that returns LLM-ready data. ``` - Description: Adds Spider data loader - Dependencies: spider-client - Twitter handle: @WilliamEspegren ``` --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: = <=> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-27 21:45:03 +00:00
Jamie Lemon	6342217b93	docs: Moves "Using PyMuPDF" to higher up the page. (#20832 ) Description: This PR moves the PyMuPDF PDF loader solution to be underneath PyPDF. This is because it is the the 2nd most popular PyPI package after PyPDF. Please refer to these numbers, at the time of writing as follows: PyPDF https://www.pepy.tech/projects/PyPDF2 160 million PyMuPDF https://www.pepy.tech/projects/pymupdf 60 million PDFPlumber https://www.pepy.tech/projects/pdfplumber 23 million PDFMiner https://www.pepy.tech/projects/pdfminer 16 million PyPDFium2 https://www.pepy.tech/projects/pypdfium2 8 million Unstructured https://www.pepy.tech/projects/unstructured 8 million Please note I am an active contributor to https://github.com/pymupdf/PyMuPDF Many thanks! ---- Twitter handle: @artifex	2024-04-27 20:40:20 +00:00
Chouaieb Nemri	8097bec472	Added LogEntry, Any, Dict, List, Optional, TypedDict imports (#20970 ) Thank you for contributing to LangChain! - [ ] PR title: "package: docs" - [ ] PR message: - Description: Uptaded docs: Rag streaming use-cases notebook with LogEntry, Any, Dict, List, Optional, TypedDict imports - Twitter handle: c_nemri --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-04-27 20:13:54 +00:00
CT	0e917e319b	docs: Add langchainhub to pip install (#20185 ) Added langchainhub package in import statement which is required for "from langchain import hub" to work. Added sample code to add OpenAI key Co-authored-by: Chi Yan Tang <100466443+poochiekittie@users.noreply.github.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 02:21:40 +00:00
Chandre Van Der Westhuizen	e57cf73cf5	docs: Added MindsDB provider (#20322 ) MindsDB integrates with LangChain, enabling users to deploy, serve, and fine-tune models available via LangChain within MindsDB, making them accessible to numerous data sources. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:36:08 +00:00
Jorge Piedrahita Ortiz	40b2e2916b	community[minor]: Sambanova llm integration (#20955 ) - Description: Added [Sambanova systems](https://sambanova.ai/) integration, including sambaverse and sambastudio LLMs - Dependencies: sseclient-py (optional) --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 01:05:13 +00:00
Amine Djeghri	790ea75cf7	community[minor]: add exllamav2 library for GPTQ & EXL2 models (#17817 ) Added 3 files : - Library : ExLlamaV2 - Test integration - Notebook --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-27 00:44:43 +00:00
Naveen Tatikonda	8bbdb4f6a0	community[patch]: Add OpenSearch as semantic cache (#20254 ) ### Description Use OpenSearch vector store as Semantic Cache. ### Twitter Handle @OpenSearchProj --------- Signed-off-by: Naveen Tatikonda <navtat@amazon.com> Co-authored-by: Harish Tatikonda <harishtatikonda@Harishs-MacBook-Air.local> Co-authored-by: EC2 Default User <ec2-user@ip-172-31-31-155.ec2.internal> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-27 00:20:24 +00:00
Giacomo Berardi	61f14f00d7	docs: `ElasticsearchCache` in cache integrations documentation (#20790 ) The package for LangChain integrations with Elasticsearch https://github.com/langchain-ai/langchain-elastic is going to contain a LLM cache integration in the next release (see https://github.com/langchain-ai/langchain-elastic/pull/14). This is the documentation contribution on the page dedicated to cache integrations	2024-04-26 15:43:58 -07:00
Leonid Kuligin	893a924b90	core[minor], community[patch], langchain[patch]: move BaseChatLoader to core (#19607 ) Thank you for contributing to LangChain! - [ ] PR title: "core: move BaseChatLoader and BaseToolkit from community" - [ ] PR message: move BaseChatLoader and BaseToolkit --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-26 21:45:51 +00:00
Leonid Kuligin	d4aec8fc8f	docs: adding langchain_google_community to the docs (#20665 ) Thank you for contributing to LangChain! - [ ] PR title: "docs: step1. adjusting langchain_community -> langchain_google_community" - [ ] - Description: step1. adjusting langchain_community -> langchain_google_community	2024-04-26 18:49:03 +00:00
Sean	e1c2e2fdfa	upstage: Upstage Groundedness Check parameter update (#20914 ) * Groundedness Check takes `str` or `list[Document]` as input. * Deprecate `GroundednessCheck` due to its naming. * Added `UpstageGroundednessCheck`. * Hotfix for Groundedness Check parameter. The name `query` was misleading and it should be `answer` instead. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-04-26 17:34:05 +00:00
Pengcheng Liu	d95e9fb67f	docs: add tool calling example in Tongyi chat model integration. (#20925 ) Description: add tool calling example in Tongyi chat model integration. Issue: None Dependencies: None	2024-04-26 10:18:54 -04:00
am-kinetica	b54b19ba1c	community[minor]: Implemented Kinetica Document Loader and added notebooks (#20002 ) - [ ] Kinetica Document Loader: "community: a class to load Documents from Kinetica" - [ ] Kinetica Document Loader: - Description: implemented KineticaLoader in `kinetica_loader.py` - Dependencies: install the Kinetica API using `pip install gpudb==7.2.0.1 `	2024-04-25 13:39:00 -07:00
Shengsheng Huang	fd1061e7bf	community[patch]: add more data types support to ipex-llm llm integration (#20833 ) - Description: - add support for more data types: by default `IpexLLM` will load the model in int4 format. This PR adds more data types support such as `sym_in5`, `sym_int8`, etc. Data formats like NF3, NF4, FP4 and FP8 are only supported on GPU and will be added in future PR. - Fix a small issue in saving/loading, update api docs - Dependencies: `ipex-llm` library - Document: In `docs/docs/integrations/llms/ipex_llm.ipynb`, added instructions for saving/loading low-bit model. - Tests: added new test cases to `libs/community/tests/integration_tests/llms/test_ipex_llm.py`, added config params. - Contribution maintainer: @shane-huang	2024-04-25 12:58:18 -07:00
Rahul Triptahi	dc921f0823	community[patch]: Add semantic info to metadata, classified by pebblo-server. (#20468 ) Description: Add support for Semantic topics and entities. Classification done by pebblo-server is not used to enhance metadata of Documents loaded by document loaders. Dependencies: None Documentation: Updated. Signed-off-by: Rahul Tripathi <rauhl.psit.ec@gmail.com> Co-authored-by: Rahul Tripathi <rauhl.psit.ec@gmail.com>	2024-04-25 12:55:33 -07:00
Jingpan Xiong	1202017c56	community[minor]: Add relyt vector database (#20316 ) Co-authored-by: kaka <kaka@zbyte-inc.cloud> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: jingsi <jingsi@leadincloud.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-04-25 19:49:29 +00:00
ccurme	6986e44959	docs: update chat model feature table (#20899 )	2024-04-25 15:05:43 -04:00
merdan	52896258ee	docs: hide model import in multiple_tools.ipynb (#20883 ) Description: This PR removes an unnecessary code snippet from the documentation. The snippet in question is not relevant to the content and does not contribute to the overall understanding of the topic. It contained redundant imports and unused code, potentially causing confusion for readers. Issue: There is no specific issue number associated with this change. Dependencies: No additional dependencies are required for this change. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-04-25 18:47:22 +00:00
samanhappy	37cbbc00a9	docs: Fix broken link in agents.ipynb (#20872 )	2024-04-25 10:42:06 -07:00
fzowl	a6b8ff23bd	docs: Use voyage-law-2 in the examples (#20784 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: In VoyageAI text-embedding examples use voyage-law-2 model - [x] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-04-25 10:41:36 -07:00

1 2 3 4 5 ...

3560 Commits