langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-01-16 08:07:23 +00:00

Author	SHA1	Message	Date
Christophe Bornet	4915c3cd86	[Fix] Fix Cassandra Document loader default page content mapper (#16273 ) We can't use `json.dumps` by default as many types returned by the cassandra driver are not serializable. It's safer to use `str` and let users define their own custom `page_content_mapper` if needed.	2024-01-27 11:23:02 -08:00
Nuno Campos	e86fd946c8	In stream_event and stream_log handle closed streams (#16661 ) if eg. the stream iterator is interrupted then adding more events to the send_stream will raise an exception that we should catch (and handle where appropriate)	2024-01-27 08:09:29 -08:00
Nuno Campos	52ccae3fb1	Accept message-like things in Chat models, LLMs and MessagesPlaceholder (#16418 )	2024-01-26 15:44:28 -08:00
Pasha	4e189cd89a	community[patch]: youtube loader transcript format (#16625 ) - Description: YoutubeLoader right now returns one document that contains the entire transcript. I think it would be useful to add an option to return multiple documents, where each document would contain one line of transcript with the start time and duration in the metadata. For example, [AssemblyAIAudioTranscriptLoader](https://github.com/langchain-ai/langchain/blob/master/libs/community/langchain_community/document_loaders/assemblyai.py) is implemented in a similar way: it lets you choose which format the document loader uses.	2024-01-26 15:26:09 -08:00
yin1991	a936472512	docs: Update documentation to use 'model_id' rather than 'model_name' to match actual API (#16615 ) - Description: Replace 'model_name' with 'model_id' for accuracy - Issue: [link-to-issue](https://github.com/langchain-ai/langchain/issues/16577)	2024-01-26 15:01:12 -08:00
Micah Parker	6543e585a5	community[patch]: Added support for Ollama's num_predict option in ChatOllama (#16633 ) Just a simple default addition to the options payload for an Ollama generate call to support a max_new_tokens parameter. Should fix issue: https://github.com/langchain-ai/langchain/issues/14715	2024-01-26 15:00:19 -08:00
baichuan-assistant	70ff54eace	community[minor]: Add Baichuan Text Embedding Model and Baichuan Inc introduction (#16568 ) - Description: Adding Baichuan Text Embedding Model and Baichuan Inc introduction. Baichuan Text Embedding ranks #1 in C-MTEB leaderboard: https://huggingface.co/spaces/mteb/leaderboard Co-authored-by: BaiChuanHelper <wintergyc@WinterGYCs-MacBook-Pro.local>	2024-01-26 12:57:26 -08:00
Bagatur	5b5115c408	google-vertexai[patch]: streaming bug (#16603 ) Fixes errors seen here https://github.com/langchain-ai/langchain/actions/runs/7661680517/job/20881556592#step:9:229	2024-01-26 09:45:34 -08:00
ccurme	a989f82027	core: expand docstring for RunnableParallel (#16600 ) - Description: expand docstring for RunnableParallel - Issue: https://github.com/langchain-ai/langchain/issues/16462 Feel free to modify this or let me know how it can be improved!	2024-01-26 10:03:32 -05:00
Ghani	e30c6662df	Langchain-community : EdenAI chat integration. (#16377 ) - Description: This PR adds [EdenAI](https://edenai.co/) for the chat model (already available in LLM & Embeddings). It supports all [ChatModel] functionality: generate, async generate, stream, astream and batch. A detailed notebook was added. - Dependencies: No dependencies are added as we call a REST API. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-01-26 09:56:43 -05:00
Antonio Lanza	08d3fd7f2e	langchain[patch]: inconsistent results with `RecursiveCharacterTextSplitter`'s `add_start_index=True` (#16583 ) This PR fixes issue #16579	2024-01-25 15:50:06 -08:00
Eugene Yurtsev	42db96477f	docs: Update in code documentation for runnable with message history (#16585 ) Update the in code documentation for Runnable With Message History	2024-01-25 15:26:34 -08:00
Jatin Chawda	a79345f199	community[patch]: Fixed tool names snake_case (#16397 ) #16396 Fixed 1. golden_query 2. google_lens 3. memorize 4. merriam_webster 5. open_weather_map 6. pub_med 7. stack_exchange 8. generate_image 9. wikipedia	2024-01-25 15:24:19 -08:00
Bagatur	bcc71d1a57	openai[patch]: Release 0.0.5 (#16598 )	2024-01-25 15:20:28 -08:00
Bagatur	68f7468754	google-vertexai[patch]: Release 0.0.3 (#16597 )	2024-01-25 15:19:00 -08:00
Bagatur	61e876aad8	openai[patch]: Explicitly support embedding dimensions (#16596 )	2024-01-25 15:16:04 -08:00
Bagatur	5df8ab574e	infra: move indexing documentation test (#16595 )	2024-01-25 14:46:50 -08:00
Bagatur	f3d61a6e47	langchain[patch]: Release 0.1.4 (#16592 )	2024-01-25 14:19:18 -08:00
Bagatur	61b200947f	community[patch]: Release 0.0.16 (#16591 )	2024-01-25 14:19:09 -08:00
Bagatur	75ad0bba2d	openai[patch]: Release 0.0.4 (#16590 )	2024-01-25 14:08:46 -08:00
Bagatur	1e3ce338ca	core[patch]: Release 0.1.16 (#16589 )	2024-01-25 13:56:00 -08:00
Bagatur	6c89507988	docs: add rag citations page (#16549 )	2024-01-25 13:51:41 -08:00
Bagatur	31790d15ec	openai[patch]: accept function_call dict in bind_functions (#16483 ) Confusing that you can't pass in a dict	2024-01-25 13:47:44 -08:00
Bagatur	ef42d9d559	core[patch], community[patch], openai[patch]: consolidate openai tool… (#16485 ) … converters One way to convert anything to an OAI function: convert_to_openai_function One way to convert anything to an OAI tool: convert_to_openai_tool Corresponding bind functions on OAI models: bind_functions, bind_tools	2024-01-25 13:18:46 -08:00
Brian Burgin	148347e858	community[minor]: Add LiteLLM Router Integration (#15588 ) community: - Description: - Add new ChatLiteLLMRouter class that allows a client to use a LiteLLM Router as a LangChain chat model. - Note: The existing ChatLiteLLM integration did not cover the LiteLLM Router class. - Add tests and Jupyter notebook. - Issue: None - Dependencies: Relies on existing ChatLiteLLM integration - Twitter handle: @bburgin_0 --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-25 11:03:05 -08:00
JongRok BAEK	3b8eba32f9	anthropic[patch]: Fix message type lookup in Anthropic Partners (#16563 ) - Description: The message type lookup for Anthropic should map 'ai -> assistant', but it was reversed to 'assistant -> ai'. This produced the following error: ```python anthropic.BadRequestError: Error code: 400 - {'type': 'error', 'error': {'type': 'invalid_request_error', 'message': 'messages: Unexpected role "ai". Allowed roles are "user" or "assistant"'}} ``` [anthropic](`7177f3a71f/src/anthropic/types/beta/message_param.py (L13)`) - Issue: #16561 - Dependencies: None - Twitter handle: None	2024-01-25 09:17:59 -08:00
Dmitry Tyumentsev	e86e66bad7	community[patch]: YandexGPT models - add sleep_interval (#16566 ) Added sleep between requests to prevent errors associated with simultaneous requests.	2024-01-25 09:07:19 -08:00
Bagatur	e510cfaa23	core[patch]: passthrough BaseRetriever.invoke(**kwargs) (#16551 ) Fix for #16547	2024-01-25 08:58:39 -08:00
Anders Åhsman	355ef2a4a6	langchain[patch]: Fix doc-string grammar (#16543 ) - Description: Small grammar fix in docstring for class `BaseCombineDocumentsChain`.	2024-01-25 10:00:06 -05:00
Aditya	9dd7cbb447	google-genai: added logic for method get_num_tokens() (#16205 ) - Description: added logic for method get_num_tokens() for ChatGoogleGenerativeAI, GoogleGenerativeAI - Issue: https://github.com/langchain-ai/langchain/issues/16204 - Dependencies: None - Twitter handle: @Aditya_Rane --------- Co-authored-by: adityarane@google.com <adityarane@google.com> Co-authored-by: Leonid Kuligin <lkuligin@yandex.ru>	2024-01-24 21:43:16 -07:00
James Braza	0785432e7b	langchain-google-vertexai: preserving grounding metadata (#16309 ) Revival of https://github.com/langchain-ai/langchain/pull/14549 that closes https://github.com/langchain-ai/langchain/issues/14548.	2024-01-24 21:37:43 -07:00
Erick Friis	adc008407e	exa: init pkg (#16553 )	2024-01-24 20:57:17 -07:00
Rave Harpaz	c4e9c9ca29	community[minor]: Add OCI Generative AI integration (#16548 ) - Description: Adding Oracle Cloud Infrastructure Generative AI integration. Oracle Cloud Infrastructure (OCI) Generative AI is a fully managed service that provides a set of state-of-the-art, customizable large language models (LLMs) that cover a wide range of use cases, and which is available through a single API. Using the OCI Generative AI service you can access ready-to-use pretrained models, or create and host your own fine-tuned custom models based on your own data on dedicated AI clusters. https://docs.oracle.com/en-us/iaas/Content/generative-ai/home.htm - Issue: None - Dependencies: OCI Python SDK - Linting and testing (`make format`, `make lint` and `make test`): Passed - Tests: We provide unit tests. However, we cannot provide integration tests due to Oracle policies that prohibit public sharing of API keys. --------- Co-authored-by: Arthur Cheng <arthur.cheng@oracle.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 18:23:50 -08:00
Bagatur	c173a69908	langchain[patch]: oai tools output parser nit (#16540 ) allow positional init args	2024-01-24 16:57:16 -08:00
arnob-sengupta	f9976b9630	core[patch]: consolidate conditional in BaseTool (#16530 ) - Description: Refactor contradictory conditional to single line - Issue: #16528	2024-01-24 16:56:58 -08:00
Bagatur	5c2538b9f7	anthropic[patch]: allow pop by field name (#16544 ) allow `ChatAnthropicMessages(model=...)`	2024-01-24 15:48:31 -07:00
Harel Gal	a91181fe6d	community[minor]: add support for Guardrails for Amazon Bedrock (#15099 ) Added support for optionally supplying 'Guardrails for Amazon Bedrock' on both types of model invocations (batch/regular and streaming) and for all models supported by the Amazon Bedrock service. @baskaryan @hwchase17 ```python llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, guardrails={"id": " <guardrail_id>", "version": "<guardrail_version>", "trace": True}, callbacks=[BedrockAsyncCallbackHandler()]) class BedrockAsyncCallbackHandler(AsyncCallbackHandler): """Async callback handler that can be used to handle callbacks from langchain.""" async def on_llm_error( self, error: BaseException, **kwargs: Any, ) -> Any: reason = kwargs.get("reason") if reason == "GUARDRAIL_INTERVENED": # kwargs contains additional trace information sent by 'Guardrails for Bedrock' service. print(f"""Guardrails: {kwargs}""") # streaming llm = Bedrock(model_id="<model_id>", client=bedrock, model_kwargs={}, streaming=True, guardrails={"id": "<guardrail_id>", "version": "<guardrail_version>"}) ``` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:44:19 -08:00
Martin Kolb	04651f0248	community[minor]: VectorStore integration for SAP HANA Cloud Vector Engine (#16514 ) - Description: This PR adds a VectorStore integration for SAP HANA Cloud Vector Engine, which is an upcoming feature in the SAP HANA Cloud database (https://blogs.sap.com/2023/11/02/sap-hana-clouds-vector-engine-announcement/). - Issue: N/A - Dependencies: [SAP HANA Python Client](https://pypi.org/project/hdbcli/) - Twitter handle: @sapopensource Implementation of the integration: `libs/community/langchain_community/vectorstores/hanavector.py` Unit tests: `libs/community/tests/unit_tests/vectorstores/test_hanavector.py` Integration tests: `libs/community/tests/integration_tests/vectorstores/test_hanavector.py` Example notebook: `docs/docs/integrations/vectorstores/hanavector.ipynb` Access credentials for execution of the integration tests can be provided to the maintainers. --------- Co-authored-by: sascha <sascha.stoll@sap.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-24 14:05:07 -08:00
Leonid Kuligin	1113700b09	google-genai[patch]: better error message when location is not supported (#16535 ) - Description: a better error message when location is not supported	2024-01-24 13:58:46 -08:00
Unai Garay Maestre	fdbfa6b2c8	Adds progress bar to VertexAIEmbeddings (#14542 ) - Description: Adds progress bar to VertexAIEmbeddings - Issue: related issue https://github.com/langchain-ai/langchain/issues/13637 Signed-off-by: ugm2 <unaigaraymaestre@gmail.com> --------- Signed-off-by: ugm2 <unaigaraymaestre@gmail.com>	2024-01-24 11:16:16 -07:00
James Braza	643fb3ab50	langchain-google-vertexai[patch]: more verbose mypy config (#16307 ) Fleshing out the `mypy` config in `langchain-google-vertexai` to show error codes and other warnings. This PR also bumps `mypy` to above version 1's stable release.	2024-01-24 11:10:45 -07:00
Jeremi Joslin	9e95699277	community[patch]: Fix error message when litellm is not installed (#16316 ) The error message was mentioning the wrong package. I updated it to the correct one.	2024-01-23 21:42:29 -08:00
bachr	b3ed98dec0	community[patch]: avoid KeyError when language not in LANGUAGE_SEGMENTERS (#15212 ) Description: Handle unsupported languages in same way as when none is provided Issue: The following line will throw a KeyError if the language is not supported. ```python self.Segmenter = LANGUAGE_SEGMENTERS[language] ``` E.g. when using `Language.CPP` we would get `KeyError: <Language.CPP: 'cpp'>` --------- Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 21:09:43 -08:00
Nuno Campos	3f38e1a457	Remove double line (#16426 )	2024-01-23 20:22:37 -08:00
chyroc	61da2ff24c	community[patch]: use SecretStr for yandex model secrets (#15463 )	2024-01-23 20:08:53 -08:00
Alessio Serra	d628a80a5d	community[patch]: added 'conversational' as a valid task for huggingface endpoint models (#15761 ) - Description: added the conversational task to the Hugging Face endpoint in order to use models designed for chatbot programming. - Dependencies: None --------- Co-authored-by: Alessio Serra (ext.) <alessio.serra@partner.bmw.de> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Bagatur <baskaryan@gmail.com>	2024-01-23 20:04:15 -08:00
Karim Lalani	4c7755778d	community[patch]: SurrealDB fix for asyncio (#16092 ) Code fix for asyncio	2024-01-23 19:46:19 -08:00
Raunak	476bf8b763	community[patch]: Load list of files using UnstructuredFileLoader (#16216 ) - Description: Updated the `_get_elements()` function of the `UnstructuredFileLoader` class to check if the argument self.file_path is a file or list of files. If it is a list of files then it iterates over the list of file paths, calls the partition function for each one, and appends the results to the elements list. If self.file_path is not a list, it calls the partition function as before. - Issue: Fixed #15607 - Dependencies: NA - Twitter handle: NA Co-authored-by: H161961 <Raunak.Raunak@Honeywell.com>	2024-01-23 19:37:37 -08:00
Xudong Sun	019b6ebe8d	community[minor]: Add iFlyTek Spark LLM chat model support (#13389 ) - Description: This PR enables LangChain to access the iFlyTek's Spark LLM via the chat_models wrapper. - Dependencies: websocket-client ^1.6.1 - Tag maintainer: @baskaryan ### SparkLLM chat model usage Get SparkLLM's app_id, api_key and api_secret from [iFlyTek SparkLLM API Console](https://console.xfyun.cn/services/bm3) (for more info, see [iFlyTek SparkLLM Intro](https://xinghuo.xfyun.cn/sparkapi) ), then set environment variables `IFLYTEK_SPARK_APP_ID`, `IFLYTEK_SPARK_API_KEY` and `IFLYTEK_SPARK_API_SECRET` or pass parameters when using it like the demo below: ```python3 from langchain.chat_models.sparkllm import ChatSparkLLM client = ChatSparkLLM( spark_app_id="<app_id>", spark_api_key="<api_key>", spark_api_secret="<api_secret>" ) ```	2024-01-23 19:23:46 -08:00
Ali Zendegani	80fcc50c65	langchain[patch]: Minor Fix: Enable Passing custom_headers for Authentication in GraphQL Agent/Tool (#16413 ) - Description: This PR aims to enhance the `langchain` library by enabling the support for passing `custom_headers` in the `GraphQLAPIWrapper` usage within `langchain/agents/load_tools.py`. While the `GraphQLAPIWrapper` from the `langchain_community` module is inherently capable of handling `custom_headers`, its current invocation in `load_tools.py` does not facilitate this functionality. This limitation restricts the use of the `graphql` tool with databases or APIs that require token-based authentication. The absence of support for `custom_headers` in this context also leads to a lack of error messages when attempting to interact with secured GraphQL endpoints, making debugging and troubleshooting more challenging. This update modifies the `load_tools` function to correctly handle `custom_headers`, thereby allowing secure and authenticated access to GraphQL services requiring tokens. Example usage after the proposed change: ```python tools = load_tools( ["graphql"], graphql_endpoint="https://your-graphql-endpoint.com/graphql", custom_headers={"Authorization": f"Token {api_token}"}, ) ``` - Issue: None, - Dependencies: None, - Twitter handle: None	2024-01-23 19:19:53 -08:00
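
Note on commit 4915c3cd86 (Cassandra Document loader default page content mapper): the message says `json.dumps` is unsafe as a default because many driver-returned types are not serializable, while `str` always works. The sketch below illustrates that point using only the standard library. The `row` dict and the `custom_mapper` function are hypothetical stand-ins, not the loader's actual row type or its `page_content_mapper` signature.

```python
import json
import uuid
from datetime import datetime, timezone
from decimal import Decimal

# Hypothetical row: stands in for values a Cassandra driver might return.
row = {
    "id": uuid.UUID("12345678-1234-5678-1234-567812345678"),
    "price": Decimal("9.99"),
    "created": datetime(2024, 1, 27, tzinfo=timezone.utc),
}


def try_json(value):
    """Return the JSON text, or None if the value is not JSON-serializable."""
    try:
        return json.dumps(value)
    except TypeError:
        return None


# json.dumps rejects UUID, Decimal and datetime values out of the box.
json_result = try_json(row)

# str() works for any Python object, which makes it a safer default.
str_result = str(row)


def custom_mapper(r):
    """Hypothetical user-defined mapper: stringify each value, then dump JSON."""
    return json.dumps({key: str(value) for key, value in r.items()})


custom_result = custom_mapper(row)
```

A user who still wants JSON page content could supply a mapper along the lines of `custom_mapper`, deciding explicitly how each non-serializable type is converted.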

2656 Commits