langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-07-05 20:58:25 +00:00

Author	SHA1	Message	Date
Andreas Motl	6352edf77f	docs: CrateDB: Register package `langchain-cratedb`, and add minimal "provider" documentation (#28877 ) Hi Erick. Coming back from a previous attempt, we now made a separate package for the CrateDB adapter, called `langchain-cratedb`, as advised. Other than registering the package within `libs/packages.yml`, this patch includes a minimal amount of documentation to accompany the advent of this new package. Let us know about any mistakes we made, or changes you would like to see. Thanks, Andreas. ## About - Description: Register a new database adapter package, `langchain-cratedb`, providing traditional vector store, document loader, and chat message history features for a start. - Addressed to: @efriis, @eyurtsev - References: GH-27710 - Preview: [Providers » More » CrateDB](https://langchain-git-fork-crate-workbench-register-la-4bf945-langchain.vercel.app/docs/integrations/providers/cratedb/) ## Status - PyPI: https://pypi.org/project/langchain-cratedb/ - GitHub: https://github.com/crate/langchain-cratedb - Documentation (CrateDB): https://cratedb.com/docs/guide/integrate/langchain/ - Documentation (LangChain): _This PR._ ## Backlog? Is this applicable for this kind of patch? > - [ ] Add tests and docs: If you're adding a new integration, please include > 1. a test for the integration, preferably unit tests that do not rely on network access, > 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. ## Q&A 1. Notebooks that use the LangChain CrateDB adapter are currently at [CrateDB LangChain Examples](https://github.com/crate/cratedb-examples/tree/main/topic/machine-learning/llm-langchain), and the documentation refers to them. Because they are derived from very old blueprints coming from LangChain 0.0.x times, we guess they need a refresh before adding them to `docs/docs/integrations`. Is it applicable to merge this minimal package registration + documentation patch, which already includes valid code snippets in `cratedb.mdx`, and add corresponding notebooks on behalf of a subsequent patch later? 2. How would it work getting into the tabular list of _Integration Packages_ enumerated on the [documentation entrypoint page about Providers](https://python.langchain.com/docs/integrations/providers/)? /cc Please also review, @ckurze, @wierdvanderhaar, @kneth, @simonprickett, if you can find the time. Thanks!	2024-12-23 10:55:44 -05:00
Wang Ran (汪然)	e5c9da3eb6	core[patch]: remove redundant imports (#28861 ) `Graph` has been imported at Line: 62	2024-12-23 10:31:23 -05:00
Adrián Panella	8d9907088b	community(azuresearch): allow to use any valid credential (#28873 ) Add option to use any valid credential type. Differentiates async cases needed by Azure Search. This could replace the use of a static token	2024-12-23 10:05:48 -05:00
ZhangShenao	4b4d09f82b	[Doc] Improvement: Fix docs of `ChatMLX` (#28884 ) - `ChatMLX` doesn't supports the role of system. - Fix https://github.com/langchain-ai/langchain/issues/28532 #28532	2024-12-23 09:51:44 -05:00
Mohammad Mohtashim	41b6a86bbe	Community: LlamaCppEmbeddings `embed_documents` and `embed_query` (#28827 ) - Description: `embed_documents` and `embed_query` was throwing off the error as stated in the issue. The issue was that `Llama` client is returning the embeddings in a nested list which is not being accounted for in the current implementation and therefore the stated error is being raised. - Issue: #28813 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-23 09:50:22 -05:00
Darien Schettler	32917a0b98	Update dataframe.py (#28871 ) community: optimize DataFrame document loader Description: Simplify the `lazy_load` method in the DataFrame document loader by combining text extraction and metadata cleanup into a single operation. This makes the code more concise while maintaining the same functionality. Issue: N/A Dependencies: None Twitter handle: N/A	2024-12-22 19:16:16 -05:00
Erick Friis	cb4e6ac941	docs: frontmatter gen, colab/github links (#28852 )	2024-12-21 17:38:31 +00:00
Mikhail Khludnev	2a7469e619	add langchain-localai link to Providers page localai.mdx (#28855 ) follow up #28751 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-20 22:02:32 +00:00
yeounhak	f38fc89f35	community: Corrected aload func to be asynchronous from webBaseLoader (#28337 ) - Description: The aload function, contrary to its name, is not an asynchronous function, so it cannot work concurrently with other asynchronous functions. - Issue: #28336 - Test: : Done - Docs: [here](`e0a95e5646/docs/docs/integrations/document_loaders/web_base.ipynb (L201)`) - Lint: All checks passed If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-20 14:42:52 -05:00
Ahmad Elmalah	a08c76a6b2	Docs: Add langgraph to installation section for Rag tutorial (#28849 ) Issue: This tutorial depends on langgraph, however Langgraph is not mentioned on the installation section for the tutorial, which raises an error when copying and pasting the code snippets as following: ![image](https://github.com/user-attachments/assets/829c9118-fcf8-4f17-9abb-32e005ebae07) Solution: Just adding langgraph package to installation section, for both pip and Conda tabs as this tutorial requires it.	2024-12-20 12:08:19 -05:00
Mohammad Mohtashim	8cf5f20bb5	`required` tool_choice added for ChatHuggingFace (#28851 ) - Description: HuggingFace Inference Client V3 now supports `required` as tool_choice which has been added. - Issue: #28842	2024-12-20 12:06:04 -05:00
Sylvain DEPARTE	fcba567a77	partners: allow to set Prefix in AIMessage (for MistralAI) (#28846 ) Description: Added ability to set `prefix` attribute to prevent error : ``` httpx.HTTPStatusError: Error response 400 while fetching https://api.mistral.ai/v1/chat/completions: {"object":"error","message":"Expected last role User or Tool (or Assistant with prefix True) for serving but got assistant","type":"invalid_request_error","param":null,"code":null} ``` Co-authored-by: Sylvain DEPARTE <sylvain.departe@wizbii.com>	2024-12-20 11:09:45 -05:00
Jacob Mansdorfer	6d81137325	community: adding langchain-predictionguard partner package documentation (#28832 ) - [x] PR title: "community: adding langchain-predictionguard partner package documentation" - [x] PR message: - Description: This PR adds documentation for the langchain-predictionguard package to main langchain repo, along with deprecating current Prediction Guard LLMs package. The LLMs package was previously broken, so I also updated it one final time to allow it to continue working from this point onward. . This enables users to chat with LLMs through the Prediction Guard ecosystem. - Package Links: - [PyPI](https://pypi.org/project/langchain-predictionguard/) - [Github Repo](https://www.github.com/predictionguard/langchain-predictionguard) - Issue: None - Dependencies: None - Twitter handle: [@predictionguard](https://x.com/predictionguard) - [x] Add tests and docs: All docs have been added for the partner package, and the current LLMs package test was updated to reflect changes. - [x] Lint and test: Linting tests are all passing. --------- Co-authored-by: ccurme <chester.curme@gmail.com>	2024-12-20 10:51:44 -05:00
Leonid Ganeline	5135bf1002	docs: integrations google packages (#28840 ) Issue: several Google integrations are implemented on the [github.com/googleapis](https://github.com/googleapis) organization repos and these integrations are almost lost. But they are essential integrations. Change: added a list of all packages that have Google integrations. Added a description of this situation. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-12-20 09:46:06 -05:00
Barry McCardel	5a351a133c	fix tiny 'lil typo in tutorial page (#28839 ) ez pz	2024-12-19 16:33:59 -08:00
ccurme	f0e858b4e3	core[patch]: release 0.3.28 (#28837 )	2024-12-19 17:52:32 -05:00
ccurme	137d1e9564	langchain[patch]: fix test following update to langchain-openai (#28838 )	2024-12-19 22:39:48 +00:00
Emmanuel Leroy	c8db5a19ce	langchain_community.chat_models.oci_generative_ai: Fix a bug when using optional parameters in tools (#28829 ) When using tools with optional parameters, the parameter `type` is not longer available since langchain update to 0.3 (because of the pydantic upgrade?) and there is now an `anyOf` field instead. This results in the `type` being `None` in the chat request for the tool parameter, and the LLM call fails with the error: ``` oci.exceptions.ServiceError: {'target_service': 'generative_ai_inference', 'status': 400, 'code': '400', 'opc-request-id': '...', 'message': 'Parameter definition must have a type.', 'operation_name': 'chat' ... } ``` Example code that fails: ``` from langchain_community.chat_models.oci_generative_ai import ChatOCIGenAI from langchain_core.tools import tool from typing import Optional llm = ChatOCIGenAI( model_id="cohere.command-r-plus", service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com", compartment_id="ocid1.compartment.oc1...", auth_profile="your_profile", auth_type="API_KEY", model_kwargs={"temperature": 0, "max_tokens": 3000}, ) @tool def test(example: Optional[str] = None): """This is the tool to use to test things Args: example: example variable, defaults to None """ return "this is a test" llm_with_tools = llm.bind_tools([test]) result = llm_with_tools.invoke("can you make a test for g") ``` This PR sets the param type to `any` in that case, and fixes the problem. Co-authored-by: Erick Friis <erick@langchain.dev>	2024-12-19 22:17:34 +00:00
Bagatur	c3ccd93c12	patch openai json mode test (#28831 )	2024-12-19 21:43:32 +00:00
Bagatur	ce6748dbfe	xfail openai image token count test (#28828 )	2024-12-19 21:23:30 +00:00
Anusha Karkhanis	26bdf40072	Langchain_Community: SQL LanguageParser (#28430 ) ## Description (This PR has contributions from @khushiDesai, @ashvini8, and @ssumaiyaahmed). This PR addresses Issue #11229 which addresses the need for SQL support in document parsing. This is integrated into the generic TreeSitter parsing library, allowing LangChain users to easily load codebases in SQL into smaller, manageable "documents." This pull request adds a new ```SQLSegmenter``` class, which provides the SQL integration. ## Issue Issue #11229: Add support for a variety of languages to LanguageParser ## Testing We created a file ```test_sql.py``` with several tests to ensure the ```SQLSegmenter``` is functional. Below are the tests we added: - ```def test_is_valid```: Checks SQL validity. - ```def test_extract_functions_classes```: Extracts individual SQL statements. - ```def test_simplify_code```: Simplifies SQL code with comments. --------- Co-authored-by: Syeda Sumaiya Ahmed <114104419+ssumaiyaahmed@users.noreply.github.com> Co-authored-by: ashvini hunagund <97271381+ashvini8@users.noreply.github.com> Co-authored-by: Khushi Desai <khushi.desai@advantawitty.com> Co-authored-by: Khushi Desai <59741309+khushiDesai@users.noreply.github.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-12-19 20:30:57 +00:00
Bagatur	a7f2148061	openai[patch]: Release 0.2.14 (#28826 )	2024-12-19 11:56:44 -08:00
Bagatur	1378ddfa5f	openai[patch]: type reasoning_effort (#28825 )	2024-12-19 19:36:49 +00:00
Erick Friis	6a37899b39	core: dont mutate tool_kwargs during tool run (#28824 ) fixes https://github.com/langchain-ai/langchain/issues/24621	2024-12-19 18:11:56 +00:00
Qun	033ac41760	fix crash when using create_xml_agent with parameterless function as … (#26002 ) When using `create_xml_agent` or `create_json_chat_agent` to create a agent, and the function corresponding to the tool is a parameterless function, the `XMLAgentOutputParser` or `JSONAgentOutputParser` will parse the tool input into an empty string, `BaseTool` will parse it into a positional argument. So, the program will crash finally because we invoke a parameterless function but with a positional argument.Specially, below code will raise StopIteration in [_parse_input](https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/tools/base.py#L419) ```python from langchain import hub from langchain.agents import AgentExecutor, create_json_chat_agent, create_xml_agent from langchain_openai import ChatOpenAI prompt = hub.pull("hwchase17/react-chat-json") llm = ChatOpenAI() # agent = create_xml_agent(llm, tools, prompt) agent = create_json_chat_agent(llm, tools, prompt) agent_executor = AgentExecutor(agent=agent, tools=tools, verbose=True) agent_executor.invoke(......) ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-19 13:00:46 -05:00
Luke	f69695069d	text_splitters: Add HTMLSemanticPreservingSplitter (#25911 ) Description: With current HTML splitters, they rely on secondary use of the `RecursiveCharacterSplitter` to further chunk the document into manageable chunks. The issue with this is it fails to maintain important structures such as tables, lists, etc within HTML. This Implementation of a HTML splitter, allows the user to define a maximum chunk size, HTML elements to preserve in full, options to preserve `<a>` href links in the output and custom handlers. The core splitting begins with headers, similar to `HTMLHeaderSplitter`. If these sections exceed the length of the `max_chunk_size` further recursive splitting is triggered. During this splitting, elements listed to preserve, will be excluded from the splitting process. This can cause chunks to be slightly larger then the max size, depending on preserved length. However, all contextual relevance of the preserved item remains intact. Custom Handlers: Sometimes, companies such as Atlassian have custom HTML elements, that are not parsed by default with `BeautifulSoup`. Custom handlers allows a user to provide a function to be ran whenever a specific html tag is encountered. This allows the user to preserve and gather information within custom html tags that `bs4` will potentially miss during extraction. Dependencies: User will need to install `bs4` in their project to utilise this class I have also added in `how_to` and unit tests, which require `bs4` to run, otherwise they will be skipped. Flowchart of process: ![HTMLSemanticPreservingSplitter](https://github.com/user-attachments/assets/20873c36-22ed-4c80-884b-d3c6f433f5a7) --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-19 12:09:22 -05:00
Tommaso De Lorenzo	24bfa062bf	langchain: add support for Google Anthropic Vertex AI model garden provider in init_chat_model (#28177 ) Simple modification to add support for anthropic models deployed in Google Vertex AI model garden in `init_chat_model` importing `ChatAnthropicVertex` - [v] Lint and test	2024-12-19 12:06:21 -05:00
Erick Friis	ff7b01af88	anthropic: less pydantic for client (#28823 )	2024-12-19 08:00:02 -08:00
Erick Friis	f1d783748a	anthropic: sdk bump (#28820 )	2024-12-19 15:39:21 +00:00
Erick Friis	907f36a6e9	fireworks: fix lint (#28821 )	2024-12-19 15:36:36 +00:00
Erick Friis	6526db4871	community: bump core (#28819 )	2024-12-19 06:41:53 -08:00
Vignesh A	4c9acdfbf1	Community : Add OpenAI prompt caching and reasoning tokens tracking (#27135 ) Added Token tracking for OpenAI's prompt caching and reasoning tokens Costs updated from https://openai.com/api/pricing/ usage example ```python from langchain_community.callbacks import get_openai_callback from langchain_openai import ChatOpenAI llm = ChatOpenAI(model_name="o1-mini",temperature=1) with get_openai_callback() as cb: response = llm.invoke("hi "*1500) print(cb) ``` Output ``` Tokens Used: 1720 Prompt Tokens: 1508 Prompt Tokens Cached: 1408 Completion Tokens: 212 Reasoning Tokens: 192 Successful Requests: 1 Total Cost (USD): $0.0049559999999999995 ``` --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-19 09:31:13 -05:00
ScriptShi	97f1e1d39f	community: tablestore vector store check the dimension of the embedding when writing it to store. (#28812 ) Added some restrictions to a vectorstore I released in the community before.	2024-12-19 09:30:43 -05:00
fzowl	024f020f04	docs: Adding VoyageAI to 'integrations/text_embedding/' dropdown (#28817 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - Where "package" is whichever of langchain, community, core, etc. is being modified. Use "docs: ..." for purely docs changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" Description: Adding VoyageAI's text_embedding to 'integrations/text_embedding/' - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.	2024-12-19 09:29:30 -05:00
Wang Ran (汪然)	f48755d35b	core: typo `Utilities for tests.` -> `Utilities for pydantic.` (#28814 ) Description: typo	2024-12-19 09:26:17 -05:00
Wang Ran (汪然)	51b8ddaf10	core: typo in runnable (#28815 ) Thank you for contributing to LangChain! Description: Typo	2024-12-19 09:25:57 -05:00
Leonid Ganeline	c823cc532d	docs: integration providers index update (#28808 ) Issue: integrations related to a provider can be spread across several packages and classes. It is very hard to find a provider using only ToCs. Fix: we have a very useful and helpful tool to search by provider name. It is the `Search` field. So, I've added recommendations for using this field. It seems obvious but it is not. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-12-19 02:28:37 +00:00
wangda	24c4af62b0	docs:Correcting spelling mistakes (#28780 )	2024-12-18 21:11:50 -05:00
Erick Friis	3b036a1cf2	partners/fireworks: release 0.2.6 (#28805 )	2024-12-18 22:48:35 +00:00
Erick Friis	4eb8bf7793	partners/anthropic: release 0.3.1 (#28801 )	2024-12-18 22:45:38 +00:00
Lu Peng	50afa7c4e7	community: add new parameter default_headers (#28700 ) Thank you for contributing to LangChain! - [x] PR title: "package: description" - "community: 1. add new parameter `default_headers` for oci model deployments and oci chat model deployments. 2. updated k parameter in OCIModelDeploymentLLM class." - [x] PR message: - Description: 1. add new parameters `default_headers` for oci model deployments and oci chat model deployments. 2. updated k parameter in OCIModelDeploymentLLM class. - [x] Add tests and docs: 1. unit tests 2. notebook --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-12-18 22:33:23 +00:00
Christophe Bornet	1e88adaca7	all: Add pre-commit hook (#26993 ) This calls `make format` on projects that have modified files. So `poetry install --with lint` must have been done for those projects. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-12-18 22:22:58 +00:00
Erick Friis	cc616de509	partners/xai: release 0.1.1 (#28806 )	2024-12-18 22:15:24 +00:00
Erick Friis	ba8c1b0d8c	partners/groq: release 0.2.2 (#28804 )	2024-12-18 22:12:02 +00:00
Erick Friis	a119cae5bd	partners/mistralai: release 0.2.4 (#28803 )	2024-12-18 22:11:48 +00:00
Erick Friis	514d78516b	partners/ollama: release 0.2.2 (#28802 )	2024-12-18 22:11:08 +00:00
Bagatur	68940dd0d6	openai[patch]: Release 0.2.13 (#28800 )	2024-12-18 22:08:47 +00:00
Erick Friis	4dc28b43ac	community: release 0.3.13 (#28798 )	2024-12-18 21:58:46 +00:00
Bagatur	557f63c2e6	core[patch]: Release 0.3.27 (#28799 )	2024-12-18 21:58:03 +00:00
Bagatur	4a531437bb	core[patch], openai[patch]: Handle OpenAI developer msg (#28794 ) - Convert developer openai messages to SystemMessage - store additional_kwargs={"__openai_role__": "developer"} so that the correct role can be reconstructed if needed - update ChatOpenAI to read in openai_role --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-12-18 21:54:07 +00:00

1 2 3 4 5 ...

12416 Commits