langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-02-21 14:43:07 +00:00

Author	SHA1	Message	Date
Tomaz Bratanic	eac3f09430	community[patch]: Better error message for neo4j vector when text is null (#21861 )	2024-06-20 13:52:15 -07:00
Stefano Lottini	a2294da9c1	cli[minor]: fix import path for two Astra DB classes in the migration json data (#21926 ) This PR fixes two mistakes in the import paths from community for the json data aiding the cli migration to 0.2. It is intended as a quick follow-up to https://github.com/langchain-ai/langchain/pull/21913 . @nicoloboschi FYI	2024-06-20 13:52:15 -07:00
WilliamEspegren	5bc7e8e5a0	doc list not empty (#21208 ) Make sure the doc list is not empty, and set Metadata: true in param, to enable the user to disable metadata for slightly faster crawls.	2024-06-20 13:52:15 -07:00
David Charles	db5404c365	langchain[minor]: add libs/partners to dev.Dockerfile (#21902 ) Resolves #21886 by adding "COPY libs/partners ../partners/" to libs/dev.Dockerfile Twitter: @kabakongo	2024-06-20 13:52:15 -07:00
TJ	48741e618a	community[patch]: Update documentation string in databricks chat model (#21915 ) Update typos in documentation string in databricks chat model	2024-06-20 13:52:15 -07:00
Nicolò Boschi	58f756471d	cli[minor]: add astradb in the cli migration to 0.2 (#21913 ) astradb has a new partner package but the automatic migration cli tool doesn't take care of migration astradb integrations	2024-06-20 13:52:15 -07:00
Coozywana	8fb415dcbd	Fix base.py typo (#21862 ) ChatOpenaAI --> ChatOpenAI Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-06-20 13:52:15 -07:00
fzowl	5691abc614	partners: Remove unnecessary print from voyageai embeddings (#21865 ) Thank you for contributing to LangChain! Remove unnecessary print from voyageai embeddings - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-06-20 13:52:15 -07:00
Bagatur	7798ab2590	docs: lcel how to and cheatsheet (#21851 )	2024-06-20 13:52:15 -07:00
Nuno Campos	c8cb4eade4	core: Tap output of sync iterators for astream_events (#21842 ) Thank you for contributing to LangChain! - [ ] PR title: "package: description" - Where "package" is whichever of langchain, community, core, experimental, etc. is being modified. Use "docs: ..." for purely docs changes, "templates: ..." for template changes, "infra: ..." for CI changes. - Example: "community: add foobar LLM" - [ ] PR message: *Delete this entire checklist* and replace with - Description: a description of the change - Issue: the issue # it fixes, if applicable - Dependencies: any dependencies required for this change - Twitter handle: if your PR gets announced, and you'd like a mention, we'll gladly shout you out! - [ ] Add tests and docs: If you're adding a new integration, please include 1. a test for the integration, preferably unit tests that do not rely on network access, 2. an example notebook showing its use. It lives in `docs/docs/integrations` directory. - [ ] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/ Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, hwchase17.	2024-06-20 13:52:15 -07:00
Eugene Yurtsev	17a2c21653	core[patch]: Check if event loop is closed in memory stream (#21841 ) Check if event stream is closed in memory loop. Using try/except here to avoid race condition, but this may incur a small overhead in versions prios to 3.11	2024-06-20 13:52:14 -07:00
Erick Friis	e71f24c254	experimental: release 0.0.59 (#21835 )	2024-06-20 13:52:14 -07:00
Erick Friis	6eab69da76	community: release 0.2.0 (#21834 )	2024-06-20 13:52:14 -07:00
Erick Friis	ed20039f4f	langchain: release 0.2.0, fix min deps (#21833 )	2024-06-20 13:52:14 -07:00
Erick Friis	a497d0e29b	text-splitters: release 0.2.0 (#21832 )	2024-06-20 13:52:14 -07:00
Erick Friis	31c3919ce8	langchain: release 0.2.0 (#21831 )	2024-06-20 13:52:14 -07:00
Erick Friis	8756b36afc	core: release 0.2.0 (#21829 )	2024-06-20 13:52:14 -07:00
Eugene Yurtsev	6c68d2553a	docs: clean up link to bing search (#21825 ) Documentation should be inlined, not linking to medium article.	2024-06-20 13:52:14 -07:00
ccurme	f7f17c7081	core, standard tests, partner packages: add test for model params (#21677 ) 1. Adds `.get_ls_params` to BaseChatModel which returns ```python class LangSmithParams(TypedDict, total=False): ls_provider: str ls_model_name: str ls_model_type: Literal["chat"] ls_temperature: Optional[float] ls_max_tokens: Optional[int] ls_stop: Optional[List[str]] ``` by default it will only return ```python {ls_model_type="chat", ls_stop=stop} ``` 2. Add these params to inheritable metadata in `CallbackManager.configure` 3. Implement `.get_ls_params` and populate all params for Anthropic + all subclasses of BaseChatOpenAI Sample trace: https://smith.langchain.com/public/d2962673-4c83-47c7-b51e-61d07aaffb1b/r OpenAI: <img width="984" alt="Screenshot 2024-05-17 at 10 03 35 AM" src="https://github.com/langchain-ai/langchain/assets/26529506/2ef41f74-a9df-4e0e-905d-da74fa82a910"> Anthropic: <img width="978" alt="Screenshot 2024-05-17 at 10 06 07 AM" src="https://github.com/langchain-ai/langchain/assets/26529506/39701c9f-7da5-4f1a-ab14-84e9169d63e7"> Mistral (and all others for which params are not yet populated): <img width="977" alt="Screenshot 2024-05-17 at 10 08 43 AM" src="https://github.com/langchain-ai/langchain/assets/26529506/37d7d894-fec2-4300-986f-49a5f0191b03">	2024-06-20 13:52:14 -07:00
Sen Lin	7960df6590	community[patch]: fix typo in ValueError message in load_local function (#21818 ) Description: Corrected an error in the `allow_dangerous_deserialization` message within the `load_local` functions	2024-06-20 13:52:14 -07:00
Jorge Piedrahita Ortiz	a90ddd23f9	community: sambaverse api update (#21816 ) - Description: fix sambaverse integration to make it compatible with sambaverse API update / minor changes in docs	2024-06-20 13:52:14 -07:00
maang-h	d5587cb3c5	community[patch]: Fix unintended newline in print statement in exception for BaichuanTextEmbeddings (#21820 ) - Code: langchain_community/embeddings/baichuan.py:82 - Description: When I make an error using 'baichuan embeddings', the printed error message is wrapped (there is actually no need to wrap) ```python # example from langchain_community.embeddings import BaichuanTextEmbeddings # error key BAICHUAN_API_KEY = "sk-xxxxxxxxxxxxx" embeddings = BaichuanTextEmbeddings(baichuan_api_key=BAICHUAN_API_KEY) text_1 = "今天天气不错" query_result = embeddings.embed_query(text_1) ``` ![unintended newline](https://github.com/langchain-ai/langchain/assets/55082429/e1178ce8-62bb-405d-a4af-e3b28eabc158)	2024-06-20 13:52:14 -07:00
Bakar Tavadze	31f8ac8140	langchain-robocorp[minor]: Enable passing additional headers to the action server. (#21809 ) Actions can optionally receive secrets via request headers. This PR enables this functionality.	2024-06-20 13:52:14 -07:00
Asaf Joseph Gardin	78ea3fc025	partners: Revert AI21 Labs docs scan feature (#21699 ) Description: Reverted commit #21614 --------- Co-authored-by: Asaf Gardin <asafg@ai21.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-20 13:52:14 -07:00
Eugene Yurtsev	13cf33287a	langchain[patch],community[patch]: Move unit tests that depend on community to community (#21685 )	2024-06-20 13:52:14 -07:00
Marco Lamina	8b2b48b4c4	community: Add token cost for GPT-4o model (#21771 ) Adding [token cost for the new GPT-4o model](https://openai.com/api/pricing/): * Input cost US$5.00 / 1M tokens * Output cost US$15.00 / 1M tokens	2024-06-20 13:52:14 -07:00
Massimiliano Pronesti	5b522ee1f6	feat(community): support semantic hybrid score threshold in Azure AI Search (#21527 ) Support semantic hybrid search with a score threshold -- similar to what we do for similarity search and for hybrid search (#20907).	2024-06-20 13:52:14 -07:00
Bagatur	092340f61d	anthropic[patch]: Release 0.1.13, tool_choice support (#21773 )	2024-06-20 13:52:14 -07:00
Stefano Lottini	63108ebe25	community: init signature revision for Cassandra LLM cache classes + small maintenance (#17765 ) This PR improves on the `CassandraCache` and `CassandraSemanticCache` classes, mainly in the constructor signature, and also introduces several minor improvements around these classes. ### Init signature A (sigh) breaking change is tentatively introduced to the constructor. To me, the advantages outweigh the possible discomfort: the new syntax places the DB-connection objects `session` and `keyspace` later in the param list, so that they can be given a default value. This is what enables the pattern of _not_ specifying them, provided one has previously initialized the Cassandra connection through the versatile utility method `cassio.init(...)`. In this way, a much less unwieldy instantiation can be done, such as `CassandraCache()` and `CassandraSemanticCache(embedding=xyz)`, everything else falling back to defaults. A downside is that, compared to the earlier signature, this might turn out to be breaking for those doing positional instantiation. As a way to mitigate this problem, this PR typechecks its first argument trying to detect the legacy usage. (And to make this point less tricky in the future, most arguments are left to be keyword-only). If this is considered too harsh, I'd like guidance on how to further smoothen this transition. Our plan is to make the pattern of optional session/keyspace a standard across all Cassandra classes, so that a repeatable strategy would be ideal. A possibility would be to keep positional arguments for legacy reasons but issue a deprecation warning if any of them is actually used, to later remove them with 0.2 - please advise on this point. ### Other changes - class docstrings: enriched, completely moved to class level, added note on `cassio.init(...)` pattern, added tiny sample usage code. - semantic cache: revised terminology to never mention "distance" (it is in fact a similarity!). Kept the legacy constructor param with a deprecation warning if used. - `llm_caching` notebook: uniform flow with the Cassandra and Astra DB separate cases; better and Cassandra-first description; all imports made explicit and from community where appropriate. - cache integration tests moved to community (incl. the imported tools), env var bugfix for `CASSANDRA_CONTACT_POINTS`. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-06-20 13:52:14 -07:00
Kyle Cassidy	476981022c	Standardized openai init params (#21739 ) ## Patch Summary community:openai[patch]: standardize init args ## Details I made changes to the OpenAI Chat API wrapper test in the Langchain open-source repository - File: `libs/community/tests/unit_tests/chat_models/test_openai.py` - Changes: - Updated `max_retries` with Pydantic Field - Updated the corresponding unit test - Related Issues: #20085 - Updated max_retries with Pydantic Field, updated the unit test. --------- Co-authored-by: JuHyung Son <sonju0427@gmail.com>	2024-06-20 13:52:13 -07:00
Ethan Yang	23647b44e0	community: update openvino doc with streaming support (#21519 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-20 13:52:13 -07:00
ccurme	56cff57fe4	community: fix CI (#21766 )	2024-06-20 13:52:13 -07:00
Mish Ushakov	54a56b91a1	community: updated Browserbase loader (#21757 ) Thank you for contributing to LangChain! - [x] PR title: "community: updated Browserbase loader" - [x] PR message: Updates the Browserbase loader with more options and improved docs. - [x] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. See contribution guidelines for more: https://python.langchain.com/docs/contributing/	2024-06-20 13:52:13 -07:00
Eugene Yurtsev	a26476f866	core[major]: only use function description (#21622 ) Do not prefix function signature --- * Reason for this is that information is already present with tool calling models. * This will save on tokens for those models, and makes it more obvious what the description is! * The @tool can get more parameters to allow a user to re-introduce the the signature if we want	2024-06-20 13:52:13 -07:00
William FH	46c1b56b09	Finish agent migration doc (#21731 )	2024-06-20 13:52:13 -07:00
Cheese	732a4bc329	community: Implement `bind_tools` for ChatTongyi (#20725 ) ## Description Implement `bind_tools` in ChatTongyi. Usage example: ```py from langchain_core.tools import tool from langchain_community.chat_models.tongyi import ChatTongyi @tool def multiply(first_int: int, second_int: int) -> int: """Multiply two integers together.""" return first_int * second_int llm = ChatTongyi(model="qwen-turbo") llm_with_tools = llm.bind_tools([multiply]) msg = llm_with_tools.invoke("What's 5 times forty two") print(msg) ``` Streaming is also supported. ## Dependencies No Dependency is required for this change. --------- Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-06-20 13:52:13 -07:00
Bagatur	658a87c50e	docs: add aca-ds (#21746 )	2024-06-20 13:52:13 -07:00
Erick Friis	119e11acd9	pinecone: bump min core version (#21742 )	2024-06-20 13:52:13 -07:00
Erick Friis	d0f0db2256	fireworks: bump min core version (#21741 )	2024-06-20 13:52:13 -07:00
Erick Friis	592af9f33d	airbyte[patch]: airbyte-cdk compatible pydantic versions (#21738 )	2024-06-20 13:52:13 -07:00
Erick Friis	7169b25d8a	ibm[patch]: release 0.1.7 (#21737 )	2024-06-20 13:52:13 -07:00
Erick Friis	83dcb567dd	openai[patch]: fix embedding float precision issue (#21736 ) also clean up + comment some of the embedding batching code	2024-06-20 13:52:13 -07:00
JuHyung Son	9636c0f7e3	upstage: Support batch input in embedding request. (#21730 ) Description: upstage embedding now supports batch input.	2024-06-20 13:52:13 -07:00
Harrison Chase	16a07bc743	Harrison/move flashrank rerank (#21448 ) third party integration, should be in community	2024-06-20 13:52:13 -07:00
Erick Friis	b12c2fb0bf	multiple: releases with relaxed core dep (#21724 )	2024-06-20 13:52:13 -07:00
Bagatur	b317b90af5	openai[patch]: Release 0.1.7, bump tiktoken 0.7.0 (#21723 )	2024-06-20 13:52:13 -07:00
William FH	9cda8b7ee8	[Core] Check is async callable (#21714 ) To permit proper coercion of objects like the following: ```python class MyAsyncCallable: async def __call__(self, foo): return await ... class MyAsyncGenerator: async def __call__(self, foo): await ... yield ```	2024-06-20 13:52:13 -07:00
Eugene Yurtsev	8ac6f8648f	core[minor]: Add v2 implementation of astream events (#21638 ) This PR introduces a v2 implementation of astream events that removes intermediate abstractions and fixes some issues with v1 implementation. The v2 implementation significantly reduces relevant code that's associated with the astream events implementation together with overhead. After this PR, the astream events implementation: - Uses an async callback handler - No longer relies on BaseTracer - No longer relies on json patch As a result of this re-write, a number of issues were discovered with the existing implementation. ## Changes in V2 vs. V1 ### on_chat_model_end `output` The outputs associated with `on_chat_model_end` changed depending on whether it was within a chain or not. As a root level runnable the output was: ```python "data": {"output": AIMessageChunk(content="hello world!", id='some id')} ``` As part of a chain the output was: ``` "data": { "output": { "generations": [ [ { "generation_info": None, "message": AIMessageChunk( content="hello world!", id=AnyStr() ), "text": "hello world!", "type": "ChatGenerationChunk", } ] ], "llm_output": None, } }, ``` After this PR, we will always use the simpler representation: ```python "data": {"output": AIMessageChunk(content="hello world!", id='some id')} ``` NOTE Non chat models (i.e., regular LLMs) are still associated with the more verbose format. ### Remove some `_stream` events `on_retriever_stream` and `on_tool_stream` events were removed -- these were not real events, but created as an artifact of implementing on top of astream_log. The same information is already available in the `x_on_end` events. ### Propagating Names Names of runnables have been updated to be more consistent ```python model = GenericFakeChatModel(messages=infinite_cycle).configurable_fields( messages=ConfigurableField( id="messages", name="Messages", description="Messages return by the LLM", ) ) ``` Before: ```python "name": "RunnableConfigurableFields", ``` After: ```python "name": "GenericFakeChatModel", ``` ### on_retriever_end on_retriever_end will always return `output` which is a list of documents (rather than a dict containing a key called "documents") ### Retry events Removed the `on_retry` callback handler. It was incorrectly showing that the failed function being retried has invoked `on_chain_end` https://github.com/langchain-ai/langchain/pull/21638/files#diff-e512e3f84daf23029ebcceb11460f1c82056314653673e450a5831147d8cb84dL1394	2024-06-20 13:52:12 -07:00
Rajendra Kadam	fb08b03801	langchain[minor]: Add PebbloRetrievalQA chain with Identity & Semantic Enforcement support (#20641 ) - Description: PebbloRetrievalQA chain introduces identity enforcement using vector-db metadata filtering - Dependencies: None - Issue: None - Documentation: Adding documentation for PebbloRetrievalQA chain in a separate PR(https://github.com/langchain-ai/langchain/pull/20746) - Unit tests: New unit-tests added --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>	2024-06-20 13:52:12 -07:00
Bagatur	3fa1219d2c	fmt	2024-05-14 16:02:50 -07:00

1 2 3 4 5 ...

4243 Commits