langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-14 15:16:21 +00:00

Author	SHA1	Message	Date
Sydney Runkle	7e926520d5	packaging: remove Python upper bound for langchain and co libs (#31025 ) Follow up to https://github.com/langchain-ai/langsmith-sdk/pull/1696, I've bumped the `langsmith` version where applicable in `uv.lock`. Type checking problems here because deps have been updated in `pyproject.toml` and `uv lock` hasn't been run - we should enforce that in the future - goes with the other dependabot todos :).	2025-04-28 14:44:28 -04:00
Sydney Runkle	8c6734325b	partners[lint]: run `pyupgrade` to get code in line with 3.9 standards (#30781 ) Using `pyupgrade` to get all `partners` code up to 3.9 standards (mostly, fixing old `typing` imports).	2025-04-11 07:18:44 -04:00
Tin Lai	4d03ba4686	langchain_qdrant: fix showing the missing sparse vector name (#30701 ) Description: The error message was supposed to display the missing vector name, but instead, it includes only the existing collection configs. This simple PR just includes the correct variable name, so that the user knows the requested vector does not exist in the collection. Additional guidelines: - Make sure optional dependencies are imported within a function. - Please do not add dependencies to pyproject.toml files (even optional ones) unless they are required for unit tests. - Most PRs should not touch more than one package. - Changes should be backwards compatible. - If you are adding something to community, do not re-import it in langchain. If no one reviews your PR within a few days, please @-mention one of baskaryan, eyurtsev, ccurme, vbarda, hwchase17. Signed-off-by: Tin Lai <tin@tinyiu.com>	2025-04-07 09:19:08 -04:00
Erick Friis	1a225fad03	multiple: fix uv path deps (#29790 ) file:// format wasn't working with updates - it doesn't install as an editable dep move to tool.uv.sources with path= instead	2025-02-13 21:32:34 +00:00
Vincent Emonet	3645181d0e	qdrant: Add `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` (#29641 ) Added `similarity_search_with_score_by_vector()` function to the `QdrantVectorStore` class. It is required when we want to query multiple time with the same embeddings. It was present in the now deprecated original `Qdrant` vectorstore implementation, but was absent from the new one. It is also implemented in a number of others `VectorStore` implementations I have added tests for this new function Note that I also argued in this discussion that it should be part of the general `VectorStore` https://github.com/langchain-ai/langchain/discussions/29638 Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 00:55:58 +00:00
ccurme	3450bfc806	infra: add UV_FROZEN to makefiles (#29642 ) These are set in Github workflows, but forgot to add them to most makefiles for convenience when developing locally. `uv run` will automatically sync the lock file. Because many of our development dependencies are local installs, it will pick up version changes and update the lock file. Passing `--frozen` or setting this environment variable disables the behavior.	2025-02-06 14:36:54 -05:00
ccurme	d172984c91	infra: migrate to uv (#29566 )	2025-02-06 13:36:26 -05:00
Erick Friis	fcc9cdd100	multiple: disable socket for unit tests (#29080 )	2025-01-07 15:31:50 -08:00
ccurme	a8b21afc08	qdrant[patch]: run python 3.13 in CI (#28394 )	2024-11-27 12:22:17 -05:00
ZhangShenao	c2072d909a	Improvement[Partner] Improve qdrant vector store (#27251 ) - Add static method decorator - Add args for api doc - Fix word spelling Co-authored-by: Erick Friis <erick@langchain.dev>	2024-11-07 02:42:41 +00:00
Bagatur	dfa83531ad	qdrant,nomic[minor]: bump core deps (#27849 )	2024-11-04 20:19:50 +00:00
Erick Friis	92ae61bcc8	multiple: rely on asyncio_mode auto in tests (#27200 )	2024-10-15 16:26:38 +00:00
ccurme	0592c29e9b	qdrant[patch]: release 0.1.4 (#26534 ) `langchain-qdrant` imports pydantic but was importing pydantic proper before 0.3 release: `042e84170b/libs/partners/qdrant/langchain_qdrant/sparse_embeddings.py (L5-L8)`	2024-09-16 13:04:12 -04:00
Erick Friis	c2a3021bb0	multiple: pydantic 2 compatibility, v0.3 (#26443 ) Signed-off-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com> Co-authored-by: Dan O'Donovan <dan.odonovan@gmail.com> Co-authored-by: Tom Daniel Grande <tomdgrande@gmail.com> Co-authored-by: Grande <Tom.Daniel.Grande@statsbygg.no> Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com> Co-authored-by: Tomaz Bratanic <bratanic.tomaz@gmail.com> Co-authored-by: ZhangShenao <15201440436@163.com> Co-authored-by: Friso H. Kingma <fhkingma@gmail.com> Co-authored-by: ChengZi <chen.zhang@zilliz.com> Co-authored-by: Nuno Campos <nuno@langchain.dev> Co-authored-by: Morgante Pell <morgantep@google.com>	2024-09-13 14:38:45 -07:00
Bagatur	3ec93c2817	standard-tests[patch]: add Ser/Des test	2024-09-04 10:24:06 -07:00
Christophe Bornet	038c287b3a	all: Improve make lint command (#25344 ) * Removed `ruff check --select I` as `I` is already selected and checked in the main `ruff check` command * Added checks for non-empty `PYTHON_FILES` * Run `ruff check` only on `PYTHON_FILES` Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-23 18:23:52 -07:00
Christophe Bornet	7f1e444efa	partners: Use simsimd types (#25299 ) The simsimd package [now has types](https://github.com/ashvardanian/SimSIMD/releases/tag/v5.0.0)	2024-08-23 10:41:39 -04:00
Anush	472527166f	qdrant: Update API reference link and install command (#25245 ) ## Description As the title goes. The current API reference links to the deprecated class.	2024-08-11 16:54:14 -04:00
Isaac Francisco	a72fddbf8d	[docs]: vector store integration pages (#24858 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-08-06 17:20:27 +00:00
ZhangShenao	2c3e3dc6b1	patch[Partners] Unified fix of incorrect variable declarations in all check_imports (#25014 ) There are some incorrect declarations of variable `has_failure` in check_imports. The purpose of this PR is to uniformly fix these errors.	2024-08-03 13:49:41 -04:00
Bagatur	e81ddb32a6	docs: fix kwargs docstring (#25010 ) Fix: ![Screenshot 2024-08-02 at 5 33 37 PM](https://github.com/user-attachments/assets/7c56cdeb-ee81-454c-b3eb-86aa8a9bdc8d)	2024-08-02 19:54:54 -07:00
Anush	4585eaef1b	qdrant: Fix vectors_config access (#24606 ) ## Description Fixes #24558 by accessing `vectors_config` after asserting it to be a dict.	2024-07-24 10:54:33 -04:00
Erick Friis	3dce2e1d35	all: add release notes to pypi (#24519 )	2024-07-22 13:59:13 -07:00
Anush	a653b209ba	qdrant: test new QdrantVectorStore (#24165 ) ## Description This PR adds integration tests to follow up on #24164. By default, the tests use an in-memory instance. To run the full suite of tests, with both in-memory and Qdrant server: ``` $ docker run -p 6333:6333 qdrant/qdrant $ make test $ make integration_test ``` --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:59:30 +00:00
Christian D. Glissov	474b88326f	langchain_qdrant: Added method "_asimilarity_search_with_relevance_scores" to Qdrant class (#23954 ) I stumbled upon a bug that led to different similarity scores between the async and sync similarity searches with relevance scores in Qdrant. The reason being is that _asimilarity_search_with_relevance_scores is missing, this makes langchain_qdrant use the method of the vectorstore baseclass leading to drastically different results. To illustrate the magnitude here are the results running an identical search in a test vectorstore. Output of asimilarity_search_with_relevance_scores: [0.9902903374601824, 0.9472135924938804, 0.8535534011299859] Output of similarity_search_with_relevance_scores: [0.9805806749203648, 0.8944271849877607, 0.7071068022599718] Co-authored-by: Erick Friis <erick@langchain.dev>	2024-07-12 23:25:20 +00:00
Anush	d09dda5a08	qdrant: Bump patch version (#24168 ) # Description To release a new version of `langchain-qdrant` after #24165 and #24166.	2024-07-12 14:48:50 -07:00
Anush	7014d07cab	qdrant: new Qdrant implementation (#24164 )	2024-07-12 04:52:02 +02:00
Anush	7790d67f94	qdrant: New sparse embeddings provider interface - PART 1 (#24015 ) ## Description This PR introduces a new sparse embedding provider interface to work with the new Qdrant implementation that will follow this PR. Additionally, an implementation of this interface is provided with https://github.com/qdrant/fastembed. This PR will be followed by https://github.com/Anush008/langchain/pull/3.	2024-07-11 17:07:25 -07:00
Bagatur	a0c2281540	infra: update mypy 1.10, ruff 0.5 (#23721 ) ```python """python scripts/update_mypy_ruff.py""" import glob import tomllib from pathlib import Path import toml import subprocess import re ROOT_DIR = Path(__file__).parents[1] def main(): for path in glob.glob(str(ROOT_DIR / "libs/*/pyproject.toml"), recursive=True): print(path) with open(path, "rb") as f: pyproject = tomllib.load(f) try: pyproject["tool"]["poetry"]["group"]["typing"]["dependencies"]["mypy"] = ( "^1.10" ) pyproject["tool"]["poetry"]["group"]["lint"]["dependencies"]["ruff"] = ( "^0.5" ) except KeyError: continue with open(path, "w") as f: toml.dump(pyproject, f) cwd = "/".join(path.split("/")[:-1]) completed = subprocess.run( "poetry lock --no-update; poetry install --with typing; poetry run mypy . --no-color", cwd=cwd, shell=True, capture_output=True, text=True, ) logs = completed.stdout.split("\n") to_ignore = {} for l in logs: if re.match("^(.)\:(\d+)\: error:.\[(.)\]", l): path, line_no, error_type = re.match( "^(.)\:(\d+)\: error:.\[(.*)\]", l ).groups() if (path, line_no) in to_ignore: to_ignore[(path, line_no)].append(error_type) else: to_ignore[(path, line_no)] = [error_type] print(len(to_ignore)) for (error_path, line_no), error_types in to_ignore.items(): all_errors = ", ".join(error_types) full_path = f"{cwd}/{error_path}" try: with open(full_path, "r") as f: file_lines = f.readlines() except FileNotFoundError: continue file_lines[int(line_no) - 1] = ( file_lines[int(line_no) - 1][:-1] + f" # type: ignore[{all_errors}]\n" ) with open(full_path, "w") as f: f.write("".join(file_lines)) subprocess.run( "poetry run ruff format .; poetry run ruff --select I --fix .", cwd=cwd, shell=True, capture_output=True, text=True, ) if __name__ == "__main__": main() ```	2024-07-03 10:33:27 -07:00
Anush	e002c855bd	qdrant[patch]: Use collection_exists API instead of exceptions (#22764 ) ## Description Currently, the Qdrant integration relies on exceptions raised by [`get_collection` ](https://qdrant.tech/documentation/concepts/collections/#collection-info) to check if a collection exists. Using [`collection_exists`](https://qdrant.tech/documentation/concepts/collections/#check-collection-existence) is recommended to avoid missing any unhandled exceptions. This PR addresses this. ## Testing All integration and unit tests pass. No user-facing changes.	2024-06-13 20:01:32 -07:00
Anush	80560419b0	qdrant[patch]: Make path optional in from_existing_collection() (#21875 ) ## Description The `path` param is used to specify the local persistence directory, which isn't required if using Qdrant server. This is a breaking but necessary change.	2024-06-06 10:37:08 -07:00
Joydeep Banik Roy	3796672c67	community, milvus, pinecone, qdrant, mongo: Broadcast operation failure while using simsimd beyond v3.7.7 (#22271 ) - [ ] Packages affected: - community: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/milvus: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/mongodb: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/pinecone: fix `cosine_similarity` to support simsimd beyond 3.7.7 - partners/qdrant: fix `cosine_similarity` to support simsimd beyond 3.7.7 - [ ] Broadcast operation failure while using simsimd beyond v3.7.7: - Description: I was using simsimd 4.3.1 and the unsupported operand type issue popped up. When I checked out the repo and ran the tests, they failed as well (have attached a screenshot for that). Looks like it is a variant of https://github.com/langchain-ai/langchain/issues/18022 . Prior to 3.7.7, simd.cdist returned an ndarray but now it returns simsimd.DistancesTensor which is ineligible for a broadcast operation with numpy. With this change, it also remove the need to explicitly cast `Z` to numpy array - Issue: #19905 - Dependencies: No - Twitter handle: https://x.com/GetzJoydeep <img width="1622" alt="Screenshot 2024-05-29 at 2 50 00 PM" src="https://github.com/langchain-ai/langchain/assets/31132555/fb27b383-a9ae-4a6f-b355-6d503b72db56"> - [ ] Considerations: 1. I started with community but since similar changes were there in Milvus, MongoDB, Pinecone, and QDrant so I modified their files as well. If touching multiple packages in one PR is not the norm, then I can remove them from this PR and raise separate ones 2. I have run and verified that the tests work. Since, only MongoDB had tests, I ran theirs and verified it works as well. Screenshots attached : <img width="1573" alt="Screenshot 2024-05-29 at 2 52 13 PM" src="https://github.com/langchain-ai/langchain/assets/31132555/ce87d1ea-19b6-4900-9384-61fbc1a30de9"> <img width="1614" alt="Screenshot 2024-05-29 at 3 33 51 PM" src="https://github.com/langchain-ai/langchain/assets/31132555/6ce1d679-db4c-4291-8453-01028ab2dca5"> I have added a test for simsimd. I feel it may not go well with the CI/CD setup as installing simsimd is not a dependency requirement. I have just imported simsimd to ensure simsimd cosine similarity is invoked. However, its not a good approach. Suggestions are welcome and I can make the required changes on the PR. Please provide guidance on the same as I am new to the community. --------- Co-authored-by: Bagatur <baskaryan@gmail.com> Co-authored-by: Bagatur <22008038+baskaryan@users.noreply.github.com>	2024-06-04 17:36:31 +00:00
Bagatur	50186da0a1	infra: rm unused # noqa violations (#22049 ) Updating #21137	2024-05-22 15:21:08 -07:00
Erick Friis	aca98fd150	multiple: releases with relaxed core dep (#21724 )	2024-05-15 19:29:35 +00:00
Anush	edd68e4ad4	qdrant: init package (#21146 ) ## Description This PR introduces the new `langchain-qdrant` partner package, intending to deprecate the community package. ## Changes - Moved the Qdrant vector store implementation `/libs/partners/qdrant` with integration tests. - The conditional imports of the client library are now regular with minor implementation improvements. - Added a deprecation warning to `langchain_community.vectorstores.qdrant.Qdrant`. - Replaced references/imports from `langchain_community` with either `langchain_core` or by moving the definitions to the `langchain_qdrant` package itself. - Updated the Qdrant vector store documentation to reflect the changes. ## Testing - `QDRANT_URL` and [`QDRANT_API_KEY`](`583e36bf6b`) env values need to be set to [run integration tests](`d608c93d1f`) in the [cloud](https://cloud.qdrant.tech). - If a Qdrant instance is running at `http://localhost:6333`, the integration tests will use it too. - By default, tests use an [`in-memory`](https://github.com/qdrant/qdrant-client?tab=readme-ov-file#local-mode) instance(Not comprehensive). --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-05-13 18:20:03 -07:00

35 Commits