langchain/libs/community/tests/unit_tests/vectorstores
Martin Triska 3fc0ea510e
community : [bugfix] Use document ids as keys in AzureSearch vectorstore (#25486)
# Description
[Vector store base
class](4cdaca67dc/libs/core/langchain_core/vectorstores/base.py (L65))
currently expects `ids` to be passed in and that is what it passes along
to the AzureSearch vector store when attempting to `add_texts()`.
However AzureSearch expects `keys` to be passed in. When they are not
present, AzureSearch `add_embeddings()` makes up new uuids. This is a
problem when trying to run indexing. [Indexing code
expects](b297af5482/libs/core/langchain_core/indexing/api.py (L371))
the documents to be uploaded using provided ids. Currently AzureSearch
ignores `ids` passed from `indexing` and makes up new ones. Later when
`indexer` attempts to delete removed file, it uses the `id` it had
stored when uploading the document, however it was uploaded under
different `id`.

**Twitter handle: @martintriska1**
2024-09-19 09:37:18 -04:00
..
redis multiple: pydantic 2 compatibility, v0.3 (#26443) 2024-09-13 14:38:45 -07:00
__init__.py community[major], core[patch], langchain[patch], experimental[patch]: Create langchain-community (#14463) 2023-12-11 13:53:30 -08:00
test_aerospike.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00
test_azure_search.py community : [bugfix] Use document ids as keys in AzureSearch vectorstore (#25486) 2024-09-19 09:37:18 -04:00
test_databricks_vector_search.py community[patch]: Fix Hybrid Search for non-Databricks managed embeddings (#25590) 2024-08-23 08:57:13 +00:00
test_elasticsearch.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00
test_faiss.py core[patch]: add standard tracing params for retrievers (#25240) 2024-08-12 14:51:59 +00:00
test_hanavector.py community[minor]: VectorStore integration for SAP HANA Cloud Vector Engine (#16514) 2024-01-24 14:05:07 -08:00
test_imports.py community[minor]: Add ApertureDB as a vectorstore (#24088) 2024-07-16 09:32:59 -07:00
test_indexing_docs.py community[minor]: Add ApertureDB as a vectorstore (#24088) 2024-07-16 09:32:59 -07:00
test_inmemory.py multiple: pydantic 2 compatibility, v0.3 (#26443) 2024-09-13 14:38:45 -07:00
test_neo4j.py community[patch]: Add the ability to pass maps to neo4j retrieval query (#19758) 2024-03-29 08:33:48 -07:00
test_pgvector.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00
test_sklearn.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00
test_tencentvectordb.py community[patch], langchain[minor]: Enhance Tencent Cloud VectorDB, langchain: make Tencent Cloud VectorDB self query retrieve compatible (#19651) 2024-04-09 16:50:48 +00:00
test_utils.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00