langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-08-10 13:27:36 +00:00

Author	SHA1	Message	Date
Rafael Miller	fc14f675f1	Community: Updated Firecrawl Document Loader to v1 (#26548 ) This PR updates the Firecrawl Document Loader to use the recently released V1 API of Firecrawl. Key Updates: Firecrawl V1 Integration: Updated the document loader to leverage the new Firecrawl V1 API for improved performance, reliability, and developer experience. Map Functionality Added: Introduced the map mode for more flexible document loading options. These updates enhance the integration and provide access to the latest features of Firecrawl. --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>	2024-10-15 13:13:28 +00:00
Max Tran	8fea07f92e	community: fixed KeyError: 'client' (#27345 ) Twitter handle: @MaxHTran. Tests and docs: not needed due to small change. --------- Co-authored-by: Max Tran <maxtra@amazon.com>	2024-10-14 20:51:13 +00:00
Martin Triska	8dc4bec947	[community] [Bugfix] base_o365 document loader metadata needs to be JSON serializable (#26322 ) In order for indexer to work, all metadata in the documents need to be JSON serializable. Timestamps are not. See here: https://github.com/langchain-ai/langchain/blob/master/libs/core/langchain_core/indexing/api.py#L83-L89 @eyurtsev could you please review? It's a tiny PR :-)	2024-10-14 12:48:31 -04:00
Marcelo Nunes Alves	5647276998	community: Problem with embeddings in new versions of clickhouse. (#26041 ) Starting with ClickHouse version 24.8, a different type of configuration was introduced for vectorized data ingestion; when this configuration is present, generating the table fails with an error, as can be seen below: ![Screenshot from 2024-09-04 11-48-00](https://github.com/user-attachments/assets/70840a93-1001-490c-921a-26924c51d9eb) --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-11 18:54:50 +00:00
Vittorio Rigamonti	7da2efd9d3	community[minor]: VectorStore Infinispan. Adding TLS and authentication (#23522 ) Description: this PR enables VectorStore TLS and authentication (digest, basic) with HTTP/2 for Infinispan server. Based on httpx. Added docker-compose facilities for testing. Added documentation. Dependencies: requires `pip install httpx[http2]` if HTTP/2 is needed. Twitter handle: https://twitter.com/infinispan	2024-10-09 10:51:39 -04:00
Stefano Lottini	d05fdd97dd	community: Cassandra Vector Store: extend metadata-related methods (#27078 ) Description: this PR adds a set of methods to deal with metadata associated with the vector store entries. These, while essential to the Graph-related extension of the `Cassandra` vector store, are also useful in themselves. These are (all come in their sync+async versions): - `[a]delete_by_metadata_filter` - `[a]replace_metadata` - `[a]get_by_document_id` - `[a]metadata_search` Additionally, a `[a]similarity_search_with_embedding_id_by_vector` method is introduced to better serve the store's internal workings (esp. related to reranking logic). Issue: no issue number, but now all returned `Document` instances bear their `.id` consistently (as a consequence of a slight refactoring in how the raw entries read from the DB are made back into `Document` instances). Dependencies: no new deps (packaging comes through langchain-core already; `cassio` is now required to be version 0.1.10+). Add tests and docs: added integration tests for the relevant newly-introduced methods. (Docs will be updated in a separate PR.) Lint and test: lint and (updated) tests all pass. --------- Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-09 06:41:34 +00:00
Serena Ruan	a7c1ce2b3f	[community] Add timeout control and retry for UC tool execution (#26645 ) Add timeout at client side for UCFunctionToolkit and add retry logic. Users could specify environment variable `UC_TOOL_CLIENT_EXECUTION_TIMEOUT` to increase the timeout value for retrying to get the execution response if the status is pending. Default timeout value is 120s. Tested in Databricks: <img width="1200" alt="image" src="https://github.com/user-attachments/assets/54ab5dfc-5e57-4941-b7d9-bfe3f8ad3f62"> --------- Signed-off-by: serena-ruan <serena.rxy@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-09 06:31:48 +00:00
Tomaz Bratanic	481bd25d29	community: Fix database connections for neo4j (#27190 ) Fixes https://github.com/langchain-ai/langchain/issues/27185 Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-08 23:47:55 +00:00
ccurme	e3920f2320	community[patch]: fix structured_output in llamacpp integration (#27202 ) Resolves https://github.com/langchain-ai/langchain/issues/25318.	2024-10-08 15:16:59 -04:00
Ajayeswar Reddy	9b7bdf1a26	Fixed typo in libs/community/langchain_community/storage/sql.py (#27029 ) - PR title: docs: fix typo in SQLStore import path - PR message: - Description: This PR corrects a typo in the docstrings for the class SQLStore(BaseStore[str, bytes]). The import path in the docstring currently reads `from langchain_rag.storage import SQLStore`, which should be changed to `from langchain_community.storage import SQLStore`. This typo is also reflected in the official documentation. - Issue: N/A - Dependencies: None - Twitter handle: N/A Co-authored-by: Erick Friis <erick@langchain.dev>	2024-10-08 17:51:26 +00:00
Oleksii Pokotylo	37ca468d03	community: AzureSearch: fix reranking for empty lists (#27104 ) Description: Fix reranking for empty lists Issue: ``` ValueError: not enough values to unpack (expected 3, got 0) documents, scores, vectors = map(list, zip(*docs)) File langchain_community/vectorstores/azuresearch.py", line 1680, in _reorder_results_with_maximal_marginal_relevance ``` Co-authored-by: Oleksii Pokotylo <oleksii.pokotylo@pwc.com>	2024-10-07 15:27:09 -04:00
Christophe Bornet	c4ebccfec2	core[minor]: Improve support for id in VectorStore (#26660 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-10-07 15:01:08 -04:00
Jorge Piedrahita Ortiz	14de81b140	community: sambastudio chat model (#27056 ) Description:: sambastudio chat model integration added, previously only LLM integration included docs and tests --------- Co-authored-by: luisfucros <luisfucros@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-10-07 14:31:39 -04:00
ogawa	07dd8dd3d7	community[patch]: update gpt-4o cost (#27038 ) updated OpenAI cost definition according to the following: https://openai.com/api/pricing/	2024-10-07 09:06:30 -04:00
Bagatur	4935a14314	core,integrations[minor]: Dont error on fields in model_kwargs (#27110 ) Given the current erroring behavior, every time we've moved a kwarg from model_kwargs and made it its own field that was a breaking change. Updating this behavior to support the old instantiations / serializations. Assuming build_extra_kwargs was not something that itself is being used externally and needs to be kept backwards compatible	2024-10-04 11:30:27 -07:00
Tibor Reiss	47a9199fa6	community[patch]: Fix missing protected_namespaces (#27076 ) Fixes #26861 --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-10-03 20:12:11 +00:00
Bharat Ramanathan	103e573f9b	community[patch]: chore warn deprecate the wandb callback handler (#27062 ) - Description:: This PR deprecates the wandb callback handler in favor of the new [WeaveTracer](https://weave-docs.wandb.ai/guides/integrations/langchain#using-weavetracer) in W&B - Dependencies: No dependencies, just a deprecation warning. - Twitter handle: @parambharat @baskaryan	2024-10-03 11:59:20 -04:00
Erick Friis	95a87291fd	community: deprecate community ollama integrations (#26733 )	2024-10-01 09:18:07 -07:00
ZhangShenao	e317d457cf	Bug-Fix[Community] Fix `FastEmbedEmbeddings` (#26764 ) #26759 - Fix https://github.com/langchain-ai/langchain/issues/26759 - Change `model` param from private to public, which may not be initiated. - Add test case	2024-09-30 21:23:08 -04:00
Erick Friis	7ecd720120	multiple: update docs urls to latest 2 (#26837 )	2024-09-30 17:37:07 -07:00
Tomaz Bratanic	446144e7c6	Update neo4j vector procedures (#26775 )	2024-09-30 14:45:09 -07:00
Eugene Yurtsev	7fde2791dc	core[patch]: Add kwargs to Runnable (#27008 ) Fixes #26685 --------- Co-authored-by: Tibor Reiss <tibor.reiss@gmail.com>	2024-09-30 16:45:29 -04:00
Christophe Bornet	2a6abd3f0a	community[patch]: Add docstring for Links (#25969 ) Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2024-09-30 20:33:50 +00:00
Ben Chambers	29bf89db25	community: Add conversions from GVS to networkx (#26906 ) These allow converting linked documents (such as those used with GraphVectorStore) to networkx for rendering and/or in-memory graph algorithms such as community detection.	2024-09-27 16:48:55 -04:00
Subhrajyoty Roy	7f37fd8b80	community[patch]: callback before yield for cloudflare (#26927 ) Description: Moves yield to after callback for `_stream` function for the Cloudflare Workers AI model in the community llm package Issue: #16913	2024-09-27 08:42:01 -04:00
Youshin Kim	2d9a09dfa4	Fix typo in mlflow code example in mlflow.py (#26931 ) - [x] PR title: Fix typo in code example in mlflow.py - In libs/community/langchain_community/chat_models/mlflow.py	2024-09-27 12:41:39 +00:00
Subhrajyoty Roy	7037ba0f06	community[patch]: callback before yield for mlx pipeline (#26928 ) Description: Moves yield to after callback for `_stream` function for the MLX pipeline model in the community llm package Issue: #16913	2024-09-27 08:41:34 -04:00
Subhrajyoty Roy	adcfecdb67	community[patch]: callback before yield for textgen (#26929 ) Description: Moves callback to before yield for `_stream` and `_astream` function for the textgen model in the community llm package Issue: #16913	2024-09-27 08:41:13 -04:00
Subhrajyoty Roy	5f2cc4ecb2	community[patch]: callback before yield for titan takeoff (#26930 ) Description: Moves yield to after callback for `_stream` function for the titan takeoff model in the community llm package Issue: #16913	2024-09-27 08:40:22 -04:00
Abhi Agarwal	696114e145	community: add sqlite-vec vectorstore (#25003 ) Description: Adds a vector store integration with [sqlite-vec](https://alexgarcia.xyz/sqlite-vec/), the successor to sqlite-vss that is a single C file with no external dependencies. Pretty straightforward, just copy-pasted the sqlite-vss integration and made a few tweaks and added integration tests. Only question is whether all documentation should be directed away from sqlite-vss if it is defacto deprecated (cc @asg017). --------- Co-authored-by: Erick Friis <erick@langchain.dev> Co-authored-by: philippe-oger <philippe.oger@adevinta.com>	2024-09-26 17:37:10 +00:00
Subhrajyoty Roy	ba467f1a36	community[patch]: callback before yield for gigachat (#26881 ) Description: Moves yield to after callback for `_stream` and `_astream` function for the gigachat model in the community llm package Issue: #16913	2024-09-26 12:47:28 -04:00
Subhrajyoty Roy	11e703a97e	community[patch]: callback before yield for google palm (#26882 ) Description: Moves yield to after callback for `_stream` function for the google palm model in the community package Issue: #16913	2024-09-26 12:47:05 -04:00
Rajendra Kadam	51c4393298	community[patch]: Fix validation error in SettingsConfigDict across multiple Langchain modules (#26852 ) - Description: This pull request addresses the validation error in `SettingsConfigDict` due to extra fields in the `.env` file. The issue is prevalent across multiple Langchain modules. This fix ensures that extra fields in the `.env` file are ignored, preventing validation errors. Changes include: - Applied fixes to modules using `SettingsConfigDict`. - Issue: NA, similar https://github.com/langchain-ai/langchain/issues/26850 - Dependencies: NA	2024-09-25 10:02:14 -04:00
Rajendra Kadam	7e5a9c317f	community[minor]: [Pebblo] Enhance PebbloSafeLoader to take anonymize flag (#26812 ) - Description: The flag is named `anonymize_snippets`. When set to true, the Pebblo server will anonymize snippets by redacting all personally identifiable information (PII) from the snippets going into VectorDB and the generated reports - Issue: NA - Dependencies: NA - docs: Updated	2024-09-25 09:33:06 -04:00
Rajendra Kadam	92003b3724	community[patch]: [SharePointLoader] Fix validation error in _O365Settings due to extra fields in .env file (#26851 ) Description: Fix validation error in _O365Settings by ignoring extra fields in .env file Issue: https://github.com/langchain-ai/langchain/issues/26850 Dependencies: NA	2024-09-25 09:31:59 -04:00
Subhrajyoty Roy	b61fb98466	community[patch]: callback before yield for friendli (#26842 ) Description: Moves yield to after callback for `_stream` and `_astream` function for the friendli model in the community package Issue: #16913	2024-09-25 09:31:12 -04:00
Subhrajyoty Roy	b1da532522	community[patch]: callback before yield for deepsparse llm (#26822 ) Description: Moves yield to after callback for `_stream` and `_astream` function for the deepsparse model in the community package Issue: #16913	2024-09-24 13:55:52 -04:00
Jorge Piedrahita Ortiz	408a930d55	community: Add Sambanova Cloud Chat model community integration (#26333 ) Description: : Add SambaNova Cloud Chat model community integration Includes - chat model integration (following Standardize ChatModel docstrings) - tests - docs usage notebook (following Standardize ChatModel integration docs) https://cloud.sambanova.ai/ --------- Co-authored-by: luisfucros <luisfucros@gmail.com> Co-authored-by: ccurme <chester.curme@gmail.com>	2024-09-24 14:11:32 +00:00
Tom	2b83c7c3ab	community[patch]: Fix `tool_calls` parsing when streaming from DeepInfra (#26813 ) - Description: This PR fixes the response parsing logic for `ChatDeepInfra`, more specifically `_convert_delta_to_message_chunk()`, which is invoked when streaming via `ChatDeepInfra`. - Issue: Streaming from DeepInfra via `ChatDeepInfra` is currently broken because the response parsing logic doesn't handle that `tool_calls` can be `None`. (There is no GitHub issue for this problem yet.) - Dependencies: – - Twitter handle: –	2024-09-24 13:47:36 +00:00
Subhrajyoty Roy	997d95c8f8	community[patch]: callback before yield for bedrock llm (#26804 ) Description: Moves yield to after callback for `_prepare_input_and_invoke_stream` and `_aprepare_input_and_invoke_stream` for bedrock llm in community package. Issue: #16913	2024-09-24 12:14:59 +00:00
Gabriel Altay	bb40a0fb32	Remove pydantic restricted namespaces from HuggingFaceInferenceAPIEmbedings (#26744 ) Without this `model_config`, importing this package produces warnings about "model_name" having conflicts with protected namespace "model_". --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-22 08:05:37 -04:00
Gor Hayrapetyan	f97ac92f00	community[patch]: Handle empty PR body in get_pull_request in Github utility (#26739 ) Description: When the PR body is empty, the `get_pull_request` method fails with the exception below. Issue: ``` TypeError('expected string or buffer')Traceback (most recent call last): File ".../.venv/lib/python3.9/site-packages/langchain_core/tools/base.py", line 661, in run response = context.run(self._run, tool_args, tool_kwargs) File ".../.venv/lib/python3.9/site-packages/langchain_community/tools/github/tool.py", line 52, in _run return self.api_wrapper.run(self.mode, query) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 816, in run return json.dumps(self.get_pull_request(int(query))) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 495, in get_pull_request add_to_dict(response_dict, "body", pull.body) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 487, in add_to_dict tokens = get_tokens(value) File ".../.venv/lib/python3.9/site-packages/langchain_community/utilities/github.py", line 483, in get_tokens return len(tiktoken.get_encoding("cl100k_base").encode(text)) File "....venv/lib/python3.9/site-packages/tiktoken/core.py", line 116, in encode if match := _special_token_regex(disallowed_special).search(text): TypeError: expected string or buffer ``` Twitter: __gorros__	2024-09-22 01:56:24 +00:00
Christophe Bornet	58f339a67c	community: Fix links in GraphVectorStore pydoc (#25959 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-20 23:17:53 +00:00
Christophe Bornet	e49c413977	core: Add docstring for GraphVectorStoreRetriever (#26224 ) Co-authored-by: Erick Friis <erickfriis@gmail.com>	2024-09-20 23:16:37 +00:00
William FH	19ce95d3c9	Avoid copying runs (#26689 ) Also, re-unify run trees. Use a single shared client.	2024-09-20 10:57:41 -07:00
Eric	90031b1b3e	support epsilla cloud vector database in langchain (#26065 ) Description - support epsilla cloud in langchain --------- Co-authored-by: Leonid Ganeline <leo.gan.57@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2024-09-20 17:14:23 +00:00
stein1988	91594928c5	fix:fix ChatZhipuAI tool call bug (#26693 ) - [ ] PR title: "community:fix ChatZhipuAI tool call bug" - [ ] Description: ZhipuAI api response as follows: {'id': '20240920132549e379a9152a6a4d7c', 'created': 1726809949, 'model': 'glm-4-flash', 'choices': [{'index': 0, 'finish_reason': 'tool_calls', 'delta': {'role': 'assistant', 'tool_calls': [{'id': 'call_20240920132549e379a9152a6a4d7c', 'index': 0, 'type': 'function', 'function': {'name': 'get_datetime_offline', 'arguments': '{}'}}]}}]} so, tool_calls = dct.get("tool_call", None) in _convert_delta_to_message_chunk should be "tool_calls"	2024-09-20 13:06:42 +00:00
Erick Friis	311f861547	core, community: move graph vectorstores to community (#26678 ) remove beta namespace from core, add to community	2024-09-19 11:38:14 -07:00
Serena Ruan	c77c28e631	[community] Fix WorkspaceClient error with pydantic validation (#26649 ) Fix error like <img width="1167" alt="image" src="https://github.com/user-attachments/assets/2e219b26-ec7e-48ef-8111-e0ff2f5ac4c0"> After the fix: <img width="584" alt="image" src="https://github.com/user-attachments/assets/48f36fe7-628c-48b6-81b2-7fe741e4ca85"> --------- Signed-off-by: serena-ruan <serena.rxy@gmail.com> Co-authored-by: Erick Friis <erick@langchain.dev>	2024-09-19 18:25:33 +00:00
Rajendra Kadam	60dc19da30	[community] Added PebbloTextLoader for loading text data in PebbloSafeLoader (#26582 ) - Description: Added PebbloTextLoader for loading text in PebbloSafeLoader. - Since PebbloSafeLoader wraps document loaders, this new loader enables direct loading of text into Documents using PebbloSafeLoader. - Issue: NA - Dependencies: NA - [x] Tests: Added/Updated tests	2024-09-19 09:59:04 -04:00
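
Note on commit 37ca468d03 (#27104) in the log above: its message quotes a `ValueError` raised when `map(list, zip(*docs))` is unpacked into three names while `docs` is empty. The following is a minimal sketch of that failure mode and one possible guard. The function name `split_results` and the tuple layout are hypothetical, chosen for illustration; this is not the code from the PR.

```python
from typing import Any, List, Sequence, Tuple


def split_results(
    docs: Sequence[Tuple[Any, float, List[float]]],
) -> Tuple[List[Any], List[float], List[List[float]]]:
    """Split (document, score, vector) triples into three parallel lists.

    Hypothetical helper: with an empty input, zip(*docs) yields nothing,
    so unpacking into three names would raise ValueError. Returning three
    empty lists early avoids that.
    """
    if not docs:
        return [], [], []
    documents, scores, vectors = map(list, zip(*docs))
    return documents, scores, vectors


# Reproduce the unguarded failure on an empty list.
try:
    documents, scores, vectors = map(list, zip(*[]))
except ValueError as exc:
    print(f"unguarded unpacking failed: {exc}")

# The guarded helper handles both the empty and the non-empty case.
print(split_results([]))
print(split_results([("doc-a", 0.9, [0.1, 0.2]), ("doc-b", 0.5, [0.3, 0.4])]))
```

An early return keeps the three-list return shape stable for callers, which is one reasonable design choice; the actual change in the PR may differ.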

1474 Commits