langchain

mirror of https://github.com/hwchase17/langchain.git synced 2025-09-14 22:17:15 +00:00

Author	SHA1	Message	Date
JING	36037c9251	fix(docs): update Anthropic model name and add version warnings (#32807 ) Description: This PR fixes the broken Anthropic model example in the documentation introduction page and adds a comment field to display model version warnings in code blocks. The changes ensure that users can successfully run the example code and are reminded to check for the latest model versions. Issue: https://github.com/langchain-ai/langchain/issues/32806 Changes made: - Update Anthropic model from broken "claude-3-5-sonnet-latest" to working "claude-3-7-sonnet-20250219" - Add comment field to display model version warnings in code blocks - Improve user experience by providing working examples and version guidance Dependencies: None required	2025-09-03 15:25:13 -04:00
Rostyslav Borovyk	b2b835cb36	docs(docs): add Oxylabs document loader (#32429 ) --------- Co-authored-by: Mason Daugherty <github@mdrxy.com>	2025-08-15 10:46:26 -04:00
William Espegren	d2ac3b375c	fix(docs): add Spider as a webpage loader (#32453 ) [Spider](https://spider.cloud/) is a webpage loader and should be listed under the ["Webpages"](https://python.langchain.com/docs/integrations/document_loaders/#webpages) table on the Document loaders page. Twitter: https://x.com/WilliamEspegren --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-11 21:23:03 +00:00
Yasien Dwieb	155e3740bc	fix(docs): handle collection not found error on RAG tutorial when qdrant is selected as vectorStore (#32099 ) In the [RAG Part 1 Tutorial](https://python.langchain.com/docs/tutorials/rag/), when the Qdrant vector store is selected, the sample code fails with the error `ValueError: Collection test not found`. This fix creates that collection and ensures its dimension size matches the embedding size of the selected model. --------- Co-authored-by: Mason Daugherty <mason@langchain.dev> Co-authored-by: Mason Daugherty <github@mdrxy.com> Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>	2025-08-11 20:31:24 +00:00
lineuman	afc3b1824c	docs(deepseek): Add DeepSeek model option (#32481 )	2025-08-11 09:20:39 -04:00
Kanav Bansal	9de0892a77	fix(docs): update package names across multiple integration docs (#32393 ) ## Description: Updated incorrect package names across multiple integration docs by replacing underscores with hyphens to reflect their actual names on PyPI. This aligns with the actual PyPI package names and prevents potential confusion or installation issues. ## Issue: N/A ## Dependencies: None ## Twitter handle: N/A --------- Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-08-04 17:38:29 +00:00
Kanav Bansal	84c5048cb8	fix(docs): correct package names in FeatureTables.js (#32377 ) ## Description: Updated incorrect package names in `FeatureTables.js` by replacing underscores with hyphens to reflect their actual names on PyPI. This aligns with the actual PyPI package names and prevents potential confusion or installation issues. The following package names were corrected: - `langchain_aws` ➝ `langchain-aws` - `langchain_community` ➝ `langchain-community` - `langchain_elasticsearch` ➝ `langchain-elasticsearch` - `langchain_google_community` ➝ `langchain-google-community` ## Issue: N/A ## Dependencies: None ## Twitter handle: N/A	2025-08-04 10:51:32 -04:00
Mason Daugherty	8db16b5633	fix: use new Google model names in examples (#32288 )	2025-07-28 19:03:42 +00:00
dishaprakash	a0671676ae	feat(docs): add PGVectorStore (#30950 ) Adds documentation for the new PGVectorStore as part of langchain-postgres. The notebook for PGVectorStore is added to the `docs/docs/integrations` directory. As part of this change, the VectorStore features table and VectorStoreTabs are also updated. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-07-25 13:22:58 -04:00
ccurme	6aeda24a07	docs(chroma): update feature table (#32193 ) Supports multi-tenancy.	2025-07-22 20:55:07 +00:00
ccurme	3fc27e7a95	docs: update feature table for Chroma (#32182 )	2025-07-22 18:21:17 +00:00
Copilot	fc802d8f9f	docs: fix vectorstore feature table - correct "IDs in add Documents" values (#32153 ) The vectorstore feature table in the documentation was showing incorrect information for the "IDs in add Documents" capability. Most vectorstores were marked as ❌ (not supported) when they actually support extracting IDs from documents. ## Problem The issue was an inconsistency between two sources of truth: - JavaScript feature table (`docs/src/theme/FeatureTables.js`): Hardcoded `idsInAddDocuments: false` for most vectorstores - Python script (`docs/scripts/vectorstore_feat_table.py`): Correctly showed `"IDs in add Documents": True` for most vectorstores ## Root Cause All vectorstores inherit the base `VectorStore.add_documents()` method which automatically extracts document IDs: ```python # From libs/core/langchain_core/vectorstores/base.py lines 277-284 if "ids" not in kwargs: ids = [doc.id for doc in documents] # If there's at least one valid ID, we'll assume that IDs should be used. if any(ids): kwargs["ids"] = ids ``` Since no vectorstores override `add_documents()`, they all inherit this behavior and support IDs in documents. ## Solution Updated `idsInAddDocuments` from `false` to `true` for 13 vectorstores: - AstraDBVectorStore, Chroma, Clickhouse, DatabricksVectorSearch - ElasticsearchStore, FAISS, InMemoryVectorStore, MongoDBAtlasVectorSearch - PGVector, PineconeVectorStore, Redis, Weaviate, SQLServer The other 4 vectorstores (CouchbaseSearchVectorStore, Milvus, openGauss, QdrantVectorStore) were already correctly marked as `true`. ## Impact Users visiting https://python.langchain.com/docs/integrations/vectorstores/ will now see accurate information. The "IDs in add Documents" column will correctly show ✅ for all vectorstores instead of incorrectly showing ❌ for most of them. 
This aligns with the API documentation which states: "if kwargs contains ids and documents contain ids, the ids in the kwargs will receive precedence" - clearly indicating that document IDs are supported. Fixes #30622. --------- Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com> Co-authored-by: mdrxy <61371264+mdrxy@users.noreply.github.com>	2025-07-21 20:29:34 -04:00
Kanav Bansal	50a12a7ee5	fix(docs): fix broken link in VertexAILLM and NVIDIA LLM integrations (#32096 ) ## Description: This PR updates the `link` values for the following integration metadata entries: 1. VertexAILLM - Changed from: `google_vertexai` - To: `google_vertex_ai_palm` 2. NVIDIA - Changed from: `NVIDIA` - To: `nvidia_ai_endpoints` These changes ensure that the documentation links correspond to the correct integration paths, improving documentation navigation and consistency with the integration structure. ## Issue: N/A ## Dependencies: None ## Twitter handle: N/A Co-authored-by: Mason Daugherty <mason@langchain.dev>	2025-07-18 14:00:49 +00:00
Kanav Bansal	72a0f425ec	docs(docs): correct package name from langchain-google_vertexai to langchain-google-vertexai for VertexAILLM (#32095 ) - Description: This PR updates the `package` field for the VertexAI integration in the documentation metadata. The original value was `langchain-google_vertexai`, which has been corrected to `langchain-google-vertexai` to reflect the actual package name used in PyPI and LangChain integrations. - Issue: N/A - Dependencies: None - Twitter handle: N/A	2025-07-18 09:45:28 -04:00
Mason Daugherty	a1519af513	fix(docs): fix broken links (#32083 )	2025-07-17 10:38:51 -04:00
Kanav Bansal	2c0e8dce0d	docs(docs): fix broken link in Google Gemini text embedding integration (#32082 ) - Description: Corrected the `link` path in the Google Gemini integration entry from `/docs/integrations/text_embedding/google-generative-ai` to `/docs/integrations/text_embedding/google_generative_ai` to align with actual directory structure and prevent broken documentation links. - Issue: N/A - Dependencies: None - Twitter handle: N/A	2025-07-17 09:58:07 -04:00
Nithish Raghunandanan	8bdb1de006	[docs] Update couchbase provider, vector store & features list (#31719 )	2025-07-05 13:34:48 -04:00
Anush	2d3020f6cd	docs: Update vectorstores feature matrix for Qdrant (#31786 ) ## Description - `Qdrant` vector store supports `add_documents` with IDs. - Multi-tenancy is supported via [payload filters](https://qdrant.tech/documentation/guides/multiple-partitions/) and [JWT](https://qdrant.tech/documentation/guides/security/#granular-access-control-with-jwt) if needed.	2025-06-30 14:02:07 -04:00
Eugene Yurtsev	eb08b064bb	docs: Remove giscus comments (#31755 ) Remove giscus comments from langchain	2025-06-27 09:56:55 -04:00
ccurme	7cdd53390d	docs: fix embeddings links (#31715 ) This table is referenced in multiple places, so links should be global.	2025-06-24 11:27:59 -04:00
Cheney Zhang	993e34fafb	docs: Update Milvus feature table (#31472 ) The [LangChain Milvus feature table](https://python.langchain.com/docs/integrations/vectorstores/) is not consistent with the currently implemented code, so this PR updates it. - searchByVector: the code is [here](`e29ff1bff5/libs/milvus/langchain_milvus/vectorstores/milvus.py (L1543)`) - passesStandardTests: all methods are tested (including unit tests and integration tests); see an example [here](https://github.com/langchain-ai/langchain-milvus/actions/runs/15347213828/job/43186093988), the test code is [here](https://github.com/langchain-ai/langchain-milvus/tree/main/libs/milvus/tests), and the GitHub workflow is defined [here](https://github.com/langchain-ai/langchain-milvus/blob/main/.github/workflows/_test.yml) - multiTenancy: Milvus supports different kinds of [multi-tenancy](https://milvus.io/docs/multi_tenancy.md#Implement-Multi-tenancy), which are also implemented by langchain_milvus - database level: specify the database name in [connection_args](`e29ff1bff5/libs/milvus/langchain_milvus/vectorstores/milvus.py (L374)`) - collection level: specify the collection in the [collection_name param](`e29ff1bff5/libs/milvus/langchain_milvus/vectorstores/milvus.py (L337)`) - partition level: specify the [partition-related params](`e29ff1bff5/libs/milvus/langchain_milvus/vectorstores/milvus.py (L280)`) - idsInAddDocuments: the [add document method](`e29ff1bff5/libs/milvus/langchain_milvus/vectorstores/milvus.py (L2030)`) supports an ids param (which is then passed to the add_texts method [here](`e29ff1bff5/libs/milvus/langchain_milvus/vectorstores/milvus.py (L1102)`)) @ccurme please take a look, thanks. Signed-off-by: ChengZi <chen.zhang@zilliz.com>	2025-06-03 16:56:52 -04:00
ccurme	394d42b4ae	docs: update default model (#31420 )	2025-05-29 14:28:05 -04:00
Rares Vernica	4f41b54bcb	docs:Fix Google GenAI Embedding params (#31188 ) Extend Google parameters in the embeddings tab to include Google GenAI (Gemini) Description: Update embeddings tab to include example for Google GenAI (Gemini) Issue: N/A Dependencies: N/A Twitter handle: N/A --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-05-14 08:50:11 -04:00
Michael Li	0ef4ac75b7	docs: remove duplicated and inaccurate mulvus doc (part of langchain-ai#31104) (#31154 )	2025-05-10 19:38:11 +00:00
Philipp Schmid	79a537d308	Update Chat and Embedding guides (#31017 ) Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-27 18:06:59 +00:00
Philipp Schmid	ae4b6380d9	Documentation: Add Google Gemini dropdown (#30995 ) This PR adds Google Gemini (via AI Studio and Gemini API). Feel free to change the ordering, if needed. --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-24 10:00:16 -04:00
mpb159753	bb2c2fd885	docs: Add openGauss vector store documentation (#30742 ) Hey LangChain community! 👋 Excited to propose official documentation for our new openGauss integration that brings powerful vector capabilities to the stack! ### What's Inside 📦 1. Full Integration Guide Introducing [langchain-opengauss](https://pypi.org/project/langchain-opengauss/) on PyPI - your new toolkit for: 🔍 Native hybrid search (vectors + metadata) 🚀 Production-grade connection pooling 🧩 Automatic schema management 2. Rigorous Testing Passed ✅ ![Benchmark Results](https://github.com/user-attachments/assets/ae3b21f7-aeea-4ae7-a142-f2aec57936a0) - 100% non-async test coverage ps: Current implementation resides in my personal repository: https://github.com/mpb159753/langchain-opengauss, How can I transfer process to langchain-ai org?? Keen to hear your thoughts and make this integration shine! ✨ --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev> Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>	2025-04-11 20:31:39 +00:00
Sydney Runkle	3814bd1ea7	partners: Add Perplexity Chat Integration (#30618 ) Perplexity's importance in the space has been growing, so we think it's time to add an official integration! Note: following the release of `langchain-perplexity` to `pypi`, we should be able to add `perplexity` as an extra in `libs/langchain/pyproject.toml`, but we're blocked by a circular import for now. --------- Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com> Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-04-03 16:09:14 +00:00
Brandon Luu	bbbd4e1db8	docs: Update VectorStoreTab vector store initializations (#30413 ) Description: Update vector store tab inits to match either the docs or api_ref (whichever was more comprehensive) List of changes per vector stores: - In-memory - no change - AstraDB - match to docs - docs/api_refs match (excluding embeddings) - Chroma - match to docs - api_refs is less descriptive - FAISS - match to docs - docs/api_refs match (excluding embeddings) - Milvus - match to docs to use Milvus Lite with Flat index - api_refs does not have index_param for generalization - MongoDB - match to docs - api_refs are sparser - PGVector - match to api_ref - changed to include docker cmd directly in code - docs/api_ref has comment to view docker command in separate code block - Pinecone - match to api_refs - docs have code dispersed - Qdrant - match to api_ref - docs has size=3072, api_ref has size=1536 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-22 17:29:45 -04:00
ccurme	54eab796ab	docs: update chat model tabs (#30330 )	2025-03-17 15:39:10 -04:00
Jason Zhang	49bdd3b6fe	docs: Add AgentQL provider doc, tool/toolkit doc and documentloader doc (#30144 ) - Description: Added AgentQL docs for the provider page, tools page and documentloader page - Twitter handle: @AgentQL Repo: https://github.com/tinyfish-io/agentql-integrations/tree/main/langchain PyPI: https://pypi.org/project/langchain-agentql/ --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-03-11 21:57:40 -04:00
Tiest van Gool	476cd26f57	Add xAI to ChatModelTabs drop down (#30028 ) - Description: Added `ChatXAI` to `ChatModelTabs` dropdown to improve visibility of xAI chat models (e.g., "grok-2", "grok-3"). - Issue: Follow-up to #30010 - Dependencies: none - Twitter handle: @tiestvangool --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-28 09:08:12 -05:00
Lakindu Boteju	e0e9e560b3	PyMuPDF4LLM integration to LangChain (#29953 ) ## PyMuPDF4LLM integration to LangChain for PDF content extraction in Markdown format ### Description [PyMuPDF4LLM](https://github.com/pymupdf/RAG) makes it easier to extract PDF content in Markdown format, needed for LLM & RAG applications. (License: GNU Affero General Public License v3.0) [langchain-pymupdf4llm](https://github.com/lakinduboteju/langchain-pymupdf4llm) integrates PyMuPDF4LLM to LangChain as a Document Loader. (License: MIT License) This pull request introduces the integration of [PyMuPDF4LLM](https://pymupdf.readthedocs.io/en/latest/pymupdf4llm) into the LangChain project as an integration package: [`langchain-pymupdf4llm`](https://github.com/lakinduboteju/langchain-pymupdf4llm). The most important changes include adding new Jupyter notebooks to document the integration and updating the package configuration file to include the new package. ### Documentation: * `docs/docs/integrations/providers/pymupdf4llm.ipynb`: Added a new Jupyter notebook to document the integration of `PyMuPDF4LLM` with LangChain, including installation instructions and class imports. * `docs/docs/integrations/document_loaders/pymupdf4llm.ipynb`: Added a new Jupyter notebook to document the usage of `langchain-pymupdf4llm` as a LangChain integration package in detail. ### Package registration: * `libs/packages.yml`: Updated the package configuration file to include the `langchain-pymupdf4llm` package. ### Additional information * Related to: https://github.com/langchain-ai/langchain/pull/29848 --------- Co-authored-by: Chester Curme <chester.curme@gmail.com>	2025-02-26 15:59:12 -05:00
Mateusz Szewczyk	8147679169	docs: Rename IBM product name to `IBM watsonx` (#29802 ) Rename IBM product name to `IBM watsonx`	2025-02-15 21:48:02 -05:00
Mateusz Szewczyk	8d0e31cbc5	docs: Fix `model_id` on EmbeddingTabs page (#29784 ) Fix `model_id` in IBM provider on EmbeddingTabs page	2025-02-13 09:41:51 -08:00
Mateusz Szewczyk	61f1be2152	docs: Added IBM to ChatModelTabs and EmbeddingTabs (#29774 ) Added IBM to ChatModelTabs and EmbeddingTabs	2025-02-13 08:43:42 -08:00
ccurme	0040d93b09	docs: showcase extras in chat model tabs (#29677 ) Co-authored-by: Erick Friis <erick@langchain.dev>	2025-02-07 18:16:44 -05:00
Erick Friis	eb9eddae0c	docs: use init_chat_model (#29623 )	2025-02-07 12:39:27 -08:00
Sunish Sheth	25ce1e211a	docs: Updating the imports for langchain-databricks to databricks-langchain (#29646 )	2025-02-06 13:28:07 -08:00
Erick Friis	ab67137fa3	docs: chat model order experiment (#29480 )	2025-02-03 18:55:18 +00:00
Nikhil Shahi	335ca3a606	docs: add HyperbrowserLoader docs (#29143 ) ### Description This PR adds docs for the [langchain-hyperbrowser](https://pypi.org/project/langchain-hyperbrowser/) package. It includes a document loader that uses Hyperbrowser to scrape or crawl any urls and return formatted markdown or html content as well as relevant metadata. [Hyperbrowser](https://hyperbrowser.ai) is a platform for running and scaling headless browsers. It lets you launch and manage browser sessions at scale and provides easy to use solutions for any webscraping needs, such as scraping a single page or crawling an entire site. ### Issue None ### Dependencies None ### Twitter Handle `@hyperbrowser`	2025-01-13 10:45:39 -05:00
Panos Vagenas	858f655a25	docs: add Docling loader docs (#29104 ) ### Description This adds the docs for the Docling document loader. [Docling](https://github.com/DS4SD/docling) parses PDF, DOCX, PPTX, HTML, and other formats into a rich unified representation including document layout, tables etc., making them ready for generative AI workflows like RAG. Some references: - https://research.ibm.com/blog/docling-generative-AI - https://www.redhat.com/en/blog/docling-missing-document-processing-companion-generative-ai - [Docling Technical Report](https://arxiv.org/abs/2408.09869) The introduced `DoclingLoader` enables users to: - use various document types in their LLM applications with ease and speed, and - leverage Docling's rich representation for advanced, document-native grounding. ### Issue Replacing PR #27987 as discussed with @efriis [here](https://github.com/langchain-ai/langchain/pull/27987#issuecomment-2489354930). ### Dependencies None --------- Signed-off-by: Panos Vagenas <35837085+vagenas@users.noreply.github.com>	2025-01-09 10:15:35 -05:00
Inah Jeon	fa6f08faa1	docs: Add upstage document parse loader to pdf loaders (#29099 ) Add upstage document parse loader to pdf loaders	2025-01-08 15:32:39 -05:00
Steve Kim	0fd4a68d34	docs: Update VectorStoreTabs.js (#28916 ) - Title: Fix typo to correct "embedding" to "embeddings" in PGVector initialization example - Problem: There is a typo in the example code for initializing the PGVector class. The current parameter "embedding" is incorrect as the class expects "embeddings". - Correction: The corrected code snippet is: vector_store = PGVector( embeddings=embeddings, collection_name="my_docs", connection="postgresql+psycopg://...", )	2024-12-26 14:31:58 -05:00
Erick Friis	5991b45a88	docs: change margin (#28908 )	2024-12-24 21:04:08 +00:00
Erick Friis	17f1ec8610	docs: remove console log (#28894 )	2024-12-23 21:22:21 +00:00
Erick Friis	cb4e6ac941	docs: frontmatter gen, colab/github links (#28852 )	2024-12-21 17:38:31 +00:00
fzowl	024f020f04	docs: Adding VoyageAI to 'integrations/text_embedding/' dropdown (#28817 ) Description: Adding VoyageAI's text_embedding to 'integrations/text_embedding/'	2024-12-19 09:29:30 -05:00
ccurme	9c55c75eb5	docs: dropdowns for embeddings and vector stores (#28713 )	2024-12-13 16:48:02 -05:00
ccurme	4802c31a53	docs: update intro page (#28639 )	2024-12-13 15:24:14 -05:00


135 Commits
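
The ID handling quoted in commit fc802d8f9f (#32153) can be illustrated with a minimal, self-contained sketch. Only the three-line `ids` logic comes from that commit message; the `Doc` class and the `resolve_ids` helper are hypothetical stand-ins written for this example and are not LangChain APIs.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class Doc:
    """Hypothetical stand-in for a document with an optional ID."""

    page_content: str
    id: Optional[str] = None


def resolve_ids(documents: list[Doc], **kwargs: Any) -> dict[str, Any]:
    """Return kwargs, filling in "ids" from the documents when absent.

    Hypothetical helper that mirrors the logic quoted in the commit message.
    """
    if "ids" not in kwargs:
        ids = [doc.id for doc in documents]
        # If at least one document carries an ID, assume IDs should be used.
        if any(ids):
            kwargs["ids"] = ids
    return kwargs


docs = [Doc("alpha", id="a1"), Doc("beta", id="b2")]
from_docs = resolve_ids(docs)                  # IDs taken from the documents
explicit = resolve_ids(docs, ids=["x", "y"])   # explicit kwargs are left untouched
no_ids = resolve_ids([Doc("gamma")])           # no document IDs, so no "ids" key
```

Under these assumptions, explicitly passed `ids` are never overwritten, which matches the precedence rule quoted from the API documentation in the same commit message.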