mirror of https://github.com/hwchase17/langchain.git synced 2025-08-01 09:04:03 +00:00

⚡ Building applications with LLMs through composability ⚡

Go to file

Philippe PRADOS 6dd621d636 community[minor]: Add CloudBlobLoader that supports loading data from cloud buckets (#21957 ) Thank you for contributing to LangChain! - [ ] PR title: "Add CloudBlobLoader" - community: Add CloudBlobLoader - [ ] PR message: Add cloud blob loader - Description: Langchain provides several approaches to read different file formats: Specific loaders (`CVSLoader`) or blob-compatible loaders (`FileSystemBlobLoader`). The only implementation proposed for BlobLoader is `FileSystemBlobLoader`. Many projects retrieve files from cloud storage. We propose a new implementation of `BlobLoader` to read files from the three cloud storage systems. The interface is strictly identical to `FileSystemBlobLoader`. The only difference is the constructor, which takes a cloud "url" object such as `s3://my-bucket`, `az://my-bucket`, or `gs://my-bucket`. By streamlining the process, this novel implementation eliminates the requirement to pre-download files from cloud storage to local temporary files (which are seldom removed). The code relies on the [CloudPathLib](https://cloudpathlib.drivendata.org/stable/) library to interpret cloud URLs. This has been added as an optional dependency. ```Python loader = CloudBlobLoader("s3://mybucket/id") for blob in loader.yield_blobs(): print(blob) ``` - [X] Dependencies: CloudPathLib - [X] Twitter handle: pprados - [X] Add tests and docs: Add unit test, but it's easy to convert to integration test, with some files in a cloud storage (see `test_cloud_blob_loader.py`) - [X] Lint and test: Run `make format`, `make lint` and `make test` from the root of the package(s) you've modified. Hello from Paris @hwchase17. Can you review this PR? --------- Co-authored-by: Eugene Yurtsev <eugene@langchain.dev>		2024-05-23 10:59:55 -04:00
.devcontainer	infra: Sync devcontainer.json and compose file mount location (#20461 )	2024-05-01 01:32:12 -04:00
.github	infra: rm unused # noqa violations (#22049 )	2024-05-22 15:21:08 -07:00
cookbook	infra: rm unused # noqa violations (#22049 )	2024-05-22 15:21:08 -07:00
docker	community[minor]: Add VDMS vectorstore (#19551 )	2024-03-28 03:12:11 +00:00
docs	docs: concepts callbacks fix admonition (#22048 )	2024-05-22 20:33:28 -04:00
libs	community[minor]: Add CloudBlobLoader that supports loading data from cloud buckets (#21957 )	2024-05-23 10:59:55 -04:00
templates	infra: rm unused # noqa violations (#22049 )	2024-05-22 15:21:08 -07:00
.gitattributes
.gitignore	Add docstrings for Clickhouse class methods (#19195 )	2024-03-19 04:03:12 +00:00
.readthedocs.yaml	infra: update rtd yaml (#17502 )	2024-02-13 18:16:44 -08:00
CITATION.cff
LICENSE	Library Licenses (#13300 )	2023-11-28 17:34:27 -08:00
Makefile	docs: Remove unnecessary comment marks from the Makefile help section (#21749 )	2024-05-16 09:05:44 -07:00
MIGRATE.md	Update main readme (#13298 )	2023-11-13 17:37:54 -08:00
poetry.lock	community[minor]: Add CloudBlobLoader that supports loading data from cloud buckets (#21957 )	2024-05-23 10:59:55 -04:00
poetry.toml
pyproject.toml	partner[patch]: Upgrade to Ruff v0.4.2 (#21108 )	2024-04-30 15:06:42 -04:00
README.md	README: Update downloads to show downloads of langchain-core (#21387 )	2024-05-13 11:26:50 -04:00
SECURITY.md	Updated security policy (#19089 )	2024-03-14 20:58:47 +00:00

README.md

🦜️🔗 LangChain

⚡ Build context-aware reasoning applications ⚡

Looking for the JS/TS library? Check out LangChain.js.

To help you ship LangChain apps to production faster, check out LangSmith. LangSmith is a unified developer platform for building, testing, and monitoring LLM applications. Fill out this form to speak with our sales team.

Quick Install

With pip:

pip install langchain

With conda:

conda install langchain -c conda-forge

🤔 What is LangChain?

LangChain is a framework for developing applications powered by large language models (LLMs).

For these applications, LangChain simplifies the entire application lifecycle:

Open-source libraries: Build your applications using LangChain's modular building blocks and components. Integrate with hundreds of third-party providers.
Productionization: Inspect, monitor, and evaluate your apps with LangSmith so that you can constantly optimize and deploy with confidence.
Deployment: Turn any chain into a REST API with LangServe.

Open-source libraries

langchain-core: Base abstractions and LangChain Expression Language.
langchain-community: Third party integrations.
- Some integrations have been further split into partner packages that only rely on langchain-core. Examples include langchain_openai and langchain_anthropic.
langchain: Chains, agents, and retrieval strategies that make up an application's cognitive architecture.
LangGraph: A library for building robust and stateful multi-actor applications with LLMs by modeling steps as edges and nodes in a graph.

Productionization:

LangSmith: A developer platform that lets you debug, test, evaluate, and monitor chains built on any LLM framework and seamlessly integrates with LangChain.

Deployment:

LangServe: A library for deploying LangChain chains as REST APIs.

🧱 What can you build with LangChain?

❓ Question answering with RAG

Documentation
End-to-end Example: Chat LangChain and repo

🧱 Extracting structured output

Documentation
End-to-end Example: SQL Llama2 Template

🤖 Chatbots

Documentation
End-to-end Example: Web LangChain (web researcher chatbot) and repo

And much more! Head to the Use cases section of the docs for more.

🚀 How does LangChain help?

The main value props of the LangChain libraries are:

Components: composable building blocks, tools and integrations for working with language models. Components are modular and easy-to-use, whether you are using the rest of the LangChain framework or not
Off-the-shelf chains: built-in assemblages of components for accomplishing higher-level tasks

Off-the-shelf chains make it easy to get started. Components make it easy to customize existing chains and build new ones.

LangChain Expression Language (LCEL)

LCEL is the foundation of many of LangChain's components, and is a declarative way to compose chains. LCEL was designed from day 1 to support putting prototypes in production, with no code changes, from the simplest “prompt + LLM” chain to the most complex chains.

Overview: LCEL and its benefits
Interface: The standard interface for LCEL objects
Primitives: More on the primitives LCEL includes

Components

Components fall into the following modules:

📃 Model I/O:

This includes prompt management, prompt optimization, a generic interface for chat models and LLMs, and common utilities for working with model outputs.

📚 Retrieval:

Retrieval Augmented Generation involves loading data from a variety of sources, preparing it, then retrieving it for use in the generation step.

🤖 Agents:

Agents allow an LLM autonomy over how a task is accomplished. Agents make decisions about which Actions to take, then take that Action, observe the result, and repeat until the task is complete done. LangChain provides a standard interface for agents, a selection of agents to choose from, and examples of end-to-end agents.

📖 Documentation

Please see here for full documentation, which includes:

Getting started: installation, setting up the environment, simple examples
Use case walkthroughs and best practice guides
Overviews of the interfaces, components, and integrations

You can also check out the full API Reference docs.

🌐 Ecosystem

🦜🛠️ LangSmith: Tracing and evaluating your language model applications and intelligent agents to help you move from prototype to production.
🦜🕸️ LangGraph: Creating stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain primitives.
🦜🏓 LangServe: Deploying LangChain runnables and chains as REST APIs.
- LangChain Templates: Example applications hosted with LangServe.

💁 Contributing

As an open-source project in a rapidly developing field, we are extremely open to contributions, whether it be in the form of a new feature, improved infrastructure, or better documentation.

For detailed information on how to contribute, see here.

README.md Unescape Escape