Readme rewrite (#12615)

Co-authored-by: Lance Martin <lance@langchain.dev> Co-authored-by: Harrison Chase <hw.chase.17@gmail.com>
2025-09-06 21:43:44 +00:00 · 2023-10-31 00:06:02 -07:00
parent 00766c9f31
commit a1fae1fddd
60 changed files with 2669 additions and 675 deletions
--- a/templates/llama2-functions/README.md
+++ b/templates/llama2-functions/README.md
@@ -1,18 +1,70 @@
-# Extraction with LLaMA2 Function Calling

-This template shows how to do extraction of structured data from unstructured data, using LLaMA2 [fine-tuned for grammars and jsonschema](https://replicate.com/andreasjansson/llama-2-13b-chat-gguf).
+# llama2-functions

-[Query transformations](https://blog.langchain.dev/query-transformations/) are one great application area for open source, private LLMs:
+This template performs extraction of structured data from unstructured data using a [LLaMA2 model that supports a specified JSON output schema](https://github.com/ggerganov/llama.cpp/blob/master/grammars/README.md). 

-* The tasks are often narrow and well-defined (e.g., generatae multiple questions from a user input)
-* They also are tasks that users may want to run locally (e.g., in a RAG workflow)
+The extraction schema can be set in `chain.py`.

-Specify the scehma you want to extract in `chain.py`
+## Environment Setup

-##  LLM
+This will use a [LLaMA2-13b model hosted by Replicate](https://replicate.com/andreasjansson/llama-2-13b-chat-gguf/versions).

-This template will use a `Replicate` [hosted version](https://replicate.com/andreasjansson/llama-2-13b-chat-gguf) of LLaMA2 that has support for grammars and jsonschema. 
+Ensure that `REPLICATE_API_TOKEN` is set in your environment.

-Based on the `Replicate` example, the JSON schema is supplied directly in the prompt.
+## Usage

-Be sure that `REPLICATE_API_TOKEN` is set in your environment.
+To use this package, you should first have the LangChain CLI installed:
+
+```shell
+pip install -U "langchain-cli[serve]"
+```
+
+To create a new LangChain project and install this as the only package, you can do:
+
+```shell
+langchain app new my-app --package llama2-functions
+```
+
+If you want to add this to an existing project, you can just run:
+
+```shell
+langchain app add llama2-functions
+```
+
+And add the following code to your `server.py` file:
+```python
+from llama2_functions import chain as llama2_functions_chain
+
+add_routes(app, llama2_functions_chain, path="/llama2-functions")
+```
+
+(Optional) Let's now configure LangSmith. 
+LangSmith will help us trace, monitor and debug LangChain applications. 
+LangSmith is currently in private beta, you can sign up [here](https://smith.langchain.com/). 
+If you don't have access, you can skip this section
+
+```shell
+export LANGCHAIN_TRACING_V2=true
+export LANGCHAIN_API_KEY=<your-api-key>
+export LANGCHAIN_PROJECT=<your-project>  # if not specified, defaults to "default"
+```
+
+If you are inside this directory, then you can spin up a LangServe instance directly by:
+
+```shell
+langchain serve
+```
+
+This will start the FastAPI app with a server is running locally at 
+[http://localhost:8000](http://localhost:8000)
+
+We can see all templates at [http://127.0.0.1:8000/docs](http://127.0.0.1:8000/docs)
+We can access the playground at [http://127.0.0.1:8000/llama2-functions/playground](http://127.0.0.1:8000/llama2-functions/playground)  
+
+We can access the template from code with:
+
+```python
+from langserve.client import RemoteRunnable
+
+runnable = RemoteRunnable("http://localhost:8000/llama2-functions")
+```