mirror of https://github.com/hwchase17/langchain.git synced 2025-10-22 01:32:24 +00:00

Files

Martin Krasser 79ed66f870 EXPERIMENTAL Generic LLM wrapper to support chat model interface with configurable chat prompt format (#8295 )

## Update 2023-09-08

This PR now supports further models in addition to Lllama-2 chat models.
See [this comment](#issuecomment-1668988543) for further details. The
title of this PR has been updated accordingly.

## Original PR description

This PR adds a generic `Llama2Chat` model, a wrapper for LLMs able to
serve Llama-2 chat models (like `LlamaCPP`,
`HuggingFaceTextGenInference`, ...). It implements `BaseChatModel`,
converts a list of chat messages into the [required Llama-2 chat prompt
format](https://huggingface.co/blog/llama2#how-to-prompt-llama-2) and
forwards the formatted prompt as `str` to the wrapped `LLM`. Usage
example:

```python
# uses a locally hosted Llama2 chat model
llm = HuggingFaceTextGenInference(
    inference_server_url="http://127.0.0.1:8080/",
    max_new_tokens=512,
    top_k=50,
    temperature=0.1,
    repetition_penalty=1.03,
)

# Wrap llm to support Llama2 chat prompt format.
# Resulting model is a chat model
model = Llama2Chat(llm=llm)

messages = [
    SystemMessage(content="You are a helpful assistant."),
    MessagesPlaceholder(variable_name="chat_history"),
    HumanMessagePromptTemplate.from_template("{text}"),
]

prompt = ChatPromptTemplate.from_messages(messages)
memory = ConversationBufferMemory(memory_key="chat_history", return_messages=True)
chain = LLMChain(llm=model, prompt=prompt, memory=memory)

# use chat model in a conversation
# ...
```

Also part of this PR are tests and a demo notebook.

- Tag maintainer: @hwchase17
- Twitter handle: `@mrt1nz`

---------

Co-authored-by: Erick Friis <erick@langchain.dev>

2023-11-17 16:32:13 -08:00

api_reference

api doc newlines (#13378 )

2023-11-14 19:16:31 -08:00

docs

EXPERIMENTAL Generic LLM wrapper to support chat model interface with configurable chat prompt format (#8295 )

2023-11-17 16:32:13 -08:00

scripts

DOCS: format notebooks (#13371 )

2023-11-14 14:17:44 -08:00

src

add cookbook table (#12043 )

2023-10-19 14:05:24 -07:00

static

DOCS: langchain stack img update (#13421 )

2023-11-15 14:10:02 -08:00

.local_build.sh

Harrison/docs smith serve (#12898 )

2023-11-06 07:07:25 -08:00

babel.config.js

…

code-block-loader.js

…

docusaurus.config.js

FEAT docs integration cards site (#13379 )

2023-11-14 19:49:17 -08:00

package-lock.json

Upgrade docs postcss (#13031 )

2023-11-07 15:50:25 -08:00

package.json

…

README.md

…

settings.ini

…

sidebars.js

template readme's in docs (#13152 )

2023-11-09 23:36:21 -08:00

vercel_build.sh

template readme's in docs (#13152 )

2023-11-09 23:36:21 -08:00

vercel_requirements.txt

…

vercel.json

DOCS updated async-faiss example (#13434 )

2023-11-16 17:41:26 -08:00

README.md

Website

This website is built using Docusaurus 2, a modern static website generator.

Installation

$ yarn

Local Development

$ yarn start

This command starts a local development server and opens up a browser window. Most changes are reflected live without having to restart the server.

Build

$ yarn build

This command generates static content into the build directory and can be served using any static contents hosting service.

Deployment

Using SSH:

$ USE_SSH=true yarn deploy

Not using SSH:

$ GIT_USER=<Your GitHub username> yarn deploy

If you are using GitHub pages for hosting, this command is a convenient way to build the website and push to the gh-pages branch.

Continuous Integration

Some common defaults for linting/formatting have been set for you. If you integrate your project with an open-source Continuous Integration system (e.g. Travis CI, CircleCI), you may check for issues using the following command.

$ yarn ci