langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-04-23 20:23:59 +00:00

Armin Stepanyan 641efcf41c community: add runtime kwargs to HuggingFacePipeline (#17005)

This PR makes it possible to change the behaviour of a Hugging Face
pipeline between calls. Before this PR there was no way to change, for
example, the maximum generation length between invocations of the chain.
This is useful when, for instance, the maximum output size should scale
with a dynamic prompt size.

Usage example:

```python
from langchain_community.llms.huggingface_pipeline import HuggingFacePipeline
from transformers import AutoModelForCausalLM, AutoTokenizer, pipeline

# Build a transformers text-generation pipeline around a small causal LM.
model_id = "gpt2"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
pipe = pipeline("text-generation", model=model, tokenizer=tokenizer)

# Wrap the pipeline as a LangChain LLM.
hf = HuggingFacePipeline(pipeline=pipe)

# Pass generation settings at call time through pipeline_kwargs.
hf("Say foo:", pipeline_kwargs={"max_new_tokens": 42})
```
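
For the dynamic prompt size case mentioned above, one possible approach is to derive `max_new_tokens` from the prompt's token count on each call. The sketch below is illustrative only: the helper names `budget_max_new_tokens` and `generate_with_budget` and the cap of 256 tokens are assumptions and not part of this PR; the default window of 1024 tokens matches gpt2's context length.

```python
def budget_max_new_tokens(
    prompt_token_count: int,
    context_window: int = 1024,  # gpt2's context length; adjust for other models
    ceiling: int = 256,          # hypothetical upper bound on generated tokens
) -> int:
    """Return a max_new_tokens value that fits in the remaining context.

    The result is capped at `ceiling` and never drops below 1.
    """
    remaining = context_window - prompt_token_count
    return max(1, min(ceiling, remaining))


def generate_with_budget(hf, tokenizer, prompt: str):
    """Call the LLM with a per-call token budget (hypothetical helper).

    `hf` is assumed to be a HuggingFacePipeline instance and `tokenizer`
    the tokenizer used to build its pipeline, as in the usage example.
    """
    prompt_token_count = len(tokenizer.encode(prompt))
    max_new_tokens = budget_max_new_tokens(prompt_token_count)
    return hf(prompt, pipeline_kwargs={"max_new_tokens": max_new_tokens})
```

With the objects from the usage example, this would be called as `generate_with_budget(hf, tokenizer, "Say foo:")`. Short prompts get the full cap, and prompts close to the context limit get a smaller budget.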

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>

2024-02-08 13:58:31 -08:00

| Name | Last commit | Date |
| --- | --- | --- |
| cli | | 2024-02-07 14:52:37 -08:00 |
| community | community: add runtime kwargs to HuggingFacePipeline (#17005) | 2024-02-08 13:58:31 -08:00 |
| core | langchain[minor], core[minor]: update json, pydantic parser. add openai-json structured output runnable (#16914) | 2024-02-08 11:59:06 -08:00 |
| experimental | community[minor]: SQLDatabase Add fetch mode cursor, query parameters, query by selectable, expose execution options, and documentation (#17191) | 2024-02-07 22:23:43 -05:00 |
| langchain | langchain: adds recursive json splitter (#17144) | 2024-02-08 13:45:34 -08:00 |
| partners | google-genai[patch]: added parsing of function call / response (#17245) | 2024-02-08 13:34:46 -08:00 |