Taqi Jaffri b7290f01d8 Batching for hf_pipeline (#10795)
The Hugging Face pipeline in LangChain (used for locally hosted models)
does not support batching. If you send in a batch of prompts, it just
processes them serially using the base implementation of `_generate`:
https://github.com/docugami/langchain/blob/master/libs/langchain/langchain/llms/base.py#L1004C2-L1004C29
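
For context, the base fallback is shaped roughly like the sketch below (a simplified paraphrase of the linked `_generate`, not the verbatim source): each prompt goes through `_call` one at a time, so the GPU is never given a batch.

```python
from langchain.schema import Generation, LLMResult

# Simplified sketch of the base LLM._generate fallback:
# iterate over the prompts serially, one _call per prompt.
def _generate(self, prompts, stop=None, run_manager=None, **kwargs):
    generations = []
    for prompt in prompts:  # serial loop -- no GPU batching happens here
        text = self._call(prompt, stop=stop, run_manager=run_manager, **kwargs)
        generations.append([Generation(text=text)])
    return LLMResult(generations=generations)
```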

This PR adds support for batching in this pipeline, so that GPUs can be
fully saturated. I updated the accompanying notebook to show GPU batch
inference.
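
As a minimal usage sketch: the `batch_size` argument below is this sketch's assumption about how the new batching is exposed on `HuggingFacePipeline.from_model_id`; see the updated notebook for the actual knob and values.

```python
from langchain.llms import HuggingFacePipeline

# Assumption: batch_size controls how many prompts are sent to the GPU
# per forward pass (check the accompanying notebook for the real API).
gpu_llm = HuggingFacePipeline.from_model_id(
    model_id="gpt2",
    task="text-generation",
    device=0,       # first CUDA device; -1 would mean CPU
    batch_size=4,   # prompts per batched forward pass
)

prompts = [f"What is the number after {i}?" for i in range(8)]
result = gpu_llm.generate(prompts)  # prompts are now batched on the GPU
for gen in result.generations:
    print(gen[0].text)
```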

---------

Co-authored-by: Taqi Jaffri <tjaffri@docugami.com>
2023-09-25 18:23:11 +01:00