langchain

mirror of https://github.com/hwchase17/langchain.git synced 2026-06-09 18:50:33 +00:00

Files

qonnop 747efa16ec community: fix CPU support for FasterWhisperParser (implicit compute type for WhisperModel) (#30263 )

FasterWhisperParser fails on a machine without an NVIDIA GPU: "Requested
float16 compute type, but the target device or backend do not support
efficient float16 computation." This problem arises because the
WhisperModel is called with compute_type="float16", which works only for
NVIDIA GPU.

According to the [CTranslate2
docs](https://opennmt.net/CTranslate2/quantization.html#bit-floating-points-float16)
float16 is supported only on NVIDIA GPUs. Removing the compute_type
parameter solves the problem for CPUs. According to the [CTranslate2
docs](https://opennmt.net/CTranslate2/quantization.html#quantize-on-model-loading)
setting compute_type to "default" (standard when omitting the parameter)
uses the original compute type of the model or performs implicit
conversion for the specific computation device (GPU or CPU). I suggest
to remove compute_type="float16".

@hulitaitai you are the original author of the FasterWhisperParser - is
there a reason for setting the parameter to float16?

Thanks for reviewing the PR!

Co-authored-by: qonnop <qonnop@users.noreply.github.com>

2025-03-14 22:22:29 -04:00

cli

cli: update integration doc template for tools (#30188 )

2025-03-09 21:14:43 +00:00

community

community: fix CPU support for FasterWhisperParser (implicit compute type for WhisperModel) (#30263 )

2025-03-14 22:22:29 -04:00

core

core: release 0.3.45 (#30277 )

2025-03-13 22:44:23 +00:00

experimental

experimental: migrate to external repo (#26879 )