Create OpenAIWhisperParser for generating Documents from audio files (#5580)

# OpenAIWhisperParser

This PR creates a new parser, `OpenAIWhisperParser`, that uses the
[OpenAI Whisper
model](https://platform.openai.com/docs/guides/speech-to-text/quickstart)
to perform transcription of audio files to text (`Documents`). Please
see the notebook for usage.
This commit is contained in:
Lance Martin
2023-06-05 15:51:13 -07:00
committed by GitHub
parent a4c9053d40
commit aea090045b
5 changed files with 122 additions and 0 deletions

View File

@@ -5,6 +5,7 @@ def test_parsers_public_api_correct() -> None:
"""Test public API of parsers for breaking changes."""
assert set(__all__) == {
"BS4HTMLParser",
"OpenAIWhisperParser",
"PyPDFParser",
"PDFMinerParser",
"PyMuPDFParser",