langchain/libs/community/langchain_community/document_loaders/parsers
Philippe PRADOS 2921597c71
community[patch]: Refactoring PDF loaders: 01 prepare (#29062)
- **Refactoring PDF loaders step 1**: "community: Refactoring PDF
loaders to standardize approaches"

- **Description:** Declare CloudBlobLoader in __init__.py. file_path is
Union[str, PurePath] anywhere
- **Twitter handle:** pprados

This is one part of a larger Pull Request (PR) that is too large to be
submitted all at once.
This specific part focuses to prepare the update of all parsers.

For more details, see [PR
28970](https://github.com/langchain-ai/langchain/pull/28970).

@eyurtsev it's the start of a PR series.
2025-01-07 11:00:04 -05:00
..
html docs: community docstring updates (#21040) 2024-04-29 17:40:23 -04:00
language Langchain_Community: SQL LanguageParser (#28430) 2024-12-19 20:30:57 +00:00
__init__.py community[patch]: Fix remaining __inits__ in community (#22037) 2024-05-22 17:42:17 +00:00
audio.py community: add AzureOpenAIWhisperParser (#27796) 2024-10-31 12:37:41 -04:00
doc_intelligence.py (Community): Fix Keyword argument for AzureAIDocumentIntelligenceParser (#28959) 2025-01-02 11:27:12 -05:00
docai.py multiple: update removal targets (#25361) 2024-08-14 09:50:39 -04:00
documentloader_adapter.py community: DocumentLoaderAsParser wrapper (#27749) 2024-12-18 12:47:08 -05:00
generic.py docs: fix mimetype parser docstring (#25463) 2024-08-15 16:16:52 -07:00
grobid.py Update grobid.py (#23399) 2024-06-26 09:11:02 -04:00
msword.py all: test 3.13 ci (#27197) 2024-10-25 12:56:58 -07:00
pdf.py community[patch]: Refactoring PDF loaders: 01 prepare (#29062) 2025-01-07 11:00:04 -05:00
registry.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00
txt.py infra: update mypy 1.10, ruff 0.5 (#23721) 2024-07-03 10:33:27 -07:00
vsdx.py infra: add print rule to ruff (#16221) 2024-02-09 16:13:30 -08:00