langchain/docs/extras
Pau Ramon Revilla 87802c86d9
Added a MHTML document loader (#6311)
MHTML is an interesting format because it is used both for emails and
for archived webpages. Some scraping projects want to store pages on
disk to process them later, and MHTML is perfect for that use case.

This is heavily inspired by the BeautifulSoup HTML loader, but it
first extracts the HTML part from the MHTML file.
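
The idea described above (pull the HTML part out of the MHTML container, then extract its text) can be sketched as follows. This is a minimal illustration and not the loader added in this commit: the names `extract_html_from_mhtml`, `mhtml_to_text`, `_TextExtractor`, and `SAMPLE` are hypothetical, and the standard-library `html.parser` stands in for BeautifulSoup so the sketch has no third-party dependencies. It relies on the fact that an MHTML file is a MIME multipart message, so Python's `email` package can parse it.

```python
import email
from email import policy
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text; a stdlib stand-in for BeautifulSoup (assumption)."""

    def __init__(self):
        super().__init__()
        self.chunks = []
        self._skip_depth = 0  # > 0 while inside <script> or <style>

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        # Keep non-empty text that is not inside a skipped element.
        if not self._skip_depth and data.strip():
            self.chunks.append(data.strip())


def extract_html_from_mhtml(raw: bytes) -> str:
    """Return the first text/html part of an MHTML (MIME multipart) payload."""
    message = email.message_from_bytes(raw, policy=policy.default)
    for part in message.walk():
        if part.get_content_type() == "text/html":
            # get_content() undoes the transfer encoding and charset.
            return part.get_content()
    raise ValueError("no text/html part found")


def mhtml_to_text(raw: bytes) -> str:
    """Extract the HTML part, then reduce it to newline-separated text."""
    parser = _TextExtractor()
    parser.feed(extract_html_from_mhtml(raw))
    parser.close()
    return "\n".join(parser.chunks)


# A tiny hand-written MHTML payload, used only for illustration.
SAMPLE = (
    b"MIME-Version: 1.0\r\n"
    b'Content-Type: multipart/related; boundary="BOUNDARY"\r\n'
    b"\r\n"
    b"--BOUNDARY\r\n"
    b'Content-Type: text/html; charset="utf-8"\r\n'
    b"Content-Transfer-Encoding: quoted-printable\r\n"
    b"\r\n"
    b"<html><head><title>Archived page</title></head>"
    b"<body><p>Hello =3D MHTML</p></body></html>\r\n"
    b"--BOUNDARY--\r\n"
)

if __name__ == "__main__":
    print(mhtml_to_text(SAMPLE))
```

Parsing with the `email` package rather than splitting on boundaries by hand means the quoted-printable or base64 transfer encoding that saved pages typically use is decoded for free.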

---------

Co-authored-by: rlm <pexpresss31@gmail.com>
2023-06-25 13:12:08 -07:00
_templates Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
additional_resources Doc refactor (#6300) 2023-06-16 11:52:56 -07:00
ecosystem Fix Typo in LangChain MyScale Integration Doc (#6705) 2023-06-25 11:54:00 -07:00
guides Add OpenLLM wrapper(#6578) 2023-06-22 01:18:14 -07:00
modules Added a MHTML document loader (#6311) 2023-06-25 13:12:08 -07:00
use_cases MD header text splitter returns Documents (#6571) 2023-06-22 09:25:38 -07:00