Files
langchain/tests/integration_tests
Chetanya Rastogi 50c511d75f Add new loader to load pdf as html content (#2607)
Adds a new pdf loader using the existing dependency on PDFMiner. 

The new loader can be helpful for chunking texts semantically into
sections as the output html content can be parsed via `BeautifulSoup` to
get more structured and rich information about font size, page numbers,
pdf headers/footers, etc. which may not be available otherwise with
other pdf loaders
2023-04-09 17:57:25 -07:00
..
2023-04-02 14:57:45 -07:00
2023-04-04 06:48:34 -07:00
2023-04-06 14:41:06 -07:00
2023-03-29 10:10:09 -07:00
2022-10-24 14:51:15 -07:00
2023-02-20 21:15:45 -08:00