Chetanya Rastogi
50c511d75f
Add new loader to load pdf as html content (#2607)
Adds a new PDF loader using the existing dependency on PDFMiner.
The new loader can help with chunking text semantically into
sections, because the output HTML can be parsed with `BeautifulSoup` to
recover structured information such as font size, page numbers, and
PDF headers/footers, which may not be available from other PDF
loaders.
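
A rough sketch of the kind of font-size based sectioning the description has in mind. It is not the loader added in this commit: the sample HTML is a hypothetical string shaped like PDFMiner's HTML output, `SpanCollector` and `group_by_font_size` are illustrative names, and the standard-library `HTMLParser` stands in for `BeautifulSoup` so the snippet runs without extra dependencies.

```python
import re
from html.parser import HTMLParser

# Hypothetical sample shaped like PDFMiner's HTML output (an assumption,
# not real output from the loader in this commit).
SAMPLE_HTML = (
    '<div><span style="font-family: Helvetica-Bold; font-size:18px">Introduction<br></span></div>'
    '<div><span style="font-family: Helvetica; font-size:10px">First paragraph. <br></span>'
    '<span style="font-family: Helvetica; font-size:10px">More body text.<br></span></div>'
    '<div><span style="font-family: Helvetica-Bold; font-size:18px">Methods<br></span></div>'
    '<div><span style="font-family: Helvetica; font-size:10px">Details here.<br></span></div>'
)

FONT_SIZE = re.compile(r"font-size:\s*(\d+)px")


class SpanCollector(HTMLParser):
    """Collect (font_size, text) for every <span> that declares a font size."""

    def __init__(self):
        super().__init__()
        self.spans = []
        self._size = None  # font size of the span currently open, if any
        self._buf = []

    def handle_starttag(self, tag, attrs):
        if tag == "span":
            match = FONT_SIZE.search(dict(attrs).get("style") or "")
            self._size = int(match.group(1)) if match else None
            self._buf = []

    def handle_data(self, data):
        if self._size is not None:
            self._buf.append(data)

    def handle_endtag(self, tag):
        if tag == "span" and self._size is not None:
            text = "".join(self._buf).strip()
            if text:
                self.spans.append((self._size, text))
            self._size = None


def group_by_font_size(html):
    """Merge consecutive spans that share a font size into one chunk."""
    parser = SpanCollector()
    parser.feed(html)
    parser.close()
    sections = []
    for size, text in parser.spans:
        if sections and sections[-1][0] == size:
            sections[-1] = (size, sections[-1][1] + " " + text)
        else:
            sections.append((size, text))
    return sections


if __name__ == "__main__":
    for size, text in group_by_font_size(SAMPLE_HTML):
        print(size, text)
```

In this sketch a change in font size marks a boundary between a heading and body text, so larger-font chunks can be treated as section titles. With a real PDF, the HTML string would presumably come from the new loader or from PDFMiner's `extract_text_to_fp` called with `output_type="html"`, and the same grouping could be written against `BeautifulSoup` by iterating over `span` elements and reading their `style` attribute.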
2023-04-09 17:57:25 -07:00