mirror of
https://github.com/hwchase17/langchain.git
synced 2025-05-31 20:19:43 +00:00
there is a case where "coords" does not exist in the "sentence" therefore, the "split(";")" will lead to error. we can fix that by adding "if sentence.get("coords") is not None:" the resulting empty "sbboxes" from this scenario will raise error at "sbboxes[0]["page"]" because sbboxes are empty. the PDF from https://pubmed.ncbi.nlm.nih.gov/23970373/ can replicate those errors. |
||
---|---|---|
.. | ||
html | ||
language | ||
__init__.py | ||
audio.py | ||
doc_intelligence.py | ||
docai.py | ||
generic.py | ||
grobid.py | ||
msword.py | ||
pdf.py | ||
registry.py | ||
txt.py |