mirror of
https://github.com/hwchase17/langchain.git
synced 2025-05-19 14:01:50 +00:00
# Unstructured XML Loader

Adds an `UnstructuredXMLLoader` class for .xml files. Works with unstructured>=0.6.7. A plain text representation of the text with the XML tags will be available under the `page_content` attribute in the doc.

### Testing

```python
from langchain.document_loaders import UnstructuredXMLLoader

loader = UnstructuredXMLLoader(
    "example_data/factbook.xml",
)
docs = loader.load()
```

## Who can review?

@hwchase17 @eyurtsev

- fake_discord_data
- test_repo1@7e525a3b91
- conllu.conllu
- facebook_chat.json
- factbook.xml
- fake_conversations.json
- fake_rule.toml
- fake-content.html
- fake-email.eml
- fake-email.msg
- fake-power-point.pptx
- fake.docx
- fake.odt
- layout-parser-paper.pdf
- mlb_teams_2012.csv
- notebook.ipynb
- sitemap.xml
- stanley-cups.xlsx
- telegram.json
- testing.enex
- testmw_pages_current.xml
- whatsapp_chat.txt