langchain/docs
Martin Schade 0c7f1d8b21
Textract linearizer (#12446)
**Description:** Textract PDF Loader generating linearized output,
meaning it will replicate the structure of the source document as close
as possible based on the features passed into the call (e. g. LAYOUT,
FORMS, TABLES). With LAYOUT reading order for multi-column documents or
identification of lists and figures is supported and with TABLES it will
generate the table structure as well. FORMS will indicate "key: value"
with columms.
  - **Issue:** the issue fixes #12068 
- **Dependencies:** amazon-textract-textractor is added, which provides
the linearization
  - **Tag maintainer:** @3coins 

---------

Co-authored-by: Bagatur <baskaryan@gmail.com>
2023-10-30 18:02:10 -07:00
..
api_reference Merge pull request #12433 2023-10-29 21:22:36 -04:00
docs Textract linearizer (#12446) 2023-10-30 18:02:10 -07:00
docs_skeleton/docs/guides/langsmith mv old integration docs (#12217) 2023-10-24 12:38:16 -07:00
extras/guides/langsmith Bagatur/mv singlestore doc (#12053) 2023-10-19 15:06:26 -07:00
scripts notebook fmt (#12498) 2023-10-29 15:50:09 -07:00
src add cookbook table (#12043) 2023-10-19 14:05:24 -07:00
static Docs: QA Privacy Nit (#12025) 2023-10-19 09:43:47 -04:00
.local_build.sh langserve doc (#12357) 2023-10-26 11:40:57 -07:00
babel.config.js Restructure docs (#11620) 2023-10-10 12:55:19 -07:00
code-block-loader.js Restructure docs (#11620) 2023-10-10 12:55:19 -07:00
docusaurus.config.js Add dev guide to docs(#12291) 2023-10-25 12:28:43 -07:00
package-lock.json Bump @babel/traverse from 7.22.8 to 7.23.2 in /docs (#12453) 2023-10-27 14:13:58 -07:00
package.json Restructure docs (#11620) 2023-10-10 12:55:19 -07:00
README.md Fix typos (#11663) 2023-10-12 11:44:03 -04:00
settings.ini Restructure docs (#11620) 2023-10-10 12:55:19 -07:00
sidebars.js Docs: consolidate top nav (#12219) 2023-10-24 12:28:08 -07:00
vercel_build.sh langserve doc (#12357) 2023-10-26 11:40:57 -07:00
vercel_requirements.txt
vercel.json docs: Google Cloud Documentation Cleanup (#12224) 2023-10-24 14:54:43 -07:00

Website

This website is built using Docusaurus 2, a modern static website generator.

Installation

$ yarn

Local Development

$ yarn start

This command starts a local development server and opens up a browser window. Most changes are reflected live without having to restart the server.

Build

$ yarn build

This command generates static content into the build directory and can be served using any static contents hosting service.

Deployment

Using SSH:

$ USE_SSH=true yarn deploy

Not using SSH:

$ GIT_USER=<Your GitHub username> yarn deploy

If you are using GitHub pages for hosting, this command is a convenient way to build the website and push to the gh-pages branch.

Continuous Integration

Some common defaults for linting/formatting have been set for you. If you integrate your project with an open-source Continuous Integration system (e.g. Travis CI, CircleCI), you may check for issues using the following command.

$ yarn ci