Skip to content

asterbini/UnstructuredCat

 
 

Repository files navigation

Unstructured Cat

Unstructured Cat

A Cheshire Cat AI plugin for document ingestion using the Unstructured lib.

required linux packages

  • libreoffice
  • python3-opencv
  • libmagic-dev
  • pandoc
  • poppler-utils
  • tesseract-ocr
  • tesseract-ocs-LANG (ita, eng ...)

About

A document ingesting plugin using the 'unstructured' lib

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%