Developing state of the art document intelligence models.
11
Public repositories
68,769
Total stars
706
Followers
Convert PDF to markdown + JSON quickly with high accuracy
OCR, layout analysis, reading order, table recognition in 90+ languages
OCR model that handles complex tables, forms, handwriting with full layout.
Extract structured text from pdfs quickly
No description provided for this repository.
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)
Scripts to run Datalab's self-service on-prem container
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
No description provided for this repository.
Monitor Datalab with RepoGuard and get alerted the moment a new public repository appears.
Monitor this account