01. Export formats including ALTO XML, PAGE XML, and structured JSON for archival use.
02. 8 GitHub stars
03. Highly customizable pipelines via YAML configurations for specific model and segmentation needs.
04. Batch processing support for efficient transcription of multiple document images in a single call.
05. Generates interactive inline viewer artifacts with embedded dependencies for instant review.
06. Multilingual handwritten text recognition for Swedish, Norwegian, English, and medieval scripts.
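
As a rough illustration of the batch-processing and structured-JSON items above, here is a minimal Python sketch. It does not use the tool's real API: `Line`, `Page`, `transcribe_batch`, `to_json`, and `fake_recognize` are hypothetical names invented for this example, and the JSON layout is an assumption, not the tool's actual export schema.

```python
import json
from dataclasses import dataclass, asdict
from typing import Callable, Iterable


@dataclass
class Line:
    """One recognized text line (hypothetical structure)."""
    text: str
    bbox: tuple[int, int, int, int]  # x_min, y_min, x_max, y_max in pixels


@dataclass
class Page:
    """Transcription result for a single document image."""
    image: str
    lines: list[Line]


def transcribe_batch(
    images: Iterable[str],
    recognize: Callable[[str], list[Line]],
) -> list[Page]:
    """Run the recognizer over every image in one call, one Page per image."""
    return [Page(image=path, lines=recognize(path)) for path in images]


def to_json(pages: list[Page]) -> str:
    """Serialize pages to JSON; ensure_ascii=False keeps å, ä, ö readable."""
    return json.dumps(
        {"pages": [asdict(p) for p in pages]},
        ensure_ascii=False,
        indent=2,
    )


def fake_recognize(path: str) -> list[Line]:
    """Stand-in for a real HTR model; returns a fixed placeholder line."""
    return [Line(text=f"placeholder text for {path}", bbox=(0, 0, 100, 20))]


if __name__ == "__main__":
    pages = transcribe_batch(["page_001.jpg", "page_002.jpg"], fake_recognize)
    print(to_json(pages))
```

Passing the recognizer in as a function keeps the batching and export logic independent of any particular model, so a real recognition backend could replace `fake_recognize` without touching the rest.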