Index | index by Group | index by Distribution | index by Vendor | index by creation date | index by Name | Mirrors | Help | Search |
Name: hocr-tools | Distribution: OpenMandriva Lx |
Version: 20091007 | Vendor: OpenMandriva |
Release: 2 | Build date: Sun Nov 1 15:25:41 2020 |
Group: Office | Build host: c64one.openmandriva.org |
Size: 31695 | Source RPM: hocr-tools-20091007-2.src.rpm |
Packager: bero <bero@lindev.ch> | |
Url: https://code.google.com/p/hocr-tools/ | |
Summary: Tools for manipulating and evaluating the hOCR format |
OCR is a format for representing OCR output, including layout information, character confidences, bounding boxes, and style information. It embeds this information invisibly in standard HTML. By building on standard HTML, it automatically inherits well-defined support for most scripts, languages, and common layout options. Furthermore, unlike previous OCR formats, the recognized text and OCR-related information co-exist in the same file and survives editing and manipulation. hOCR markup is independent of the presentation. Included command line programs: - hocr-check -- check the hOCR file for errors - hocr-combine -- combine pages in multiple hOCR files into a single document - hocr-eval -- compute number of segmentation and OCR errors - hocr-eval-geom -- compute over, under, and mis-segmentations - hocr-eval-lines -- compute OCR errors of hOCR output relative to text ground truth - hocr-split -- split an hOCR file into individual pages - hocr-merge-dc -- merge Dublin Core meta data into the hOCR HTML header
Apache License
* Fri Nov 11 2011 Andrey Smirnov <asmirnov@mandriva.org> 20091007-1 + Revision: 730075 - imported package hocr-tools
/usr/bin/hocr-check /usr/bin/hocr-combine /usr/bin/hocr-eval /usr/bin/hocr-eval-geom /usr/bin/hocr-eval-lines /usr/bin/hocr-extract-g1000 /usr/bin/hocr-extract-images /usr/bin/hocr-lines /usr/bin/hocr-merge-dc /usr/bin/hocr-split /usr/share/doc/hocr-tools /usr/share/doc/hocr-tools/README
Generated by rpm2html 1.8.1
Fabrice Bellet, Fri Nov 15 23:04:11 2024