PyOCR is an optical character recognition (OCR) tool wrapper for Python. That
is, it helps you use various OCR tools from a Python program.

It has been tested only on GNU/Linux systems. It should also work on similar
systems (*BSD, etc.). It may or may not work on Windows, macOS, etc.

Supported OCR tools:
* Libtesseract (Python bindings for the C API)
* Tesseract (wrapper: fork + exec)
* Cuneiform (wrapper: fork + exec)

Features:
* Supports all the image formats supported by Pillow, including JPEG, PNG, GIF,
  BMP, TIFF and others
* Various output types: text only, bounding boxes, etc.
* Orientation detection (Tesseract and libtesseract only)
* Can focus on digits only (Tesseract and libtesseract only)
* Can save and reload boxes in hOCR format
* PDF generation (libtesseract only)
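
As a rough illustration of how the tools and features listed above fit
together, here is a minimal sketch of selecting an OCR tool and reading text
(or digits only) from an image. The helper names pick_tool and recognize are
hypothetical and not part of PyOCR. The tool names used by the stand-in
objects are assumptions about what get_name() reports, and the sketch assumes
PyOCR, Pillow and at least one OCR engine are installed for recognize to
return anything other than None.

```python
import importlib.util
from types import SimpleNamespace


def pick_tool(tools, wanted):
    """Return the first tool whose name contains `wanted` (case-insensitive), else None."""
    wanted = wanted.lower()
    for tool in tools:
        if wanted in tool.get_name().lower():
            return tool
    return None


def recognize(path, lang="eng", digits_only=False, prefer="tesseract"):
    """Return the text read from the image at `path`.

    Returns None when PyOCR, Pillow, or a matching OCR tool is unavailable.
    """
    if (importlib.util.find_spec("pyocr") is None
            or importlib.util.find_spec("PIL") is None):
        return None
    import pyocr
    import pyocr.builders
    from PIL import Image

    tool = pick_tool(pyocr.get_available_tools(), prefer)
    if tool is None:
        return None
    # DigitBuilder restricts recognition to digits; TextBuilder returns plain text.
    if digits_only:
        builder = pyocr.builders.DigitBuilder()
    else:
        builder = pyocr.builders.TextBuilder()
    with Image.open(path) as image:
        return tool.image_to_string(image, lang=lang, builder=builder)


# Stand-in objects (not real PyOCR tools) so the selection logic can be tried
# without any OCR engine installed. The names are assumed, not authoritative.
fake_tools = [
    SimpleNamespace(get_name=lambda: "Cuneiform (sh)"),
    SimpleNamespace(get_name=lambda: "Tesseract (sh)"),
]
chosen = pick_tool(fake_tools, "tesseract")
print(chosen.get_name())
```

With a real installation, a call such as recognize("scan.png", lang="eng")
would be the typical entry point; "scan.png" is a placeholder file name.
Preferring a Tesseract-based tool by default keeps the digits-only option
usable, since the list above limits that feature to Tesseract and
libtesseract.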

WWW: https://gitlab.gnome.org/World/OpenPaperwork/pyocr