mirror of
https://git.FreeBSD.org/ports.git
synced 2024-12-01 01:17:02 +00:00
696f060d47
- While I'm here, update to 20131113 PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner allows to obtain the exact location of texts in a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for other purposes instead of text analysis. WWW: http://www.unixuser.org/~euske/python/pdfminer/index.html PR: ports/181992 Submitted by: Kozlov Sergey <kozlov.sergey.404@gmail.com>
10 lines
527 B
Plaintext
10 lines
527 B
Plaintext
PDFMiner is a tool for extracting information from PDF documents. Unlike other
|
|
PDF-related tools, it focuses entirely on getting and analyzing text data.
|
|
PDFMiner allows to obtain the exact location of texts in a page, as well as
|
|
other information such as fonts or lines. It includes a PDF converter that can
|
|
transform PDF files into other text formats (such as HTML).
|
|
It has an extensible PDF parser that can be used for other purposes instead
|
|
of text analysis.
|
|
|
|
WWW: http://www.unixuser.org/~euske/python/pdfminer/index.html
|