1
0
mirror of https://git.FreeBSD.org/ports.git synced 2024-11-28 01:06:17 +00:00
freebsd-ports/textproc/py-acora/pkg-descr
Antoine Brodin 618563f0d3 New port: textproc/py-acora
Acora is 'fgrep' for Python, a fast multi-keyword text search engine.
Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes.  It is based on the Aho-Corasick
algorithm and an NFA-to-DFA powerset construction.  Acora comes with both a pure
Python implementation and a fast binary module written in Cython.  However, note
that the current construction algorithm is not suitable for really large sets of
keywords (i.e. more than a couple of thousand).

WWW: https://github.com/scoder/acora/
2014-08-23 08:16:57 +00:00

10 lines
555 B
Plaintext

Acora is 'fgrep' for Python, a fast multi-keyword text search engine.
Based on a set of keywords, it generates a search automaton (DFA) and runs it
over string input, either unicode or bytes. It is based on the Aho-Corasick
algorithm and an NFA-to-DFA powerset construction. Acora comes with both a pure
Python implementation and a fast binary module written in Cython. However, note
that the current construction algorithm is not suitable for really large sets of
keywords (i.e. more than a couple of thousand).
WWW: https://github.com/scoder/acora/