PR: 229527
Submitted by: freebsd_ports@k-worx.org
Sponsored by: iXsystems Inc.
Normality is a Python micro-package that contains a small set of text
normalization functions for easier re-use. These functions accept a snippet of
Unicode or UTF-8 encoded text and remove various classes of characters, such as
diacritics and punctuation. This is useful as preparation for further text
analysis.

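As a rough illustration of the kind of normalization described above, the
sketch below strips diacritics and punctuation using only Python's standard
unicodedata module. It is not Normality's own code or API: the function names
strip_diacritics, strip_punctuation and simple_normalize are hypothetical and
exist only for this example.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    # Hypothetical helper: decompose characters (NFKD) so that accents become
    # separate combining marks, then drop those marks (category "Mn").
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def strip_punctuation(text: str) -> str:
    # Hypothetical helper: replace every punctuation character (Unicode
    # categories starting with "P") with a space, then collapse whitespace.
    cleaned = "".join(
        " " if unicodedata.category(ch).startswith("P") else ch for ch in text
    )
    return " ".join(cleaned.split())


def simple_normalize(data) -> str:
    # Accept either str or UTF-8 encoded bytes, as the description suggests.
    if isinstance(data, bytes):
        data = data.decode("utf-8")
    return strip_punctuation(strip_diacritics(data)).lower()


if __name__ == "__main__":
    print(simple_normalize("Café, déjà vu!"))
    print(simple_normalize("Zürich.".encode("utf-8")))
```

Lowercasing at the end is an extra assumption of this sketch, chosen because
it is a common step before text comparison; the description itself only
mentions removing classes of characters.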
WWW: https://github.com/pudo/normality