1
0
mirror of https://git.FreeBSD.org/ports.git synced 2024-12-25 04:43:33 +00:00
freebsd-ports/chinese/p5-Lingua-ZH-WordSegmenter/pkg-descr
Sunpoet Po-Chuan Hsieh 58a9f2a8df - Add LICENSE
- Remove unnecessary MASTER_SITE_SUBDIR
- Reformat pkg-descr
- Use single space after WWW:
2014-07-06 18:49:31 +00:00

14 lines
591 B
Plaintext

This is a perl version of simplified Chinese word segmentation.
The algorithm for this segmenter is to search the longest word at each point
from both left and right directions, and choose the one with higher frequency
product.
The original program is from the CPAN module Lingua::ZH::WordSegment
(http://search.cpan.org/~chenyr/) I did the follwing changes: 1) make the
interface object oriented; 2) make the internal string into utf8; 3) using
sogou's dictionary (http://www.sogou.com/labs/dl/w.html) as the default
dictionary.
WWW: http://search.cpan.org/dist/Lingua-ZH-WordSegmenter/