1
0
mirror of https://git.FreeBSD.org/ports.git synced 2024-12-20 04:02:27 +00:00
freebsd-ports/textproc/scws/pkg-descr
Torsten Zuehlsdorff 0a3058e33c New port: textproc/scws
SCWS (Simple Chinese Word Segmentation) is a frequency dictionary based Chinese
word segmentation engine, it can cut a whole section of the Chinese text into
words. Word is the smallest unit of morpheme in Chinese, but in Chinese words
are not separated by spaces,so word segmentation is an important step for
Chinese language process.SCWS is written in C without other dependencies and
accept GBK and UTF-8 encoding for both the Simple Chinese (zh_CN) and the
Traditional Chinese (such as zh_TW).

WWW: http://www.xunsearch.com/scws/index.php

PR:           219132
Submitted by: Jov <amutu@amutu.com>
2017-07-17 10:16:04 +00:00

10 lines
544 B
Plaintext

SCWS (Simple Chinese Word Segmentation) is a frequency dictionary based Chinese
word segmentation engine, it can cut a whole section of the Chinese text into
words. Word is the smallest unit of morpheme in Chinese, but in Chinese words
are not separated by spaces,so word segmentation is an important step for
Chinese language process.SCWS is written in C without other dependencies and
accept GBK and UTF-8 encoding for both the Simple Chinese (zh_CN) and the
Traditional Chinese (such as zh_TW).
WWW: http://www.xunsearch.com/scws/index.php