dbacl is a digramic Bayesian text classifier. Given some text,
it calculates the posterior probabilities that the input resembles
one of any number of previously learned document collections.
It can be used to sort incoming email into arbitrary categories
such as spam, work, and play, or simply to distinguish an English text
from a French text. It fully supports international character sets,
and uses sophisticated statistical models based on the
Maximum Entropy Principle.
Author: Laird A. Breyer <laird@lbreyer.com>
WWW: http://dbacl.sourceforge.net/
PR: 58733
Submitted by: Cheng-Lung Sung <clsung@dragon2.net>
C++ library which allows rapid development of database applications.
Note: hk_classes works with mysql, postgresql und ODBC, but this port only
supports mysql at the moment.
Prodded by: enoal <sreco@xmission.com>
PR: 41380
mboxstats creates several top-10 lists from a file containing message
in mbox-format. List of top10 lists:
o Top writes
o Top receivers
o Top subjects
o Top cc'ers
o Top top-level-domain
o Top timezones
o Top organisations
o Top useragents (mailprograms)
o Top month/day-of-month/day-of-week/hour
o Average number of lines per message
o All kinds of per-user statistics
And much more!
WWW: http://vanheusden.com/mboxstats/
PR: 58748
Submitted by: Cheng-Lung Sung <clsung@dragon2.net>
minirsyslogd is a minimalistic, fast and secure (through lack of bloat)
remote-only syslog receiver suitable for hardened log receiver hosts
and/or central log receivers that receive several gigabyte of logs each day.
It will not deal with local syslog data. It does not have a multitude
of configuration, alerting or scripting options. It will however
automatically split inbound syslog data according to IP address,
date and current hour, and do so as rapidly and (I hope) securely as
possible.
Author: Mikael Olsson <mikael.olsson@clavister.com>
WWW: http://www.clueby4.org/minirsyslogd/
PR: 58737
Submitted by: lx@redundancy.redundancy.org