mirror of
https://git.FreeBSD.org/ports.git
synced 2024-12-16 03:24:07 +00:00
f98ad549cc
is simple: Using "Text::ExtractWords" and "Lingua::StopWords" from CPAN, it determines how many of the known stopwords the document contains for each language supported by "Lingua::StopWords". Each word in the document recognized as stopword of a particular language scores one point for this language. The "language_guess()" function takes a document as a parameter and returns the abbreviation of the language that it is most likely written in. Author: Mike Schilli <cpan@perlmeister.com> WWW: http://search.cpan.org/~mschilli/Text-Language-Guess-0.02/ PR: ports/103571 Submitted by: Masahiro Teramoto <markun@onohara.to>
10 lines
373 B
Plaintext
10 lines
373 B
Plaintext
@comment $FreeBSD$
|
|
bin/language-guess
|
|
%%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language/Guess/.packlist
|
|
%%SITE_PERL%%/Text/Language/Guess.pm
|
|
@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language/Guess
|
|
@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text/Language
|
|
@dirrmtry %%SITE_PERL%%/%%PERL_ARCH%%/auto/Text
|
|
@dirrmtry %%SITE_PERL%%/Text/Language
|
|
@dirrmtry %%SITE_PERL%%/Text
|