mirror of
https://git.FreeBSD.org/ports.git
synced 2025-01-28 10:08:24 +00:00
5db6b69ce0
Readability algorithm to detect the main body of the page, usually skipping headers, footers, navigation, etc. WWW: http://search.cpan.org/dist/HTML-ExtractMain/ PR: ports/163557 Submitted by: Jui-Nan Lin <jnlin@csie.nctu.edu.tw>
6 lines
232 B
Plaintext
6 lines
232 B
Plaintext
HTML::ExtractMain is a module which takes HTML content, and uses the
|
|
Readability algorithm to detect the main body of the page, usually
|
|
skipping headers, footers, navigation, etc.
|
|
|
|
WWW: http://search.cpan.org/dist/HTML-ExtractMain/
|