A simple string tokenizer which takes a string and splits it on
whitespace. It also optionally takes a string of characters to use as
delimiters, and returns them with the token set as well. This allows for
splitting the string in many different ways.

This is a very basic tokenizer, so more complex needs should be addressed
either with a custom-written tokenizer or by post-processing the output
generated by this module. Basically, this will not fill everyone's needs,
but it spans a gap between simple split / /, $string and the other options
that involve much larger and more complex modules.

Also note that this is not a lexical analyzer. Many people confuse
tokenization with lexical analysis. A tokenizer merely splits its input
into specific chunks; a lexical analyzer classifies those chunks.
Sometimes these two steps are combined, but not here.

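The difference between the two steps can be sketched in a few lines. The
Python below is only an illustration of the behaviour described above, not
the module's Perl API: the function names tokenize and classify and the
category labels are made up for this example.

```python
import re


def tokenize(text, delimiters=""):
    """Split text on whitespace; each delimiter character is kept as a token.

    Illustrative sketch only, not the String::Tokenizer interface.
    """
    if delimiters:
        # The capturing group makes re.split return the delimiters too.
        pattern = r"\s+|([" + re.escape(delimiters) + r"])"
    else:
        pattern = r"\s+"
    # re.split leaves None and empty strings in the gaps; drop them.
    return [piece for piece in re.split(pattern, text) if piece]


def classify(token):
    """Hypothetical lexer step: attach a category to an existing token."""
    if token.isdigit():
        return "NUMBER"
    if token in {"+", "*"}:
        return "OPERATOR"
    if token in {"(", ")"}:
        return "PAREN"
    return "WORD"


tokens = tokenize("((5+5) * 10)", "+*()")
print(tokens)
# ['(', '(', '5', '+', '5', ')', '*', '10', ')']

# Classification is a separate pass over the tokens.
print([(classify(t), t) for t in tokens])
```

Here tokenize only cuts the input into chunks, which is the part this port
covers. Anything like classify would be post-processing supplied by the
caller.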
Author: stevan little <stevan@iinteractive.com>
WWW: http://search.cpan.org/dist/String-Tokenizer/