A regular expression describing punctuation to strip from the beginning of tokens; matches will be stripped by replacing them with their first match group.
A regular expression describing punctuation to strip from the beginning of tokens; matches will be stripped by replacing them with their first match group. Override this definition to customize tokenizer behavior. Defaults to
"(\\s)[^\\sA-Za-z0-9-_/]+|()^[^\\sA-Za-z0-9-_/]+".
"(\\s)[\\sA-Za-z0-9-_/]+|()[^\\sA-Za-z0-9-_/]+"
A regular expression describing punctuation to strip from within tokens; matches will be stripped by replacing them with the empty string.
A regular expression describing punctuation to strip from within tokens; matches will be stripped by replacing them with the empty string. Override this definition to customize tokenizer behavior. Defaults to
"[^A-Za-z0-9-_./:@]"if not overridden.
"[^A-Za-z0-9-_./:@]"
Splits a log message into a sequence of tokens, by
Splits a log message into a sequence of tokens, by
a sequence of tokens
A regular expression describing punctuation to strip from the end of tokens; matches will be stripped by replacing them with their first match group.
A regular expression describing punctuation to strip from the end of tokens; matches will be stripped by replacing them with their first match group. Override this definition to customize tokenizer behavior. Defaults to
"[^\\sA-Za-z0-9-_/]+(\\s)|()[^\\sA-Za-z0-9-_/]+$"if not overridden.
"[\\sA-Za-z0-9-_/]+(\\s)|()[\\sA-Za-z0-9-_/]+$"