Uses of Interface
org.tribuo.util.tokens.impl.SplitFunctionTokenizer.SplitFunction
Package: org.tribuo.util.tokens.impl
Description: Simple fixed rule tokenizers.

Package: org.tribuo.util.tokens.impl.wordpiece
Description: Provides an implementation of a Wordpiece tokenizer which implements the Tribuo Tokenizer API.

Uses of SplitFunctionTokenizer.SplitFunction in org.tribuo.util.tokens.impl
Modifier and Type: static class
Description: Splits tokens at the supplied characters.

Modifier and Type: protected SplitFunctionTokenizer.SplitFunction
Field: SplitFunctionTokenizer.splitFunction

Modifier and Type: static final SplitFunctionTokenizer.SplitFunction
Field: WhitespaceTokenizer.whitespaceSplitCharacterFunction
Description: The splitting function for whitespace, using Character.isWhitespace(char).

Constructor: SplitFunctionTokenizer(SplitFunctionTokenizer.SplitFunction splitFunction)
Description: Creates a new tokenizer using the supplied split function.

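
The idea of a tokenizer driven by a supplied split function can be illustrated with a minimal, self-contained sketch. It does not use the Tribuo classes: the CharSplit interface, the SplitSketch class, and the tokenize method are hypothetical names chosen for illustration, and the real SplitFunction signature may differ. The only detail taken from the documentation above is that the whitespace split function relies on Character.isWhitespace(char).

```java
import java.util.ArrayList;
import java.util.List;

public class SplitSketch {

    /** Hypothetical stand-in for a split function; not Tribuo's actual interface. */
    @FunctionalInterface
    interface CharSplit {
        boolean isSplit(char c);
    }

    /** A whitespace split rule built on Character.isWhitespace(char). */
    static final CharSplit WHITESPACE = c -> Character.isWhitespace(c);

    /** Splits text wherever the rule matches, dropping the split characters and empty tokens. */
    static List<String> tokenize(String text, CharSplit split) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (split.isSplit(c)) {
                // A split character closes the token collected so far, if any.
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.append(c);
            }
        }
        // Flush the trailing token.
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("Hello  split\tfunction world", WHITESPACE));
        System.out.println(tokenize("a,b;c", c -> c == ',' || c == ';'));
    }
}
```

Passing the rule in as a parameter keeps the scanning loop unchanged when the splitting behavior changes, which is the design that the constructor above suggests.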
Uses of SplitFunctionTokenizer.SplitFunction in org.tribuo.util.tokens.impl.wordpiece
Method: WordpieceBasicTokenizer.createSplitFunction(boolean tokenizeChineseChars)
Description: Creates a SplitFunctionTokenizer.SplitFunction that is used by the super class SplitFunctionTokenizer to determine how and where the tokenizer splits the input.
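
A factory that builds a split rule from a boolean flag might look like the following sketch. It is not the Tribuo implementation: SplitFactorySketch, createSplitPredicate, and isHan are hypothetical names, and treating "Chinese characters" as code points in the Unicode Han script is an assumption. The sketch only marks where a token boundary falls; it does not model whether the boundary character is dropped or kept as its own token.

```java
import java.util.function.IntPredicate;

public class SplitFactorySketch {

    /** True for code points in the Han script (an assumed definition of "Chinese character"). */
    static boolean isHan(int codePoint) {
        return Character.UnicodeScript.of(codePoint) == Character.UnicodeScript.HAN;
    }

    /**
     * Hypothetical factory: returns a predicate that marks boundary code points.
     * Whitespace is always a boundary; Han code points are boundaries only
     * when tokenizeChineseChars is true.
     */
    static IntPredicate createSplitPredicate(boolean tokenizeChineseChars) {
        IntPredicate whitespace = Character::isWhitespace;
        return tokenizeChineseChars
                ? whitespace.or(SplitFactorySketch::isHan)
                : whitespace;
    }

    public static void main(String[] args) {
        int han = 0x4E2D; // a Han script code point
        System.out.println(createSplitPredicate(true).test(han));
        System.out.println(createSplitPredicate(false).test(han));
    }
}
```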