Uses of Interface
org.tribuo.util.tokens.impl.SplitFunctionTokenizer.SplitFunction
Packages that use SplitFunctionTokenizer.SplitFunction
Package: org.tribuo.util.tokens.impl
Description: Simple fixed rule tokenizers.

Package: org.tribuo.util.tokens.impl.wordpiece
Description: Provides an implementation of a Wordpiece tokenizer which conforms to the Tribuo Tokenizer API.
Uses of SplitFunctionTokenizer.SplitFunction in org.tribuo.util.tokens.impl
Classes in org.tribuo.util.tokens.impl that implement SplitFunctionTokenizer.SplitFunction
Modifier and Type: static class
Description: Splits tokens at the supplied characters.

Fields in org.tribuo.util.tokens.impl declared as SplitFunctionTokenizer.SplitFunction
Modifier and Type: protected SplitFunctionTokenizer.SplitFunction
Field: SplitFunctionTokenizer.splitFunction
Description: The splitting function.

Modifier and Type: static final SplitFunctionTokenizer.SplitFunction
Field: WhitespaceTokenizer.whitespaceSplitCharacterFunction
Description: The splitting function for whitespace, using Character.isWhitespace(char).

Constructors in org.tribuo.util.tokens.impl with parameters of type SplitFunctionTokenizer.SplitFunction
Constructor: SplitFunctionTokenizer(SplitFunctionTokenizer.SplitFunction splitFunction)
Description: Creates a new tokenizer using the supplied split function.
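
The idea of a tokenizer driven by a supplied split function can be sketched in plain Java. This is a simplified stand-in and not the Tribuo API: the names SplitFunctionSketch, CharSplitRule and tokenize are hypothetical, and the real SplitFunctionTokenizer.SplitFunction has its own signature and result type, which this page does not show. The only detail taken from the listing above is that the whitespace rule relies on Character.isWhitespace(char).

```java
import java.util.ArrayList;
import java.util.List;

// Simplified stand-in, NOT the Tribuo API. All names here are hypothetical.
class SplitFunctionSketch {

    /** Hypothetical split rule: true means "split at this character". */
    @FunctionalInterface
    interface CharSplitRule {
        boolean splitAt(char c);
    }

    /** Whitespace rule built on Character.isWhitespace(char), as in the field listed above. */
    static final CharSplitRule WHITESPACE = Character::isWhitespace;

    /** Splits the text wherever the rule fires; the split characters are discarded. */
    static List<String> tokenize(CharSequence text, CharSplitRule rule) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (rule.splitAt(c)) {
                // Close the token being built, if any, and skip the split character.
                if (current.length() > 0) {
                    tokens.add(current.toString());
                    current.setLength(0);
                }
            } else {
                current.append(c);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(tokenize("Hello  split\tfunction", WHITESPACE));
        System.out.println(tokenize("a,b;c", c -> c == ',' || c == ';'));
    }
}
```

Passing the rule in as a parameter keeps the scanning loop fixed while the splitting behaviour varies, which is the role the constructor argument plays in the listing above.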
Uses of SplitFunctionTokenizer.SplitFunction in org.tribuo.util.tokens.impl.wordpiece
Methods in org.tribuo.util.tokens.impl.wordpiece that return SplitFunctionTokenizer.SplitFunction
Method: WordpieceBasicTokenizer.createSplitFunction(boolean tokenizeChineseChars)
Description: Creates a SplitFunctionTokenizer.SplitFunction that is used by the superclass SplitFunctionTokenizer to determine how and where the tokenizer splits the input.
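
A factory method that builds a split rule from a boolean flag might look like the following sketch. It is illustrative only and does not reproduce the actual rules of WordpieceBasicTokenizer: the Action enum, the SplitRule interface, createSplitRule and the choice of the CJK Unified Ideographs block (U+4E00 to U+9FFF) as the meaning of "Chinese characters" are all assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.List;

// Illustrative only. All names are hypothetical and the rules are assumptions,
// not the behaviour of WordpieceBasicTokenizer.
class SplitFactorySketch {

    /** Hypothetical outcome of inspecting one code point. */
    enum Action { KEEP, DROP_AND_SPLIT, ISOLATE }

    /** Hypothetical split rule mapping a code point to an action. */
    @FunctionalInterface
    interface SplitRule {
        Action apply(int codePoint);
    }

    /** Assumption: only U+4E00..U+9FFF counts as a Chinese character here. */
    static boolean isCjkIdeograph(int cp) {
        return cp >= 0x4E00 && cp <= 0x9FFF;
    }

    /** Factory in the spirit of createSplitFunction(boolean): the flag selects the rule. */
    static SplitRule createSplitRule(boolean tokenizeChineseChars) {
        return cp -> {
            if (Character.isWhitespace(cp)) {
                return Action.DROP_AND_SPLIT;
            }
            if (tokenizeChineseChars && isCjkIdeograph(cp)) {
                return Action.ISOLATE;
            }
            return Action.KEEP;
        };
    }

    /** Generic scanning loop; where it splits is decided entirely by the rule. */
    static List<String> tokenize(String text, SplitRule rule) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            Action action = rule.apply(cp);
            if (action == Action.KEEP) {
                current.appendCodePoint(cp);
                continue;
            }
            // Both remaining actions end the token being built.
            if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
            if (action == Action.ISOLATE) {
                // The code point becomes a token of its own.
                tokens.add(new String(Character.toChars(cp)));
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "ab\u4E2D\u6587 cd";
        System.out.println(tokenize(text, createSplitRule(true)));
        System.out.println(tokenize(text, createSplitRule(false)));
    }
}
```

With the flag set, each ideograph in the sample text becomes its own token; with it cleared, the ideographs stay attached to the neighbouring characters and only whitespace separates tokens.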