Package | Description |
---|---|
org.tribuo.util.tokens.impl |
Simple fixed rule tokenizers.
|
org.tribuo.util.tokens.impl.wordpiece |
Provides an implementation of a Wordpiece tokenizer which implements
to the Tribuo
Tokenizer API. |
Modifier and Type | Class and Description |
---|---|
static class |
SplitCharactersTokenizer.SplitCharactersSplitterFunction
Splits tokens at the supplied characters.
|
Modifier and Type | Field and Description |
---|---|
protected SplitFunctionTokenizer.SplitFunction |
SplitFunctionTokenizer.splitFunction |
static SplitFunctionTokenizer.SplitFunction |
WhitespaceTokenizer.whitespaceSplitCharacterFunction |
Constructor and Description |
---|
SplitFunctionTokenizer(SplitFunctionTokenizer.SplitFunction splitFunction)
Creates a new tokenizer using the supplied split function.
|
Modifier and Type | Method and Description |
---|---|
static SplitFunctionTokenizer.SplitFunction |
WordpieceBasicTokenizer.createSplitFunction(boolean tokenizeChineseChars)
Creates a
SplitFunction that is used by the super class
SplitFunctionTokenizer to determine how and where the tokenizer
splits the input. |
Copyright © 2015–2021 Oracle and/or its affiliates. All rights reserved.