Class SplitCharactersTokenizerOptions

java.lang.Object
org.tribuo.util.tokens.options.SplitCharactersTokenizerOptions
All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Options, TokenizerOptions

public class SplitCharactersTokenizerOptions extends Object implements TokenizerOptions
CLI options for a SplitCharactersTokenizer.
  • Field Details

    • splitChars

      @Option(longName="sc-tokenizer-split-characters", usage="The characters to split on.") public char[] splitChars
      The characters to split on.
    • splitXDigitsChars

      @Option(longName="sc-tokenizer-split-x-digits", usage="Characters to split on unless they appear between digits") public char[] splitXDigitsChars
      Characters to split on unless they appear between digits
  • Constructor Details

    • SplitCharactersTokenizerOptions

      public SplitCharactersTokenizerOptions()
  • Method Details