Class DataOptions
java.lang.Object
org.tribuo.data.DataOptions
All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Options
Options for working with training and test data in a CLI.
Nested Class Summary
Nested Classes
static enum: The delimiters supported by CSV files in this options object.
static enum: The input formats supported by this options object.
Field Summary
Fields
Fields inherited from interface com.oracle.labs.mlrg.olcut.config.Options
header
Constructor Summary
Constructors
Method Summary
Field Details
hashDim
@Option(longName="hashing-dimension", usage="Hashing dimension used for standard text format.")
public int hashDim
ngram
@Option(longName="ngram", usage="Ngram size to generate when using standard text format.")
public int ngram
termCounting
@Option(longName="term-counting", usage="Use term counts instead of boolean when using the standard text format.")
public boolean termCounting
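The three text options above (hashDim, ngram, termCounting) interact when a document is turned into features. The following is a minimal sketch of that interaction, not Tribuo's implementation: it assumes whitespace tokenisation, n-grams of every size from 1 up to ngram, and String.hashCode as the hash function. The names HashedNgramSketch and featurise are hypothetical.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

/** Illustrative only; the tokeniser and hash function are assumptions. */
final class HashedNgramSketch {

    /**
     * Builds a sparse feature map from whitespace-separated tokens.
     *
     * @param text         the input document
     * @param ngram        largest n-gram size to emit (1 means unigrams only)
     * @param hashDim      number of hash buckets; values below 1 disable hashing (assumption)
     * @param termCounting if true, values are occurrence counts; otherwise 1.0 marks presence
     */
    static Map<String, Double> featurise(String text, int ngram, int hashDim, boolean termCounting) {
        Map<String, Double> features = new HashMap<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return features; // no tokens, no features
        }
        String[] tokens = trimmed.split("\\s+");
        for (int n = 1; n <= ngram; n++) {
            for (int i = 0; i + n <= tokens.length; i++) {
                // Join n consecutive tokens into one n-gram.
                String gram = String.join(" ", Arrays.copyOfRange(tokens, i, i + n));
                // With hashing enabled, the feature name is the bucket index.
                String name = hashDim > 0
                        ? "h" + Math.floorMod(gram.hashCode(), hashDim)
                        : gram;
                if (termCounting) {
                    features.merge(name, 1.0, Double::sum); // accumulate counts
                } else {
                    features.put(name, 1.0); // boolean presence
                }
            }
        }
        return features;
    }
}
```

With hashing disabled, featurise("a b a", 2, 0, true) yields the counts a=2.0, b=1.0, "a b"=1.0 and "b a"=1.0. Passing termCounting=false collapses every value to 1.0. A smaller hashDim bounds the feature space at the cost of collisions between n-grams.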
outputPath
@Option(charName='f', longName="model-output-path", usage="Path to serialize model to.")
public Path outputPath
seed
inputFormat
@Option(charName='s', longName="input-format", usage="Loads the data using the specified format.")
public DataOptions.InputFormat inputFormat
csvResponseName
@Option(longName="csv-response-name", usage="Response name in the csv file.")
public String csvResponseName
delimiter
csvQuoteChar
@Option(longName="csv-quote-char", usage="Quote character in the CSV file.")
public char csvQuoteChar
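The delimiter and csvQuoteChar options together determine how a CSV line is split into fields. The following is a minimal sketch of that splitting, assuming a quote character toggles a quoted region and a doubled quote inside a quoted field stands for one literal quote. This is a common CSV convention and is not confirmed here as the behaviour of Tribuo's loader. The names CsvSplitSketch and split are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

/** Illustrative only; the quoting rules are an assumption, not Tribuo's CSV loader. */
final class CsvSplitSketch {

    /**
     * Splits one line into fields.
     *
     * @param line      a single record, without its line terminator
     * @param delimiter the field separator, for example ',' or '\t'
     * @param quote     the quote character, for example '"'
     */
    static List<String> split(String line, char delimiter, char quote) {
        List<String> fields = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        boolean inQuotes = false;
        for (int i = 0; i < line.length(); i++) {
            char c = line.charAt(i);
            if (c == quote) {
                if (inQuotes && i + 1 < line.length() && line.charAt(i + 1) == quote) {
                    current.append(quote); // doubled quote inside a quoted field
                    i++;                   // skip the second quote of the pair
                } else {
                    inQuotes = !inQuotes;  // enter or leave a quoted region
                }
            } else if (c == delimiter && !inQuotes) {
                fields.add(current.toString()); // delimiter outside quotes ends the field
                current.setLength(0);
            } else {
                current.append(c);
            }
        }
        fields.add(current.toString()); // last field has no trailing delimiter
        return fields;
    }
}
```

For example, split("a,\"b,c\",d", ',', '"') keeps the quoted comma inside the second field, giving the three fields a, b,c and d.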
rowProcessor
@Option(longName="columnar-row-processor", usage="The name of the row processor from the config file.")
public RowProcessor<?> rowProcessor
minCount
trainingPath
@Option(charName='u', longName="training-file", usage="Path to the training file.")
public Path trainingPath
testingPath
@Option(charName='v', longName="testing-file", usage="Path to the testing file.")
public Path testingPath
Constructor Details
DataOptions
public DataOptions()
Method Details
getOptionsDescription
Specified by:
getOptionsDescription in interface com.oracle.labs.mlrg.olcut.config.Options
load
public <T extends Output<T>> com.oracle.labs.mlrg.olcut.util.Pair<Dataset<T>, Dataset<T>> load(OutputFactory<T> outputFactory) throws IOException
Throws:
IOException
saveModel
Throws:
IOException