Package | Description |
---|---|
org.tribuo.data |
Provides classes for loading in data from disk, processing it into examples, and splitting datasets for
things like cross-validation and train-test splits.
|
org.tribuo.data.columnar |
Provides classes for processing columnar data and generating
Example s. |
org.tribuo.data.csv |
Provides classes which can load columnar data (using a
RowProcessor )
from a CSV (or other character delimited format) file. |
org.tribuo.data.sql |
Provides classes which can load columnar data (using a
RowProcessor )
from a SQL source. |
org.tribuo.data.text | |
org.tribuo.data.text.impl |
Provides implementations of text data processors.
|
org.tribuo.datasource |
Simple data sources for ingesting or aggregating data.
|
org.tribuo.json |
Provides interop with JSON formatted data, along with tools for interacting with JSON provenance objects.
|
org.tribuo.regression.example |
Provides some example regression data generators for testing implementations.
|
Modifier and Type | Field and Description |
---|---|
ConfigurableDataSource<? extends Output<?>> |
PreprocessAndSerialize.PreprocessAndSerializeOptions.dataSource |
ConfigurableDataSource<?> |
CompletelyConfigurableTrainTest.ConfigurableTrainTestOptions.testSource |
ConfigurableDataSource<?> |
CompletelyConfigurableTrainTest.ConfigurableTrainTestOptions.trainSource |
Modifier and Type | Class and Description |
---|---|
class |
ColumnarDataSource<T extends Output<T>>
A
ConfigurableDataSource base class which takes columnar data (e.g., csv or DB table rows) and generates Example s. |
Modifier and Type | Class and Description |
---|---|
class |
CSVDataSource<T extends Output<T>>
A
DataSource for loading separable data from a text file (e.g., CSV, TSV)
and applying FieldProcessor s to it. |
Modifier and Type | Class and Description |
---|---|
class |
SQLDataSource<T extends Output<T>>
A
DataSource for loading columnar data from a database
and applying FieldProcessor s to it. |
Modifier and Type | Class and Description |
---|---|
class |
DirectoryFileSource<T extends Output<T>>
A data source for a somewhat-common format for text classification datasets:
a top level directory that contains a number of subdirectories.
|
class |
TextDataSource<T extends Output<T>>
A base class for textual data sets.
|
Modifier and Type | Class and Description |
---|---|
class |
SimpleStringDataSource<T extends Output<T>>
A version of
SimpleTextDataSource that accepts an Iterable of Strings. |
class |
SimpleTextDataSource<T extends Output<T>>
A dataset for a simple data format for text classification experiments.
|
Modifier and Type | Class and Description |
---|---|
class |
IDXDataSource<T extends Output<T>>
A DataSource which can read IDX formatted data (i.e., MNIST).
|
class |
LibSVMDataSource<T extends Output<T>>
A DataSource which can read LibSVM formatted data.
|
Modifier and Type | Class and Description |
---|---|
class |
JsonDataSource<T extends Output<T>>
A
DataSource for loading data from a JSON text file
and applying FieldProcessor s to it. |
Modifier and Type | Class and Description |
---|---|
class |
GaussianDataSource
Generates a single dimensional output drawn from N(slope*x + intercept,variance).
|
Copyright © 2015–2021 Oracle and/or its affiliates. All rights reserved.