Uses of Interface
org.tribuo.DataSource
Package
Description
Provides the core interfaces and classes for using Tribuo.
Provides anomaly data generators used for demos and testing implementations.
Provides a multiclass data generator used for testing implementations, along with several synthetic data generators
for 2d binary classification problems to be used in demos or tutorials.
Provides clustering data generators used for demos and testing implementations.
Provides classes for processing columnar data and generating Examples.
Provides classes which can load columnar data (using a RowProcessor) from a CSV (or other character delimited format) file.
Provides classes which can load columnar data (using a RowProcessor) from a SQL source.
Provides implementations of text data processors.
Simple data sources for ingesting or aggregating data.
Evaluation base classes, along with code for train/test splits and cross validation.
Provides interop with JSON formatted data, along with tools for interacting with JSON provenance objects.
Provides a multi-label data generator for testing implementations and a
configurable data source suitable for demos and tests.
Provides some example regression data generators for testing implementations.
Uses of DataSource in org.tribuo
Modifier and Type    Interface    Description
interface ConfigurableDataSource<T extends Output<T>>
    It's a DataSource that's also Configurable.

Modifier    Constructor    Description
protected Dataset(DataSource<T> dataSource)
    Creates a dataset.
ImmutableDataset(DataSource<T> dataSource, FeatureMap featureIDMap, OutputInfo<T> outputIDInfo, boolean dropInvalidExamples)
    Creates a dataset from a data source.
ImmutableDataset(DataSource<T> dataSource, Model<T> model, boolean dropInvalidExamples)
    Creates a dataset from a data source.
MutableDataset(DataSource<T> dataSource)
    Creates a dataset from a data source.
Uses of DataSource in org.tribuo.anomaly.example
Modifier and Type    Class    Description
final class
    Generates an anomaly detection dataset sampling each feature uniformly from a univariate Gaussian.
Uses of DataSource in org.tribuo.classification.example
Modifier and Type    Class    Description
final class
    Creates a data source using a 2d checkerboard of alternating classes.
final class
    A data source for two concentric circles, one per class.
class
    The base class for the 2d binary classification data sources in org.tribuo.classification.example.
final class
    A data source for two classes generated from separate Gaussians.
final class
    A data source of two interleaved half circles.
final class
    A data source of two interleaved half circles with some zero mean Gaussian noise applied to each point.
Uses of DataSource in org.tribuo.clustering.example
Modifier and Type    Class    Description
final class
    Generates a clustering dataset drawn from a mixture of 5 Gaussians.
Uses of DataSource in org.tribuo.data.columnar
Modifier and Type    Class    Description
class ColumnarDataSource<T extends Output<T>>
    A ConfigurableDataSource base class which takes columnar data (e.g., csv or DB table rows) and generates Examples.
Uses of DataSource in org.tribuo.data.csv
Modifier and Type    Class    Description
class CSVDataSource<T extends Output<T>>
    A DataSource for loading separable data from a text file (e.g., CSV, TSV) and applying FieldProcessors to it.

Modifier and Type    Method    Description
CSVLoader.loadDataSource(URL csvPath, String responseName)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(URL csvPath, String responseName, String[] header)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(URL csvPath, Set<String> responseNames)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(URL csvPath, Set<String> responseNames, String[] header)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(Path csvPath, String responseName)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(Path csvPath, String responseName, String[] header)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(Path csvPath, Set<String> responseNames)
    Loads a DataSource from the specified csv path.
CSVLoader.loadDataSource(Path csvPath, Set<String> responseNames, String[] header)
    Loads a DataSource from the specified csv path.
Uses of DataSource in org.tribuo.data.sql
Modifier and Type    Class    Description
class SQLDataSource<T extends Output<T>>
    A DataSource for loading columnar data from a database and applying FieldProcessors to it.
Uses of DataSource in org.tribuo.data.text
Modifier and Type    Class    Description
class DirectoryFileSource<T extends Output<T>>
    A data source for a somewhat-common format for text classification datasets: a top level directory that contains a number of subdirectories.
class TextDataSource<T extends Output<T>>
    A base class for textual data sets.
Uses of DataSource in org.tribuo.data.text.impl
Modifier and Type    Class    Description
class SimpleStringDataSource<T extends Output<T>>
    A version of SimpleTextDataSource that accepts a List of Strings.
class SimpleTextDataSource<T extends Output<T>>
    A dataset for a simple data format for text classification experiments.
Uses of DataSource in org.tribuo.datasource
Modifier and Type    Class    Description
class AggregateConfigurableDataSource<T extends Output<T>>
    Aggregates multiple ConfigurableDataSources, using AggregateDataSource.IterationOrder to control the iteration order.
class AggregateDataSource<T extends Output<T>>
    Aggregates multiple DataSources, using AggregateDataSource.IterationOrder to control the iteration order.
final class IDXDataSource<T extends Output<T>>
    A DataSource which can read IDX formatted data (i.e., MNIST).
final class LibSVMDataSource<T extends Output<T>>
    A DataSource which can read LibSVM formatted data.
class ListDataSource<T extends Output<T>>
    A data source which wraps up a list of Examples along with their DataSourceProvenance and an OutputFactory.

Modifier    Constructor    Description
AggregateDataSource(List<DataSource<T>> sources)
    Creates an aggregate data source which will iterate the provided sources in the order of the list (i.e., using AggregateDataSource.IterationOrder.SEQUENTIAL).
AggregateDataSource(List<DataSource<T>> sources, AggregateDataSource.IterationOrder order)
    Creates an aggregate data source using the supplied sources and iteration order.
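
The effect of an iteration order on an aggregate of several sources can be illustrated without Tribuo on the classpath. The following is a minimal sketch in plain Java, not the library's implementation: the class `IterationOrderSketch`, its `Order` enum and the `aggregate` helper are hypothetical names, plain lists stand in for data sources, and the round-robin mode is an assumption about what a non-sequential order could look like (only SEQUENTIAL is named above).

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

public class IterationOrderSketch {
    /** Hypothetical stand-in for an iteration order; not the Tribuo enum. */
    public enum Order { SEQUENTIAL, ROUNDROBIN }

    /** Flattens the sources into a single list following the requested order. */
    public static <E> List<E> aggregate(List<List<E>> sources, Order order) {
        List<E> out = new ArrayList<>();
        if (order == Order.SEQUENTIAL) {
            // Exhaust each source completely before moving on to the next one.
            for (List<E> source : sources) {
                out.addAll(source);
            }
            return out;
        }
        // Round-robin: take one element from every non-exhausted source per pass.
        List<Iterator<E>> iterators = new ArrayList<>();
        for (List<E> source : sources) {
            iterators.add(source.iterator());
        }
        boolean progressed = true;
        while (progressed) {
            progressed = false;
            for (Iterator<E> it : iterators) {
                if (it.hasNext()) {
                    out.add(it.next());
                    progressed = true;
                }
            }
        }
        return out;
    }

    public static void main(String[] args) {
        List<List<String>> sources = List.of(List.of("a1", "a2", "a3"), List.of("b1"));
        System.out.println(aggregate(sources, Order.SEQUENTIAL));
        System.out.println(aggregate(sources, Order.ROUNDROBIN));
    }
}
```

In the sequential mode the result follows the order of the source list, which matches the behaviour the single-argument constructor is documented to have.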
Uses of DataSource in org.tribuo.evaluation
Modifier and Type    Method    Description
TrainTestSplitter.getTest()
    Gets the testing data source.
TrainTestSplitter.getTrain()
    Gets the training data source.

Modifier and Type    Method    Description
final E AbstractEvaluator.evaluate(Model<T> model, DataSource<T> datasource)
    Produces an evaluation for the supplied model and datasource, by calling Model.predict(org.tribuo.Example<T>) to create the predictions, then aggregating the appropriate statistics.
Evaluator.evaluate(Model<T> model, DataSource<T> datasource)
    Evaluates the dataset using the supplied model, returning an immutable Evaluation of the appropriate type.

Modifier    Constructor    Description
TrainTestSplitter(DataSource<T> data)
    Creates a splitter that splits a dataset 70/30 train and test using a default seed.
TrainTestSplitter(DataSource<T> data, double trainProportion, long seed)
    Creates a splitter that will split the given data set into a training and testing set.
TrainTestSplitter(DataSource<T> data, long seed)
    Creates a splitter that splits a dataset 70/30 train and test.
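
The idea behind a seeded proportional split can be sketched in plain Java. This is an illustration under stated assumptions, not TrainTestSplitter's actual algorithm: `SplitSketch`, `Split` and `split` are hypothetical names, a list of integers stands in for a data source, and the shuffle-then-cut strategy and the rounding rule are choices made for this example.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

public class SplitSketch {
    /** Holds the two partitions produced by a split. */
    public static final class Split<E> {
        public final List<E> train;
        public final List<E> test;

        Split(List<E> train, List<E> test) {
            this.train = train;
            this.test = test;
        }
    }

    /** Shuffles a copy of the data with the given seed, then cuts it at trainProportion. */
    public static <E> Split<E> split(List<E> data, double trainProportion, long seed) {
        if (trainProportion < 0.0 || trainProportion > 1.0) {
            throw new IllegalArgumentException("trainProportion must be in [0,1]");
        }
        List<E> shuffled = new ArrayList<>(data);
        // A fixed seed makes the shuffle, and therefore the split, repeatable.
        Collections.shuffle(shuffled, new Random(seed));
        // Rounding rule is an assumption of this sketch.
        int n = (int) Math.round(trainProportion * shuffled.size());
        List<E> train = new ArrayList<>(shuffled.subList(0, n));
        List<E> test = new ArrayList<>(shuffled.subList(n, shuffled.size()));
        return new Split<>(train, test);
    }

    public static void main(String[] args) {
        List<Integer> data = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            data.add(i);
        }
        Split<Integer> s = split(data, 0.7, 1L);
        System.out.println("train size = " + s.train.size() + ", test size = " + s.test.size());
    }
}
```

Calling `split` twice with the same seed yields the same partitions, which is the property a seed parameter is there to provide.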
Uses of DataSource in org.tribuo.json
Modifier and Type    Class    Description
class JsonDataSource<T extends Output<T>>
    A DataSource for loading data from a JSON text file and applying FieldProcessors to it.
Uses of DataSource in org.tribuo.multilabel.example
Modifier and Type    Class    Description
final class
    Generates a multi label output drawn from a series of functions.
Uses of DataSource in org.tribuo.regression.example
Modifier and Type    Class    Description
class
    Generates a single dimensional output drawn from N(slope*x + intercept,variance).
class
    Generates a single dimensional output drawn from N(w_0*x_0 + w_1*x_1 + w_2*x_1*x_0 + w_3*x_1*x_1*x_1 + intercept,variance).
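
The two output distributions above can be made concrete with a short plain-Java sketch. It is not the Tribuo generator code: `GaussianRegressionSketch` and its methods are hypothetical names, and how the inputs x, x_0 and x_1 are drawn is left out because the descriptions above do not say. The sketch only computes the two mean functions and draws one value from a Gaussian with the stated variance.

```java
import java.util.Random;

public class GaussianRegressionSketch {
    /** Mean of the linear model: slope*x + intercept. */
    public static double linearMean(double slope, double intercept, double x) {
        return slope * x + intercept;
    }

    /** Mean of the nonlinear model: w_0*x_0 + w_1*x_1 + w_2*x_1*x_0 + w_3*x_1^3 + intercept. */
    public static double nonlinearMean(double[] w, double intercept, double x0, double x1) {
        return w[0] * x0 + w[1] * x1 + w[2] * x1 * x0 + w[3] * x1 * x1 * x1 + intercept;
    }

    /** Draws one value from N(mean, variance). */
    public static double sample(double mean, double variance, Random rng) {
        // nextGaussian() is standard normal, so scale by the standard deviation.
        return mean + Math.sqrt(variance) * rng.nextGaussian();
    }

    public static void main(String[] args) {
        Random rng = new Random(1L);
        double y1 = sample(linearMean(2.0, 1.0, 3.0), 0.25, rng);
        double y2 = sample(nonlinearMean(new double[] {1.0, 2.0, 3.0, 4.0}, 0.5, 2.0, 3.0), 0.25, rng);
        System.out.println(y1);
        System.out.println(y2);
    }
}
```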