Uses of Interface
org.tribuo.ConfigurableDataSource
Package
Description
Provides anomaly data generators used for demos and testing implementations.
Provides a multiclass data generator used for testing implementations, along with several synthetic data generators
for 2d binary classification problems to be used in demos or tutorials.
Provides clustering data generators used for demos and testing implementations.
Provides classes for loading in data from disk, processing it into examples, and splitting datasets for
things like cross-validation and train-test splits.
Provides classes for processing columnar data and generating
Example
s.Provides classes which can load columnar data (using a
RowProcessor
)
from a CSV (or other character delimited format) file.Provides classes which can load columnar data (using a
RowProcessor
)
from a SQL source.Provides implementations of text data processors.
Simple data sources for ingesting or aggregating data.
Provides interop with JSON formatted data, along with tools for interacting with JSON provenance objects.
Provides a multi-label data generator for testing implementations and a
configurable data source suitable for demos and tests.
Provides some example regression data generators for testing implementations.
-
Uses of ConfigurableDataSource in org.tribuo.anomaly.example
Modifier and TypeClassDescriptionfinal class
Generates an anomaly detection dataset sampling each feature uniformly from a univariate Gaussian. -
Uses of ConfigurableDataSource in org.tribuo.classification.example
Modifier and TypeClassDescriptionfinal class
Creates a data source using a 2d checkerboard of alternating classes.final class
A data source for two concentric circles, one per class.class
The base class for the 2d binary classification data sources inorg.tribuo.classification.example
.final class
A data source for two classes generated from separate Gaussians.final class
A data source of two interleaved half circles.final class
A data source of two interleaved half circles with some zero mean Gaussian noise applied to each point. -
Uses of ConfigurableDataSource in org.tribuo.clustering.example
Modifier and TypeClassDescriptionfinal class
Generates a clustering dataset drawn from a mixture of 5 Gaussians. -
Uses of ConfigurableDataSource in org.tribuo.data
Modifier and TypeFieldDescriptionConfigurableDataSource<? extends Output<?>>
PreprocessAndSerialize.PreprocessAndSerializeOptions.dataSource
Datasource to load from a config fileCompletelyConfigurableTrainTest.ConfigurableTrainTestOptions.testSource
Load the testing DataSource from the config file.CompletelyConfigurableTrainTest.ConfigurableTrainTestOptions.trainSource
Load the training DataSource from the config file. -
Uses of ConfigurableDataSource in org.tribuo.data.columnar
Modifier and TypeClassDescriptionclass
ColumnarDataSource<T extends Output<T>>
AConfigurableDataSource
base class which takes columnar data (e.g., csv or DB table rows) and generatesExample
s. -
Uses of ConfigurableDataSource in org.tribuo.data.csv
Modifier and TypeClassDescriptionclass
CSVDataSource<T extends Output<T>>
ADataSource
for loading separable data from a text file (e.g., CSV, TSV) and applyingFieldProcessor
s to it. -
Uses of ConfigurableDataSource in org.tribuo.data.sql
Modifier and TypeClassDescriptionclass
SQLDataSource<T extends Output<T>>
ADataSource
for loading columnar data from a database and applyingFieldProcessor
s to it. -
Uses of ConfigurableDataSource in org.tribuo.data.text
Modifier and TypeClassDescriptionclass
DirectoryFileSource<T extends Output<T>>
A data source for a somewhat-common format for text classification datasets: a top level directory that contains a number of subdirectories.class
TextDataSource<T extends Output<T>>
A base class for textual data sets. -
Uses of ConfigurableDataSource in org.tribuo.data.text.impl
Modifier and TypeClassDescriptionclass
SimpleStringDataSource<T extends Output<T>>
A version ofSimpleTextDataSource
that accepts aList
of Strings.class
SimpleTextDataSource<T extends Output<T>>
A dataset for a simple data format for text classification experiments. -
Uses of ConfigurableDataSource in org.tribuo.datasource
Modifier and TypeClassDescriptionclass
AggregateConfigurableDataSource<T extends Output<T>>
Aggregates multipleConfigurableDataSource
s, usesAggregateDataSource.IterationOrder
to control the iteration order.final class
IDXDataSource<T extends Output<T>>
A DataSource which can read IDX formatted data (i.e., MNIST).final class
LibSVMDataSource<T extends Output<T>>
A DataSource which can read LibSVM formatted data.ModifierConstructorDescriptionCreates an aggregate data source which will iterate the provided sources in the order of the list (i.e., usingAggregateDataSource.IterationOrder.SEQUENTIAL
.AggregateConfigurableDataSource
(List<ConfigurableDataSource<T>> sources, AggregateDataSource.IterationOrder order) Creates an aggregate data source using the supplied sources and iteration order. -
Uses of ConfigurableDataSource in org.tribuo.json
Modifier and TypeClassDescriptionclass
JsonDataSource<T extends Output<T>>
ADataSource
for loading data from a JSON text file and applyingFieldProcessor
s to it. -
Uses of ConfigurableDataSource in org.tribuo.multilabel.example
Modifier and TypeClassDescriptionfinal class
Generates a multi label output drawn from a series of functions. -
Uses of ConfigurableDataSource in org.tribuo.regression.example
Modifier and TypeClassDescriptionclass
Generates a single dimensional output drawn from N(slope*x + intercept,variance).class
Generates a single dimensional output drawn from N(w_0*x_0 + w_1*x_1 + w_2*x_1*x_0 + w_3*x_1*x_1*x_1 + intercept,variance).