Uses of Interface
org.tribuo.DataSource
Packages that use DataSource

| Package | Description |
| --- | --- |
| org.tribuo | Provides the core interfaces and classes for using Tribuo. |
| org.tribuo.data.columnar | Provides classes for processing columnar data and generating Examples. |
| org.tribuo.data.csv | Provides classes which can load columnar data (using a RowProcessor) from a CSV (or other character delimited format) file. |
| org.tribuo.data.sql | Provides classes which can load columnar data (using a RowProcessor) from a SQL source. |
| org.tribuo.data.text.impl | Provides implementations of text data processors. |
| org.tribuo.datasource | Simple data sources for ingesting or aggregating data. |
| org.tribuo.evaluation | Evaluation base classes, along with code for train/test splits and cross validation. |
| org.tribuo.json | Provides interop with JSON formatted data, along with tools for interacting with JSON provenance objects. |
| org.tribuo.regression.example | Provides some example regression data generators for testing implementations. |

Uses of DataSource in org.tribuo
Subinterfaces of DataSource in org.tribuo

| Modifier and Type | Interface | Description |
| --- | --- | --- |
| interface | ConfigurableDataSource<T extends Output<T>> | It's a DataSource that's also Configurable. |

Constructors in org.tribuo with parameters of type DataSource

| Modifier | Constructor | Description |
| --- | --- | --- |
| protected | Dataset(DataSource<T> dataSource) | Creates a dataset. |
|  | ImmutableDataset(DataSource<T> dataSource, FeatureMap featureIDMap, OutputInfo<T> outputIDInfo, boolean dropInvalidExamples) | Creates a dataset from a data source. |
|  | ImmutableDataset(DataSource<T> dataSource, Model<T> model, boolean dropInvalidExamples) | Creates a dataset from a data source. |
|  | MutableDataset(DataSource<T> dataSource) | Creates a dataset from a data source. |

Uses of DataSource in org.tribuo.data.columnar
Classes in org.tribuo.data.columnar that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | ColumnarDataSource<T extends Output<T>> | A ConfigurableDataSource base class which takes columnar data (e.g., csv or DB table rows) and generates Examples. |

Uses of DataSource in org.tribuo.data.csv
Classes in org.tribuo.data.csv that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | CSVDataSource<T extends Output<T>> | A DataSource for loading separable data from a text file (e.g., CSV, TSV) and applying FieldProcessors to it. |

Uses of DataSource in org.tribuo.data.sql
Classes in org.tribuo.data.sql that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | SQLDataSource<T extends Output<T>> | A DataSource for loading columnar data from a database and applying FieldProcessors to it. |

Uses of DataSource in org.tribuo.data.text
Classes in org.tribuo.data.text that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | DirectoryFileSource<T extends Output<T>> | A data source for a somewhat-common format for text classification datasets: a top level directory that contains a number of subdirectories. |
| class | TextDataSource<T extends Output<T>> | A base class for textual data sets. |

Uses of DataSource in org.tribuo.data.text.impl
Classes in org.tribuo.data.text.impl that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | SimpleStringDataSource<T extends Output<T>> | A version of SimpleTextDataSource that accepts an Iterable of Strings. |
| class | SimpleTextDataSource<T extends Output<T>> | A dataset for a simple data format for text classification experiments. |

Uses of DataSource in org.tribuo.datasource
Classes in org.tribuo.datasource that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | AggregateDataSource<T extends Output<T>> | Aggregates multiple DataSources, and round-robins the iterators. |
| final class | IDXDataSource<T extends Output<T>> | A DataSource which can read IDX formatted data (i.e., MNIST). |
| final class | LibSVMDataSource<T extends Output<T>> | A DataSource which can read LibSVM formatted data. |
| class | ListDataSource<T extends Output<T>> | A data source which wraps up a list of Examples along with their DataSourceProvenance and an OutputFactory. |

Constructor parameters in org.tribuo.datasource with type arguments of type DataSource

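
The AggregateDataSource description above mentions round-robining the iterators of several sources. The following is a minimal sketch of that interleaving pattern using only java.util; it does not use Tribuo, and the class RoundRobinSketch and its roundRobin method are hypothetical names introduced for illustration. Skipping exhausted sources and continuing with the rest is an assumption of this sketch, not a statement about the library's behaviour.

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;

// Hypothetical illustration class; not part of Tribuo.
public class RoundRobinSketch {

    // Takes one element from each source in turn until every source is exhausted.
    // Assumption: sources that run out early are simply skipped on later passes.
    public static <T> List<T> roundRobin(List<? extends Iterable<T>> sources) {
        List<Iterator<T>> iterators = new ArrayList<>();
        for (Iterable<T> source : sources) {
            iterators.add(source.iterator());
        }
        List<T> output = new ArrayList<>();
        boolean emitted = true;
        while (emitted) {
            emitted = false;
            for (Iterator<T> it : iterators) {
                if (it.hasNext()) {
                    output.add(it.next());
                    emitted = true;
                }
            }
        }
        return output;
    }

    public static void main(String[] args) {
        List<List<String>> sources = List.of(
                List.of("a1", "a2", "a3"),
                List.of("b1"),
                List.of("c1", "c2"));
        // prints [a1, b1, c1, a2, c2, a3]
        System.out.println(roundRobin(sources));
    }
}
```

The sketch collects everything into a list for brevity; an iterator that advances lazily would follow the same loop without materialising the output.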
Uses of DataSource in org.tribuo.evaluation
Methods in org.tribuo.evaluation that return DataSource

| Modifier and Type | Method | Description |
| --- | --- | --- |
|  | TrainTestSplitter.getTest() | Gets the testing data source. |
|  | TrainTestSplitter.getTrain() | Gets the training data source. |

Methods in org.tribuo.evaluation with parameters of type DataSource

| Modifier and Type | Method | Description |
| --- | --- | --- |
| final E | AbstractEvaluator.evaluate(Model<T> model, DataSource<T> datasource) | Produces an evaluation for the supplied model and datasource, by calling Model.predict(org.tribuo.Example<T>) to create the predictions, then aggregating the appropriate statistics. |
|  | Evaluator.evaluate(Model<T> model, DataSource<T> datasource) | Evaluates the dataset using the supplied model, returning an immutable Evaluation of the appropriate type. |

Constructors in org.tribuo.evaluation with parameters of type DataSource

| Modifier | Constructor | Description |
| --- | --- | --- |
|  | TrainTestSplitter(DataSource<T> data) | Creates a splitter that splits a dataset 70/30 train and test using a default seed. |
|  | TrainTestSplitter(DataSource<T> data, double trainProportion, long seed) | Creates a splitter that will split the given data set into a training and testing set. |
|  | TrainTestSplitter(DataSource<T> data, long seed) | Creates a splitter that splits a dataset 70/30 train and test. |

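
The TrainTestSplitter constructors above take a training proportion and a seed. As a rough illustration of what a seeded proportional split involves, here is a sketch over a plain java.util.List. It does not use Tribuo; SplitSketch, Split and split are hypothetical names, and the shuffle-then-cut approach with java.util.Random is an assumption of this sketch rather than a description of how TrainTestSplitter is implemented.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.List;
import java.util.Random;

// Hypothetical illustration class; not part of Tribuo.
public class SplitSketch {

    // Simple holder for the two partitions.
    public static final class Split<T> {
        public final List<T> train;
        public final List<T> test;

        Split(List<T> train, List<T> test) {
            this.train = train;
            this.test = test;
        }
    }

    // Shuffles a copy of the data with a seeded RNG, then cuts it at the
    // requested proportion. The same seed always gives the same split.
    public static <T> Split<T> split(List<T> data, double trainProportion, long seed) {
        if (trainProportion < 0.0 || trainProportion > 1.0) {
            throw new IllegalArgumentException("trainProportion must be in [0, 1]");
        }
        List<T> shuffled = new ArrayList<>(data);
        Collections.shuffle(shuffled, new Random(seed));
        int trainSize = (int) Math.round(trainProportion * shuffled.size());
        List<T> train = new ArrayList<>(shuffled.subList(0, trainSize));
        List<T> test = new ArrayList<>(shuffled.subList(trainSize, shuffled.size()));
        return new Split<>(train, test);
    }

    public static void main(String[] args) {
        List<Integer> data = new ArrayList<>();
        for (int i = 0; i < 10; i++) {
            data.add(i);
        }
        Split<Integer> split = split(data, 0.7, 1L);
        // prints 7 3
        System.out.println(split.train.size() + " " + split.test.size());
    }
}
```

Fixing the seed is what makes an experiment repeatable: two runs with the same data, proportion and seed see identical training and testing sets.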
Uses of DataSource in org.tribuo.json
Classes in org.tribuo.json that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class | JsonDataSource<T extends Output<T>> | A DataSource for loading data from a JSON text file and applying FieldProcessors to it. |

Uses of DataSource in org.tribuo.regression.example
Classes in org.tribuo.regression.example that implement DataSource

| Modifier and Type | Class | Description |
| --- | --- | --- |
| class |  | Generates a single dimensional output drawn from N(slope*x + intercept, variance). |

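
To make the N(slope*x + intercept, variance) description concrete, the sketch below draws (x, y) pairs where y is the line value plus Gaussian noise with the given variance. It uses only java.util.Random and is not the Tribuo class; GaussianLineSketch, generate, and the xMin/xMax/seed parameters are hypothetical, and sampling x uniformly from a range is an assumption made for the example.

```java
import java.util.Random;

// Hypothetical illustration class; not part of Tribuo.
public class GaussianLineSketch {

    // Returns n rows of {x, y}, where x is uniform in [xMin, xMax) (an assumption)
    // and y is drawn from a normal distribution with mean slope*x + intercept
    // and the supplied variance.
    public static double[][] generate(int n, double slope, double intercept,
                                      double variance, double xMin, double xMax, long seed) {
        if (variance < 0.0) {
            throw new IllegalArgumentException("variance must be non-negative");
        }
        Random rng = new Random(seed);
        // nextGaussian() has unit variance, so scale by the standard deviation.
        double stdDev = Math.sqrt(variance);
        double[][] points = new double[n][2];
        for (int i = 0; i < n; i++) {
            double x = xMin + (xMax - xMin) * rng.nextDouble();
            double y = slope * x + intercept + stdDev * rng.nextGaussian();
            points[i][0] = x;
            points[i][1] = y;
        }
        return points;
    }

    public static void main(String[] args) {
        double[][] points = generate(5, 2.0, 1.0, 0.25, -1.0, 1.0, 7L);
        for (double[] p : points) {
            System.out.println("x=" + p[0] + " y=" + p[1]);
        }
    }
}
```

Setting the variance to zero removes the noise term, so every generated point lies exactly on the line, which is a convenient sanity check for a regression implementation.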