Uses of Interface
org.tribuo.provenance.DataProvenance
Packages that use DataProvenance
org.tribuo
  Provides the core interfaces and classes for using Tribuo.
org.tribuo.anomaly.example
  Provides anomaly data generators used for demos and testing implementations.
org.tribuo.classification.example
  Provides a multiclass data generator used for testing implementations, along with several synthetic data generators for 2d binary classification problems to be used in demos or tutorials.
org.tribuo.clustering.example
  Provides clustering data generators used for demos and testing implementations.
org.tribuo.data.csv
  Provides classes which can load columnar data (using a RowProcessor) from a CSV (or other character delimited format) file.
org.tribuo.data.sql
  Provides classes which can load columnar data (using a RowProcessor) from a SQL source.
org.tribuo.data.text
org.tribuo.data.text.impl
  Provides implementations of text data processors.
org.tribuo.dataset
  Provides utility datasets which subsample or otherwise transform the wrapped dataset.
org.tribuo.datasource
  Simple data sources for ingesting or aggregating data.
org.tribuo.evaluation
  Evaluation base classes, along with code for train/test splits and cross validation.
org.tribuo.interop
  This package contains the abstract implementation of an external model trained by something outside of Tribuo.
org.tribuo.json
  Provides interop with JSON formatted data, along with tools for interacting with JSON provenance objects.
org.tribuo.multilabel.example
  Provides a multi-label data generator for testing implementations and a configurable data source suitable for demos and tests.
org.tribuo.provenance
  Provides Tribuo specific infrastructure for the Provenance system which tracks models and datasets.
org.tribuo.provenance.impl
  Provides internal implementations for empty provenance classes and TrainerProvenance.
org.tribuo.regression.example
  Provides some example regression data generators for testing implementations.
org.tribuo.sequence
  Provides core classes for working with sequences of Examples.
Uses of DataProvenance in org.tribuo
Fields in org.tribuo declared as DataProvenance
  protected final DataProvenance Dataset.sourceProvenance
    The provenance of the data source, extracted on construction.
Methods in org.tribuo that return DataProvenance
  DataProvenance Dataset.getSourceProvenance()
    The provenance of the data this Dataset contains.
Constructors in org.tribuo with parameters of type DataProvenance
  protected Dataset(DataProvenance provenance, OutputFactory<T> outputFactory)
    Creates a dataset.
  ImmutableDataset(Iterable<Example<T>> dataSource, DataProvenance description, OutputFactory<T> outputFactory, FeatureMap featureIDMap, OutputInfo<T> outputIDInfo, boolean dropInvalidExamples)
    Creates a dataset from a data source.
  ImmutableDataset(Iterable<Example<T>> dataSource, DataProvenance description, OutputFactory<T> outputFactory, ImmutableFeatureMap featureIDMap, ImmutableOutputInfo<T> outputIDInfo, boolean dropInvalidExamples)
    Creates a dataset from a data source.
  protected ImmutableDataset(DataProvenance description, OutputFactory<T> outputFactory)
    If you call this it's your job to set up outputMap, featureIDMap and fill it with examples.
  protected ImmutableDataset(DataProvenance description, OutputFactory<T> outputFactory, ImmutableFeatureMap featureIDMap, ImmutableOutputInfo<T> outputIDInfo)
    This is dangerous, and should not be used unless you've overridden everything in ImmutableDataset.
  MutableDataset(Iterable<Example<T>> dataSource, DataProvenance provenance, OutputFactory<T> outputFactory)
    Creates a dataset from a data source.
  MutableDataset(DataProvenance sourceProvenance, OutputFactory<T> outputFactory)
    Creates an empty dataset.
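The constructors listed for org.tribuo all receive the source's DataProvenance at construction time, and getSourceProvenance() hands it back later. The pattern can be sketched in plain Java as below. This is only an illustration under stated assumptions: SketchProvenance, SketchDataset and SketchDatasetDemo are hypothetical stand-ins written for this example, not Tribuo classes, and the OutputFactory parameter of the real constructors is left out.

```java
import java.time.OffsetDateTime;
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a data provenance: a description plus a creation time.
final class SketchProvenance {
    final String description;
    final OffsetDateTime created;

    SketchProvenance(String description, OffsetDateTime created) {
        this.description = description;
        this.created = created;
    }
}

// Hypothetical stand-in for a dataset that remembers where its data came from.
class SketchDataset<T> {
    // Set once on construction, like the protected final sourceProvenance field.
    protected final SketchProvenance sourceProvenance;
    private final List<T> examples = new ArrayList<>();

    // Creates an empty dataset that only records its provenance.
    SketchDataset(SketchProvenance sourceProvenance) {
        this.sourceProvenance = sourceProvenance;
    }

    // Creates a dataset from a data source, copying every example it yields.
    SketchDataset(Iterable<T> dataSource, SketchProvenance sourceProvenance) {
        this(sourceProvenance);
        for (T example : dataSource) {
            examples.add(example);
        }
    }

    SketchProvenance getSourceProvenance() {
        return sourceProvenance;
    }

    int size() {
        return examples.size();
    }
}

class SketchDatasetDemo {
    public static void main(String[] args) {
        SketchProvenance prov = new SketchProvenance("hand-built examples", OffsetDateTime.now());
        SketchDataset<String> data = new SketchDataset<>(List.of("a", "b", "c"), prov);
        System.out.println(data.size() + " examples from: " + data.getSourceProvenance().description);
    }
}
```

Keeping the provenance in a final field means a dataset cannot silently change its recorded origin after construction, which is the property the field description above points at.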
Uses of DataProvenance in org.tribuo.anomaly.example
Classes in org.tribuo.anomaly.example that implement DataProvenance
Uses of DataProvenance in org.tribuo.classification.example
Classes in org.tribuo.classification.example that implement DataProvenance
Uses of DataProvenance in org.tribuo.clustering.example
Classes in org.tribuo.clustering.example that implement DataProvenance
Uses of DataProvenance in org.tribuo.data.csv
Classes in org.tribuo.data.csv that implement DataProvenance
  static class
    Provenance for CSVDataSource.
  static final class
    Deprecated.
Uses of DataProvenance in org.tribuo.data.sql
Classes in org.tribuo.data.sql that implement DataProvenance
Uses of DataProvenance in org.tribuo.data.text
Classes in org.tribuo.data.text that implement DataProvenance
Uses of DataProvenance in org.tribuo.data.text.impl
Classes in org.tribuo.data.text.impl that implement DataProvenance
  static class
    Provenance for SimpleStringDataSource.
  static class
    Provenance for SimpleTextDataSource.
Uses of DataProvenance in org.tribuo.dataset
Classes in org.tribuo.dataset that implement DataProvenance
  static final class
    Provenance for the DatasetView.
  static class
    Provenance for MinimumCardinalityDataset.
Uses of DataProvenance in org.tribuo.datasource
Classes in org.tribuo.datasource that implement DataProvenance
  static class
    Provenance for the AggregateConfigurableDataSource.
  static class
    Provenance for the AggregateDataSource.
  static final class
    Provenance class for IDXDataSource.
  static final class
    The provenance for a LibSVMDataSource.
Uses of DataProvenance in org.tribuo.evaluation
Classes in org.tribuo.evaluation that implement DataProvenance
  static class
    Provenance for a split data source.
Methods in org.tribuo.evaluation with parameters of type DataProvenance
  default OnlineEvaluator<T, E> Evaluator.createOnlineEvaluator(Model<T> model, DataProvenance provenance)
    Creates an online evaluator that maintains a list of all the predictions it has seen and can evaluate them upon request.
  final E AbstractEvaluator.evaluate(Model<T> model, List<Prediction<T>> predictions, DataProvenance dataProvenance)
    Produces an evaluation for the supplied model and predictions by aggregating the appropriate statistics.
  default E Evaluator.evaluate(Model<T> model, List<Prediction<T>> predictions, List<T> groundTruth, DataProvenance dataProvenance)
    Evaluates the model performance using the supplied predictions, returning an immutable Evaluation of the appropriate type.
  Evaluator.evaluate(Model<T> model, List<Prediction<T>> predictions, DataProvenance dataProvenance)
    Evaluates the model performance using the supplied predictions, returning an immutable Evaluation of the appropriate type.
Constructors in org.tribuo.evaluation with parameters of type DataProvenance
  OnlineEvaluator(Evaluator<T, E> evaluator, Model<T> model, DataProvenance provenance)
    Constructs an OnlineEvaluator which accumulates predictions.
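The online evaluator described above keeps every prediction it has seen and produces an evaluation on request, tagged with the provenance of the data. A rough, self-contained sketch of that accumulate-then-evaluate idea follows. All types here (SketchPrediction, SketchEvaluation, SketchOnlineEvaluator) are hypothetical stand-ins invented for this example, the provenance is reduced to a plain String, the model is omitted, and accuracy is an assumed metric chosen only to keep the example short.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical stand-in for a prediction paired with its ground truth.
final class SketchPrediction {
    final String predicted;
    final String truth;

    SketchPrediction(String predicted, String truth) {
        this.predicted = predicted;
        this.truth = truth;
    }
}

// Hypothetical stand-in for an immutable evaluation tagged with its data provenance.
final class SketchEvaluation {
    final double accuracy;
    final String dataProvenance;

    SketchEvaluation(double accuracy, String dataProvenance) {
        this.accuracy = accuracy;
        this.dataProvenance = dataProvenance;
    }
}

// Accumulates predictions and evaluates everything seen so far on request.
final class SketchOnlineEvaluator {
    private final String dataProvenance;
    private final List<SketchPrediction> seen = new ArrayList<>();

    SketchOnlineEvaluator(String dataProvenance) {
        this.dataProvenance = dataProvenance;
    }

    // Records one more prediction for later evaluation.
    void observe(SketchPrediction prediction) {
        seen.add(prediction);
    }

    // Computes accuracy over all recorded predictions (0.0 if none were recorded).
    SketchEvaluation evaluate() {
        long correct = seen.stream().filter(p -> p.predicted.equals(p.truth)).count();
        double accuracy = seen.isEmpty() ? 0.0 : (double) correct / seen.size();
        return new SketchEvaluation(accuracy, dataProvenance);
    }
}
```

Each call to evaluate() returns a fresh result object, so earlier evaluations stay valid as more predictions arrive.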
Uses of DataProvenance in org.tribuo.interop
Classes in org.tribuo.interop that implement DataProvenance
  class
    A dummy provenance used to describe the dataset of external models.
Uses of DataProvenance in org.tribuo.json
Classes in org.tribuo.json that implement DataProvenance
Uses of DataProvenance in org.tribuo.multilabel.example
Classes in org.tribuo.multilabel.example that implement DataProvenance
Uses of DataProvenance in org.tribuo.provenance
Subinterfaces of DataProvenance in org.tribuo.provenance
  interface
    A tag interface for configurable data source provenance.
  interface
    Data source provenance.
Classes in org.tribuo.provenance that implement DataProvenance
  class
    Base class for dataset provenance.
  class
    This class stores a String describing the data source, along with a timestamp.
Methods in org.tribuo.provenance that return DataProvenance
  DataProvenance DatasetProvenance.getSourceProvenance()
    The input data provenance.
  DataProvenance EvaluationProvenance.getTestDatasetProvenance()
    The test dataset provenance.
Constructors in org.tribuo.provenance with parameters of type DataProvenance
  protected DatasetProvenance(DataProvenance sourceProvenance, com.oracle.labs.mlrg.olcut.provenance.ListProvenance<com.oracle.labs.mlrg.olcut.provenance.ObjectProvenance> transformationProvenance, String datasetClassName, boolean isDense, boolean isSequence, int numExamples, int numFeatures, int numOutputs)
    Constructs a dataset provenance using the supplied information.
  DatasetProvenance(DataProvenance sourceProvenance, com.oracle.labs.mlrg.olcut.provenance.ListProvenance<com.oracle.labs.mlrg.olcut.provenance.ObjectProvenance> transformationProvenance, Dataset<T> dataset)
    Creates a dataset provenance from the supplied dataset.
  DatasetProvenance(DataProvenance sourceProvenance, com.oracle.labs.mlrg.olcut.provenance.ListProvenance<com.oracle.labs.mlrg.olcut.provenance.ObjectProvenance> transformationProvenance, SequenceDataset<T> dataset)
    Creates a dataset provenance from the supplied sequence dataset.
  EvaluationProvenance(ModelProvenance modelProvenance, DataProvenance datasetProvenance)
    Constructs an evaluation provenance from the supplied provenances.
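The constructors above nest provenances: a dataset provenance wraps the provenance of its source plus a list of transformations, and an evaluation provenance pairs a model provenance with the provenance of the test data. The chain can be walked back from an evaluation to the original source, as in the sketch below. Every type in it (SketchDataProv, SketchSourceProv, SketchDatasetProv, SketchEvaluationProv) is a hypothetical stand-in made up for this illustration, and the model provenance and transformations are reduced to plain Strings; none of this is Tribuo's or OLCUT's actual implementation.

```java
import java.util.List;

// Hypothetical stand-in for the DataProvenance role: anything describing where data came from.
interface SketchDataProv {
    String describe();
}

// Hypothetical leaf provenance: just a description of the original source.
final class SketchSourceProv implements SketchDataProv {
    private final String description;

    SketchSourceProv(String description) {
        this.description = description;
    }

    @Override
    public String describe() {
        return description;
    }
}

// Hypothetical dataset provenance: wraps a source provenance plus applied transformations.
final class SketchDatasetProv implements SketchDataProv {
    private final SketchDataProv sourceProvenance;
    private final List<String> transformations;
    private final int numExamples;

    SketchDatasetProv(SketchDataProv sourceProvenance, List<String> transformations, int numExamples) {
        this.sourceProvenance = sourceProvenance;
        this.transformations = List.copyOf(transformations);
        this.numExamples = numExamples;
    }

    SketchDataProv getSourceProvenance() {
        return sourceProvenance;
    }

    @Override
    public String describe() {
        return "dataset of " + numExamples + " examples, transformations " + transformations
                + ", built from: " + sourceProvenance.describe();
    }
}

// Hypothetical evaluation provenance: pairs a model description with the test data provenance.
final class SketchEvaluationProv {
    private final String modelProvenance;
    private final SketchDataProv testDatasetProvenance;

    SketchEvaluationProv(String modelProvenance, SketchDataProv testDatasetProvenance) {
        this.modelProvenance = modelProvenance;
        this.testDatasetProvenance = testDatasetProvenance;
    }

    String getModelProvenance() {
        return modelProvenance;
    }

    SketchDataProv getTestDatasetProvenance() {
        return testDatasetProvenance;
    }
}
```

Because the evaluation provenance accepts the general data provenance type rather than a concrete class, the test data can be described either by a bare source or by a full dataset provenance.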
Uses of DataProvenance in org.tribuo.provenance.impl
Classes in org.tribuo.provenance.impl that implement DataProvenance
  final class
    An empty DatasetProvenance, should not be used except by the provenance removal system.
  final class
    An empty DataSourceProvenance, should not be used except by the provenance removal system.
Uses of DataProvenance in org.tribuo.regression.example
Classes in org.tribuo.regression.example that implement DataProvenance
  static class
    Provenance for GaussianDataSource.
  static class
    Provenance for NonlinearGaussianDataSource.
Uses of DataProvenance in org.tribuo.sequence
Classes in org.tribuo.sequence that implement DataProvenance
Fields in org.tribuo.sequence declared as DataProvenance
  protected final DataProvenance SequenceDataset.sourceProvenance
    The provenance of the data source, extracted on construction.
Methods in org.tribuo.sequence that return DataProvenance
  DataProvenance SequenceDataset.getSourceProvenance()
    Returns the source provenance.
Methods in org.tribuo.sequence with parameters of type DataProvenance
  final E AbstractSequenceEvaluator.evaluate(SequenceModel<T> model, List<List<Prediction<T>>> predictions, DataProvenance dataProvenance)
    Produces an evaluation for the supplied model and predictions by aggregating the appropriate statistics.
  SequenceEvaluator.evaluate(SequenceModel<T> model, List<List<Prediction<T>>> predictions, DataProvenance dataProvenance)
    Evaluates the supplied model and predictions by aggregating the appropriate statistics.
Constructors in org.tribuo.sequence with parameters of type DataProvenance
  ImmutableSequenceDataset(Iterable<SequenceExample<T>> dataSource, DataProvenance sourceProvenance, FeatureMap featureIDMap, OutputInfo<T> outputIDInfo, OutputFactory<T> outputFactory)
    Creates a dataset from a data source.
  ImmutableSequenceDataset(Iterable<SequenceExample<T>> dataSource, DataProvenance sourceProvenance, ImmutableFeatureMap featureIDMap, ImmutableOutputInfo<T> outputIDInfo, OutputFactory<T> outputFactory)
    Creates a dataset from a data source.
  protected ImmutableSequenceDataset(DataProvenance sourceProvenance, ImmutableFeatureMap featureIDMap, ImmutableOutputInfo<T> outputIDInfo)
    This is dangerous, and should not be used unless you've overridden everything in ImmutableSequenceDataset.
  protected ImmutableSequenceDataset(DataProvenance sourceProvenance, OutputFactory<T> outputFactory)
    If you call this it's your job to set up outputIDInfo and featureIDMap.
  MutableSequenceDataset(Iterable<SequenceExample<T>> dataSource, DataProvenance sourceProvenance, OutputFactory<T> outputFactory)
    Creates a dataset from a data source.
  MutableSequenceDataset(DataProvenance sourceProvenance, OutputFactory<T> outputFactory)
    Creates an empty sequence dataset.
  protected SequenceDataset(DataProvenance sourceProvenance, OutputFactory<T> outputFactory)