All Classes and Interfaces (Tribuo 4.3.2 API)

A FeatureMap used by the HashingTrainer to provide feature name hashing and guarantee that the Model does not contain feature name information, but still works with unhashed features names.

Hasher

An abstract base class for hash functions used to hash the names of features.

HashingOptions

An Options implementation which provides CLI arguments for the model hashing functionality.

HashingOptions.ModelHashingType

Supported types of hashes in CLI programs.

HashingSequenceTrainer<T>

A SequenceTrainer that hashes all the feature names on the way in.

HashingSequenceTrainer.HashingSequenceTrainerProvenance

Provenance for HashingSequenceTrainer.

HashingTrainer<T>

A Trainer which hashes the Dataset before the Model is produced.

HdbscanModel

A trained HDBSCAN* model which provides the cluster assignment labels and outlier scores for every data point.

HdbscanOptions

OLCUT Options for the HDBSCAN* implementation.

HdbscanTrainer

An HDBSCAN* trainer which generates a hierarchical, density-based clustering representation of the supplied data.

HdbscanTrainer.ClusterExemplar

A cluster exemplar, with attributes for the point's label, outlier score and its features.

HdbscanTrainer.Distance

Deprecated.

This Enum is deprecated in version 4.3, replaced by DistanceType

HeapMerger

Merges each SparseVector separately using a PriorityQueue as a heap.

Hinge

Hinge loss, scores the correct value margin and any incorrect predictions -margin.

Hinge

Hinge loss, scores the correct value margin and any incorrect predictions -margin.

HTMLOutput

Utilities for nice HTML output that can be put in wikis and such.

Huber

Huber loss, i.e., a mixture of l2 and l1 losses.

IdentityExtractor

Extracts the field value and emits it as a String.

IdentityProcessor

A FieldProcessor which converts the field name and value into a feature with a value of IdentityProcessor.FEATURE_VALUE.

IDFTransformation

A feature transformation that computes the IDF for features and then transforms them with a TF-IDF weighting.

IDFTransformation.IDFTransformationProvenance

Provenance for IDFTransformation.

IDXDataSource<T>

A DataSource which can read IDX formatted data (i.e., MNIST).

IDXDataSource.IDXData

Java side representation for an IDX file.

IDXDataSource.IDXDataSourceProvenance

Provenance class for IDXDataSource.

IDXDataSource.IDXType

The possible IDX input formats.

ImageConverter

Image converter.

ImageTransformer

Image transformer.

ImmutableAnomalyInfo

An ImmutableOutputInfo object for Events.

ImmutableClusteringInfo

An ImmutableOutputInfo object for ClusterIDs.

ImmutableDataset<T>

This is a Dataset which has an ImmutableFeatureMap to store the feature information.

ImmutableFeatureMap

ImmutableFeatureMap is used when unknown features should not be added to the FeatureMap.

ImmutableLabelInfo

An ImmutableOutputInfo object for Labels.

ImmutableMultiLabelInfo

An ImmutableOutputInfo for working with MultiLabel tasks.

ImmutableOutputInfo<T>

An OutputInfo that is fixed, and contains an id number for each valid output.

ImmutableRegressionInfo

A ImmutableOutputInfo for Regressors.

ImmutableSequenceDataset<T>

This is a SequenceDataset which has an ImmutableFeatureMap to store the feature information.

IncrementalTrainer<T,U>

An interface for incremental training of Models.

IndependentMultiLabelModel

A Model which wraps n binary models, where n is the size of the MultiLabel domain.

IndependentMultiLabelTrainer

Trains n independent binary Models, each of which predicts a single Label.

IndependentRegressionTreeModel

A Model wrapped around a list of decision tree root Nodes used to generate independent predictions for each dimension in a regression.

IndependentSequenceModel<T>

A SequenceModel which independently predicts each element of the sequence.

IndependentSequenceTrainer<T>

Trains a sequence model by training a regular model to independently predict every example in each sequence.

IndexedArrayExample<T>

A version of ArrayExample which also has the id numbers.

IndexedArrayExample.FeatureTuple

A tuple of the feature name, id and value.

IndexExtractor

An Extractor with special casing for loading the index from a Row.

InformationTheory

A class of (discrete) information theoretic functions.

InformationTheory.GTestStatistics

An immutable named tuple containing the statistics from a G test.

InformationTheoryDemo

Demo showing how to calculate various mutual informations and entropies.

InformationTheoryDemo.DemoOptions

Command line options.

InformationTheoryDemo.DistributionType

Type of data distribution.

IntArrayContainer

An array container which maintains the array and the size.

IntDoublePair

A Pair of a primitive int and a primitive double.

InterlockingCrescentsDataSource

A data source of two interleaved half circles.

IntExtractor

Extracts the field value and converts it to a int.

InvertedFeature

Internal datastructure for implementing a decision tree.

JMI

Selects features according to the Joint Mutual Information algorithm.

JointRegressorTrainingNode

A decision tree node used at training time.

JsonDataSource<T>

A DataSource for loading data from a JSON text file and applying FieldProcessors to it.

JsonDataSource.JsonDataSourceProvenance

Provenance for JsonDataSource.

JsonFileIterator

An iterator for JSON format files converting them into a format suitable for RowProcessor.

JsonUtil

Utilities for interacting with JSON objects or text representations.

KDTree

A k-d tree nearest neighbour query implementation.

KDTreeFactory

A factory which creates k-d tree nearest neighbour query objects.

Kernel

An interface for a Mercer kernel function.

KernelSVMModel

The inference time version of a kernel model trained using Pegasos.

KernelSVMOptions

Options for using the KernelSVMTrainer.

KernelSVMOptions.KernelEnum

The kernel types.

KernelSVMTrainer

A trainer for a kernelised model using the Pegasos optimiser.

KernelType

Kernel types from libsvm.

KFoldSplitter<T>

A k-fold splitter to be used in cross-validation.

KFoldSplitter.TrainTestFold<T>

Stores a train/test split for a dataset.

KMeansModel

A K-Means model with a selectable distance function.

KMeansOptions

OLCUT Options for the K-Means implementation.

KMeansTrainer

A K-Means trainer, which generates a K-means clustering of the supplied data.

KMeansTrainer.Distance

Deprecated.

This Enum is deprecated in version 4.3, replaced by DistanceType

KMeansTrainer.Initialisation

Possible initialization functions.

KNNClassifierOptions

CLI Options for training a k-nearest neighbour predictor.

KNNClassifierOptions.EnsembleCombinerType

The type of combination function.

KNNModel<T>

A k-nearest neighbours model.

KNNModel.Backend

The parallel backend for batch predictions.

KNNTrainer<T>

A Trainer for k-nearest neighbour models.

KNNTrainer.Distance

Deprecated.

This Enum is deprecated in version 4.3, replaced by DistanceType

L1Distance

L1 (or Manhattan) distance.

L2Distance

L2 (or Euclidean) distance.

Label

An immutable multi-class classification label.

LabelConfusionMatrix

A confusion matrix for Labels.

LabelConverter

Can convert a Label into a Tensor containing one hot encoding of the label and can convert a TFloat16 or TFloat32 into a Prediction or a Label.

LabelEvaluation

Adds multi-class classification specific metrics to ClassifierEvaluation.

LabelEvaluationUtil

Static utility functions for calculating performance metrics on Labels.

LabelEvaluationUtil.PRCurve

Stores the Precision-Recall curve as three arrays: the precisions, the recalls, and the thresholds associated with those values.

LabelEvaluationUtil.ROC

Stores the ROC curve as three arrays: the false positive rate, the true positive rate, and the thresholds associated with those rates.

LabelEvaluator

An Evaluator for Labels.

LabelFactory

A factory for making Label related classes.

LabelFactory.LabelFactoryProvenance

Provenance for LabelFactory.

LabelFeatureExtractor

A class for featurising labels from previous steps in Viterbi.

LabelImpurity

Calculates a tree impurity score based on label counts, weighted label counts or a probability distribution.

LabelInfo

The base class for information about multi-class classification Labels.

LabelledDataGenerator

Generates three example train and test datasets, used for unit testing.

LabelMetric

A EvaluationMetric for Labels which calculates the value based on a ConfusionMatrix.

LabelMetric.Context

The context for a LabelMetric is a ConfusionMatrix.

LabelMetrics

An enum of the default LabelMetrics supported by the multi-class classification evaluation package.

LabelObjective

An interface for single label prediction objectives.

LabelOneVOneTransformer

Can convert an OnnxValue into a Prediction or a Label.

LabelSequenceEvaluation

A class that can be used to evaluate a sequence label classification model element wise on a given set of data.

LabelSequenceEvaluator

A sequence evaluator for labels.

LabelTransformer

Can convert an OnnxValue into a Prediction or a Label.

LARSLassoTrainer

A trainer for a lasso linear regression model which uses LARS to construct the model.

LARSTrainer

A trainer for a linear regression model which uses least angle regression.

LeafNode<T>

An immutable leaf Node that can create a prediction.

LibLinearAnomalyModel

A Model which wraps a LibLinear-java anomaly detection model.

LibLinearAnomalyTrainer

A Trainer which wraps a liblinear-java anomaly detection trainer using a one-class SVM.

LibLinearClassificationModel

A Model which wraps a LibLinear-java classification model.

LibLinearClassificationTrainer

A Trainer which wraps a liblinear-java classifier trainer.

LibLinearModel<T>

A Model which wraps a LibLinear-java model.

LibLinearOptions

Command line options for working with a classification liblinear model.

LibLinearRegressionModel

A Model which wraps a LibLinear-java model.

LibLinearRegressionTrainer

A Trainer which wraps a liblinear-java regression trainer.

LibLinearTrainer<T>

A Trainer which wraps a liblinear-java trainer.

LibLinearType<T>

A carrier type for the liblinear algorithm type.

LibSVMAnomalyModel

An anomaly detection model that uses an underlying libSVM model to make the predictions.

LibSVMAnomalyTrainer

A trainer for anomaly models that uses LibSVM.

LibSVMClassificationModel

A classification model that uses an underlying LibSVM model to make the predictions.

LibSVMClassificationTrainer

A trainer for classification models that uses LibSVM.

LibSVMDataSource<T>

A DataSource which can read LibSVM formatted data.

LibSVMDataSource.LibSVMDataSourceProvenance

The provenance for a LibSVMDataSource.

LibSVMModel<T>

A model that uses an underlying libSVM model to make the predictions.

LibSVMOptions

CLI options for training a LibSVM classification model.

LibSVMRegressionModel

A regression model that uses an underlying libSVM model to make the predictions.

LibSVMRegressionTrainer

A trainer for regression models that uses LibSVM.

LibSVMTrainer<T>

A trainer that will train using libsvm's Java implementation.

LIMEBase

LIMEBase merges the lime_base.py and lime_tabular.py implementations, and deals with simple matrices of numerical or categorical data.

LIMEColumnar

Uses the columnar data processing infrastructure to mix text and tabular data.

LIMEExplanation

An Explanation using LIME.

LIMEText

Uses a Tribuo TextFeatureExtractor to explain the prediction for a given piece of text.

LIMETextCLI

A CLI for interacting with LIMEText.

LIMETextCLI.LIMETextCLIOptions

Command line options.

Linear

A linear kernel, u.dot(v).

LinearAnomalyType

The carrier type for liblinear anomaly detection modes.

LinearAnomalyType.LinearType

The different model types available for classification.

LinearClassificationType

The carrier type for liblinear classification modes.

LinearClassificationType.LinearType

The different model types available for classification.

LinearParameters

A Parameters for producing linear models.

LinearRegressionType

The carrier type for liblinear linear regression modes.

LinearRegressionType.LinearType

The type of linear regression algorithm.

LinearScalingTransformation

A Transformation which takes an observed distribution and rescales it so all values are between the desired min and max.

LinearScalingTransformation.LinearScalingTransformationProvenance

Provenance for LinearScalingTransformation.

LinearSGDModel

The inference time version of a linear model trained using SGD.

LinearSGDModel

The inference time version of a multi-label linear model trained using SGD.

LinearSGDModel

The inference time version of a linear model trained using SGD.

LinearSGDOptions

CLI options for training a linear classifier.

LinearSGDOptions

CLI options for training a linear classifier.

LinearSGDOptions.LossEnum

Available loss types.

LinearSGDOptions.LossEnum

Available loss types.

LinearSGDTrainer

A trainer for a linear classifier using SGD.

LinearSGDTrainer

A trainer for a multi-label linear model which uses SGD.

LinearSGDTrainer

A trainer for a linear regression model which uses SGD.

ListDataSource<T>

A data source which wraps up a list of Examples along with their DataSourceProvenance and an OutputFactory.

ListExample<T>

This class will not be performant until value types are available in Java.

LogisticRegressionTrainer

A logistic regression trainer that uses a reasonable objective, optimiser, number of epochs and minibatch size.

LogMulticlass

A multiclass version of the log loss.

Matrix

Interface for 2 dimensional Tensors.

Matrix.Factorization

Interface for matrix factorizations.

MatrixHeapMerger

Merges each DenseSparseMatrix using a PriorityQueue as a heap on the MatrixIterator.

MatrixIterator

A Comparable Iterator over MatrixTuples.

MatrixTuple

A mutable tuple used to avoid allocation when iterating a matrix.

MeanAbsoluteError

Measures the mean absolute error over a set of inputs.

MeanSquaredError

Measures the mean squared error over a set of inputs.

MeanStdDevTransformation

A Transformation which takes an observed distribution and rescales it so it has the desired mean and standard deviation.

MeanStdDevTransformation.MeanStdDevTransformationProvenance

Provenance for MeanStdDevTransformation.

MeanVarianceAccumulator

An accumulator for online calculation of the mean and variance of a stream of doubles.

Merger

An interface for merging an array of DenseSparseMatrix into a single DenseSparseMatrix.

Merger

An interface which can merge double values.

MessageDigestHasher

Hashes Strings using the supplied MessageDigest type.

MessageDigestHasher.MessageDigestHasherProvenance

Provenance for MessageDigestHasher.

MetricContext<T>

The context for a metric or set of metrics.

MetricID<T>

Just an easier-to-read alias for Pair<MetricTarget<T>, String>.

MetricTarget<T>

Used by a given EvaluationMetric to determine whether it should compute its value for a specific Output value or whether it should average them.

MIM

Selects features according to their mutual information with the class label (aka Mutual Information Maximisation).

MinimumCardinalityDataset<T>

This class creates a pruned dataset in which low frequency features that occur less than the provided minimum cardinality have been removed.

MinimumCardinalityDataset.MinimumCardinalityDatasetProvenance

Provenance for MinimumCardinalityDataset.

MinimumCardinalitySequenceDataset<T>

This class creates a pruned dataset in which low frequency features that occur less than the provided minimum cardinality have been removed.

MinimumCardinalitySequenceDataset.MinimumCardinalitySequenceDatasetProvenance

Provenance for MinimumCardinalitySequenceDataset.

MLPExamples

Static factory methods which produce Multi-Layer Perceptron architectures.

Model<T>

A prediction model, which is used to predict outputs for unseen instances.

ModelCard

ModelCard feature to allow more transparent model reporting.

ModelCardCLI

A command line interface for creating and appending UsageDetails to the serialized version of an existing ModelCard.

ModelCardCLI.ModelCardCLIOptions

CLI options for ModelCardCLI.

ModelDataCarrier<T>

Serialization carrier for common fields in Model and SequenceModel.

ModelDetails

ModelDetails section of a ModelCard.

ModelExplorer

A command line interface for loading in models and inspecting their feature and output spaces.

ModelExplorer.ModelExplorerOptions

CLI options for ModelExplorer.

ModelProvenance

Contains provenance information for an instance of a Model.

ModHashCodeHasher

Hashes names using String.hashCode(), then reduces the dimension.

ModHashCodeHasher.ModHashCodeHasherProvenance

Provenance for the ModHashCodeHasher.

mRMR

Selects features according to the Minimum Redundancy Maximum Relevance algorithm.

MultiLabel

A class for multi-label classification.

MultiLabelConfusionMatrix

A ConfusionMatrix which accepts MultiLabels.

MultiLabelConverter

Can convert a MultiLabel into a Tensor containing a binary encoding of the label vector and can convert a TFloat16 or TFloat32 into a Prediction or a MultiLabel.

MultiLabelDataGenerator

Generates three example train and test datasets, used for unit testing.

MultiLabelEvaluation

A MultiLabel specific ClassifierEvaluation.

MultiLabelEvaluationImpl

The implementation of a MultiLabelEvaluation using the default metrics.

MultiLabelEvaluator

An Evaluator for MultiLabel problems.

MultiLabelFactory

A factory for generating MultiLabel objects and their associated OutputInfo and Evaluator objects.

MultiLabelFactory.MultiLabelFactoryProvenance

Provenance for MultiLabelFactory.

MultiLabelGaussianDataSource

Generates a multi label output drawn from a series of functions.

MultiLabelGaussianDataSource.MultiLabelGaussianDataSourceProvenance

Provenance for MultiLabelGaussianDataSource.

MultiLabelInfo

The base class for information about MultiLabel outputs.

MultiLabelMetric

A EvaluationMetric for evaluating MultiLabel problems.

MultiLabelMetrics

An enum of the default MultiLabelMetrics supported by the multi-label classification evaluation package.

MultiLabelObjective

An interface for multi-label prediction objectives.

MultiLabelTransformer

Can convert an OnnxValue into a Prediction or a MultiLabel.

MultiLabelVotingCombiner

A combiner which performs a weighted or unweighted vote independently across the predicted labels in each multi-label.

MultinomialNaiveBayesModel

A Model for multinomial Naive Bayes with Laplace smoothing.

MultinomialNaiveBayesOptions

CLI options for a multinomial naive bayes model.

MultinomialNaiveBayesTrainer

A Trainer which trains a multinomial Naive Bayes model with Laplace smoothing.

MultivariateNormalDistribution

A class for sampling from multivariate normal distributions.

MurmurHash3

The MurmurHash3 algorithm was created by Austin Appleby and placed in the public domain.

MurmurHash3.LongPair

128 bits of state

MutableAnomalyInfo

An MutableOutputInfo object for Events.

MutableClusteringInfo

A mutable ClusteringInfo.

MutableDataset<T>

A MutableDataset is a Dataset with a MutableFeatureMap which grows over time.

MutableFeatureMap

A feature map that can record new feature value observations.

MutableLabelInfo

A mutable LabelInfo.

MutableMultiLabelInfo

A MutableOutputInfo for working with multi-label tasks.

MutableOutputInfo<T>

A mutable OutputInfo that can record observed output values.

MutableRegressionInfo

A MutableOutputInfo for Regressors.

MutableSequenceDataset<T>

A MutableSequenceDataset is a SequenceDataset with a MutableFeatureMap which grows over time.

NeighboursBruteForce

A brute-force nearest neighbour query implementation.

NeighboursBruteForceFactory

A factory which creates brute-force nearest neighbour query objects.

NeighboursQuery

An interface for nearest neighbour query objects.

NeighboursQueryFactory

An interface for factories which create nearest neighbour query objects.

NeighboursQueryFactoryType

These are the supported neighbour query implementations.

NewsPreprocessor

A document pre-processor for 20 newsgroup data.

NgramProcessor

A text processor that will generate token ngrams of a particular size.

Node<T>

A node in a decision tree.

NoisyInterlockingCrescentsDataSource

A data source of two interleaved half circles with some zero mean Gaussian noise applied to each point.

NonlinearGaussianDataSource

Generates a single dimensional output drawn from N(w_0*x_0 + w_1*x_1 + w_2*x_1*x_0 + w_3*x_1*x_1*x_1 + intercept,variance).

NonlinearGaussianDataSource.NonlinearGaussianDataSourceProvenance

Provenance for NonlinearGaussianDataSource.

NonTokenizer

A convenience class for when you are required to provide a tokenizer but you don't actually want to split up the text into tokens.

NoopFeatureExtractor

A label feature extractor that doesn't produce any label based features.

NoopNormalizer

NoopNormalizer returns a copy in NoopNormalizer.normalize(double[]) and is a no-op in place.

Normalizer

Normalizes, but first subtracts the minimum value (to ensure positivity).

OCILabelConverter

A converter for DenseMatrix and DenseVector into Label Predictions.

OCIModel<T>

A wrapper class around an OCI Data Science Model Deployment endpoint which sends off inputs for scoring and converts the output into a Tribuo prediction.

OCIModel.PredictionJson

Carrier type for easy deserialization from JSON.

OCIModelCLI

This class provides a CLI for deploying and scoring a Tribuo Classification model.

OCIModelCLI.OCIModelOptions

Options for the OCIModelCLI.

OCIModelCLI.OCIModelOptions.Mode

Mode for the CLI.

OCIMultiLabelConverter

A converter for DenseMatrix and DenseVector into MultiLabel Predictions.

OCIOutputConverter<T>

Converter for a DenseMatrix received from OCI Data Science Model Deployment.

OCIRegressorConverter

A converter for DenseMatrix and DenseVector into Regressor Predictions.

OCIUtil

Utils for uploading and deploying models to OCI Data Science.

OCIUtil.OCIDSConfig

Configuration for OCI DS.

OCIUtil.OCIModelArtifactConfig

Configuration for an OCI DS Model artifact.

OCIUtil.OCIModelDeploymentConfig

Configuration for an OCI DS Model Deployment.

OCIUtil.OCIModelType

Enum for OCI model types.

OffsetDateTimeExtractor

Extracts the field value and translates it to an OffsetDateTime based on the specified DateTimeFormatter.

OnlineEvaluator<T,E>

An evaluator which aggregates predictions and produces Evaluations covering all the Predictions it has seen or created.

ONNXAttribute

The spec for an attribute, used to produce the attribute proto at construction time.

ONNXContext

Context object used to scope and manage the creation of ONNX OnnxMl.GraphProto and OnnxMl.ModelProto instances.

ONNXExportable

An interface which denotes this Model can be exported as an ONNX model.

ONNXExternalModel<T>

A Tribuo wrapper around a ONNX model.

ONNXInitializer

A subclass of ONNXRef specialized for OnnxMl.TensorProto.

ONNXMathUtils

Tribuo Math specific helper functions for building ONNX protos.

ONNXNode

A subclass of ONNXRef specialized for OnnxMl.NodeProto.

ONNXOperator

An interface for ONNX operators.

ONNXOperators

ONNX Opset 13, and ONNX-ML version 1.

ONNXPlaceholder

A subclass of ONNXRef specialized for OnnxMl.ValueInfoProto.

ONNXRef<T>

An abstract reference that represents both a node in an ONNX computation graph and a container for a specific ONNX proto object that denotes that node.

ONNXUtils

Helper functions for building ONNX protos.

Output<T>

Output is the root interface for the supported prediction types.

OutputConverter<T>

Converts the Output into a Tensor and vice versa.

OutputFactory<T>

An interface associated with a specific Output, which can generate the appropriate Output subclass, and OutputInfo subclass.

OutputFactoryProvenance

A tag provenance for an output factory.

OutputInfo<T>

Tracks relevant properties of the appropriate Output subclass.

OutputTransformer<T>

Converts an OnnxValue into an Output or a Prediction.

PairDistribution<T1,T2>

A count distribution over CachedPair objects.

ParameterAveraging

Averages the parameters across a gradient run.

Parameters

An interface to a Tensor[] array which accepts updates to the parameters.

Pegasos

An implementation of the Pegasos gradient optimiser used primarily for solving the SVM problem.

Polynomial

A polynomial kernel, (gamma*u.dot(v) + intercept)^degree.

Prediction<T>

A prediction made by a Model.

PreprocessAndSerialize

Reads in a Datasource, processes all the data, and writes it out as a serialized dataset.

PreprocessAndSerialize.PreprocessAndSerializeOptions

Command line options.

ProtoSerializable<T>

Interface for serializing an implementing object to the specified protobuf.

ProtoSerializableClass

Mark a class as being ProtoSerializable and specify the class type used to serialize the "serialized_data".

ProtoSerializableField

Annotation which denotes that a field should be part of the protobuf serialized representation.

ProtoSerializableKeysValuesField

Annotation which denotes that the map field this is applied to is serialized as two repeated fields, one for keys and one for values.

ProtoSerializableMapField

Annotation which denotes that a map field should be part of the protobuf serialized representation.

ProtoSerializableMapValuesField

Annotation which denotes that the map field this is applied to is serialized as a list of values.

ProtoUtil

Utilities for working with Tribuo protobufs.

Quartile

A quartile to split data into 4 chunks.

QuartileResponseProcessor<T>

Processes the response into quartiles and emits them as classification outputs.

RandomForestTrainer<T>

A trainer which produces a random forest.

Range

A range currently being segmented.

RBF

A Radial Basis Function (RBF) kernel, exp(-gamma*|u-v|^2).

RealIDInfo

Same as a RealInfo, but with an additional int id field.

RealInfo

Stores information about real valued features.

RegexFieldProcessor

A FieldProcessor which applies a regex to a field and generates ColumnarFeatures based on the matches.

RegexFieldProcessor.Mode

Matching mode.

RegexPreprocessor

A simple document preprocessor which applies regular expressions to the input.

RegressionDataGenerator

Generates two example train and test datasets, used for unit testing.

RegressionEvaluation

Defines methods that calculate regression performance.

RegressionEvaluator

A Evaluator for multi-dimensional regression using Regressors.

RegressionFactory

A factory for creating Regressors and RegressionInfos.

RegressionFactory.RegressionFactoryProvenance

Provenance for RegressionFactory.

RegressionInfo

The base class for regression information using Regressors.

RegressionMetric

A EvaluationMetric for Regressors which calculates the metric based on a the true values and the predicted values.

RegressionMetrics

An enum of the default RegressionMetrics supported by the multi-dimensional regression evaluation package.

RegressionObjective

An interface for regression objectives.

RegressionSufficientStatistics

The sufficient statistics for regression metrics (i.e., each prediction and each true value).

Regressor

An Output for n-dimensional real valued regression.

Regressor.DimensionTuple

A Regressor which contains a single dimension, used internally when the model implementation doesn't natively support multi-dimensional regression outputs.

RegressorConverter

Can convert a Regressor to a TFloat32 vector and a TFloat32 into a Prediction or Regressor.

RegressorImpurity

Calculates a tree impurity score based on the regression targets.

RegressorImpurity.ImpurityTuple

Tuple class for the impurity and summed weight.

RegressorTrainingNode

A decision tree node used at training time.

RegressorTrainingNode.InvertedData

Tuple containing an inverted dataset (i.e., feature-wise not exmaple-wise).

RegressorTransformer

Can convert an OnnxValue into a Prediction or Regressor.

ReproUtil<T>

Reproducibility utility based on Tribuo's provenance objects.

ReproUtil.FeatureDiff

Record for any differences between feature sets.

ReproUtil.ModelReproduction<T>

Record for a model reproduction.

ReproUtil.OutputDiff<T>

Record for any differences between output domains.

Resources

Utils for working with classpath resources at test time.

ResponseProcessor<T>

An interface that will take the response field and produce an Output.

ResultSetIterator

An iterator over a ResultSet returned from JDBC.

RMSProp

An implementation of the RMSProp gradient optimiser.

Row<T>

A row of values from a RowList.

RowList<T>

An implementation of a List which wraps a set of lists.

RowProcessor<T>

A processor which takes a Map of String to String and returns an Example.

RowProcessor.Builder<T>

Builder for RowProcessor.

RunAll

Trains and tests a model using the supplied data, for each trainer inside a configuration file.

RunAll.RunAllOptions

Command line options.

SelectedFeatureDataset<T>

This class creates a pruned dataset which only contains the selected features.

SelectedFeatureDataset.SelectedFeatureDatasetProvenance

Provenance for SelectedFeatureDataset.

SelectedFeatureSet

A record-like class for a selected feature set.

SeqTest

Build and run a sequence classifier on a generated dataset.

SeqTest.CRFOptions

Command line options.

SeqTrainTest

Build and run a sequence classifier on a generated or serialized dataset using the trainer specified in the configuration file.

SeqTrainTest.SeqTrainTestOptions

Command line options.

SequenceDataGenerator

A data generator for smoke testing sequence label models.

SequenceDataset<T>

A class for sets of data, which are used to train and evaluate classifiers.

SequenceDataSource<T>

A interface for things that can be given to a SequenceDataset's constructor.

SequenceEvaluation<T>

An immutable evaluation of a specific sequence model and dataset.

SequenceEvaluator<T,E>

An evaluation factory which produces immutable SequenceEvaluations of a given SequenceDataset using the given SequenceModel.

SequenceExample<T>

A sequence of examples, used for sequence classification.

SequenceFeatureConverter

Converts a sequence example into a feed dict suitable for TensorFlow.

SequenceModel<T>

A prediction model, which is used to predict outputs for unseen instances.

SequenceModelExplorer

A CLI for interacting with a SequenceModel.

SequenceModelExplorer.SequenceModelExplorerOptions

Command line options.

SequenceOutputConverter<T>

Converts a TensorFlow output tensor into a list of predictions, and a Tribuo sequence example into a Tensorflow tensor suitable for training.

SequenceTrainer<T>

An interface for things that can train sequence prediction models.

SGD

An implementation of single learning rate SGD and optionally momentum.

SGD.Momentum

Momentum types.

SGDObjective<T>

An interface for a loss function that can produce the loss and gradient incurred by a single prediction.

SGDVector

Interface for 1 dimensional Tensors.

ShapeTokenizer

This tokenizer is loosely based on the notion of word shape which is a common feature used in NLP.

ShrinkingMatrix

A subclass of DenseMatrix which shrinks the value every time a new value is added.

ShrinkingTensor

An interface which tags a Tensor with a convertToDense method.

ShrinkingVector

A subclass of DenseVector which shrinks the value every time a new value is added.

Sigmoid

A sigmoid kernel, tanh(gamma*u.dot(v) + intercept).

SigmoidNormalizer

Normalizes the input by applying a logistic sigmoid to each element.

SimpleDataSourceProvenance

This class stores a String describing the data source, along with a timestamp.

SimpleFieldExtractor<T>

Extracts a value from a single field to be placed in an Example's metadata field.

SimpleStringDataSource<T>

A version of SimpleTextDataSource that accepts a List of Strings.

SimpleStringDataSource.SimpleStringDataSourceProvenance

Provenance for SimpleStringDataSource.

SimpleTextDataSource<T>

A dataset for a simple data format for text classification experiments.

SimpleTextDataSource.SimpleTextDataSourceProvenance

Provenance for SimpleTextDataSource.

SimpleTransform

This is used for stateless functions such as exp, log, addition or multiplication by a constant.

SimpleTransform.Operation

Operations understood by this Transformation.

SimpleTransform.SimpleTransformProvenance

Provenance for SimpleTransform.

SkeletalIndependentRegressionModel

A Model which wraps n independent regression models, where n is the size of the MultipleRegressor domain.

SkeletalIndependentRegressionSparseModel

A SparseModel which wraps n independent regression models, where n is the size of the MultipleRegressor domain.

SkeletalIndependentRegressionSparseTrainer<T>

Base class for training n independent sparse models, one per dimension.

SkeletalIndependentRegressionTrainer<T>

Trains n independent binary Models, each of which predicts a single Regressor.

SkeletalTrainerProvenance

The skeleton of a TrainerProvenance that extracts the configured parameters.

SkeletalVariableInfo

Contains information about a feature and can be stored in the feature map in a Dataset.

SLMTrainer

A trainer for a sparse linear regression model.

SparseLinearModel

The inference time version of a sparse linear regression model.

SparseModel<T>

A model which uses a subset of the features it knows about to make predictions.

SparseTrainer<T>

Denotes this trainer emits a SparseModel.

SparseVector

A sparse vector.

SplitCharactersTokenizer

This implementation of Tokenizer is instantiated with an array of characters that are considered split characters.

SplitCharactersTokenizer.SplitCharactersSplitterFunction

Splits tokens at the supplied characters.

SplitCharactersTokenizerOptions

CLI options for a SplitCharactersTokenizer.

SplitFunctionTokenizer

This class supports character-by-character (that is, codepoint-by-codepoint) iteration over input text to create tokens.

SplitFunctionTokenizer.SplitFunction

An interface for checking if the text should be split at the supplied codepoint.

SplitFunctionTokenizer.SplitResult

A combination of a SplitFunctionTokenizer.SplitType and a Token.TokenType.

SplitFunctionTokenizer.SplitType

Defines different ways that a tokenizer can split the input text at a given character.

SplitNode<T>

An immutable Node with a split and two child nodes.

SplitPatternTokenizer

This implementation of Tokenizer is instantiated with a regular expression pattern which determines how to split a string into tokens.

SplitPatternTokenizerOptions

CLI options for a SplitPatternTokenizer.

SplitTextData

Splits data in our standard text format into training and testing portions.

SplitTextData.TrainTestSplitOptions

Command line options.

SQLDataSource<T>

A DataSource for loading columnar data from a database and applying FieldProcessors to it.

SQLDataSource.SQLDataSourceProvenance

Provenance for SQLDataSource.

SQLDBConfig

N.B.

SQLToCSV

Read an SQL query in on the standard input, write a CSV file containing the results to the standard output.

SQLToCSV.SQLToCSVOptions

Command line options.

SquaredLoss

Squared loss, i.e., l2.

StochasticGradientOptimiser

Interface for gradient based optimisation methods.

StripProvenance

A main class for stripping out and storing provenance from a model.

StripProvenance.ProvenanceTypes

Types of provenance that can be removed.

StripProvenance.StripProvenanceOptions

Command line options.

SumAggregator

A feature aggregator that aggregates occurrence counts across a number of feature lists.

SVMAnomalyType

The carrier type for LibSVM anomaly detection modes.

SVMAnomalyType.SVMMode

Valid SVM modes for anomaly detection.

SVMClassificationType

The carrier type for LibSVM classification modes.

SVMClassificationType.SVMMode

The classification model types.

SVMParameters<T>

A container for SVM parameters and the kernel.

SVMRegressionType

The carrier type for LibSVM regression modes.

SVMRegressionType.SVMMode

Type of regression SVM.

SVMType<T>

A carrier type for the SVM type.

TabularExplainer<T>

An explainer for tabular data.

Tensor

An interface for Tensors, currently Vectors and Matrices.

TensorFlowCheckpointModel<T>

This model encapsulates a simple model with an input feed dict, and produces a single output tensor.

TensorFlowFrozenExternalModel<T>

A Tribuo wrapper around a TensorFlow frozen model.

TensorFlowModel<T>

Base class for a TensorFlow model that operates on Examples.

TensorFlowNativeModel<T>

This model encapsulates a TensorFlow model running in graph mode with a single tensor output.

TensorFlowSavedModelExternalModel<T>

A Tribuo wrapper around a TensorFlow saved model bundle.

TensorFlowSequenceModel<T>

A TensorFlow model which implements SequenceModel, suitable for use in sequential prediction tasks.

TensorFlowSequenceTrainer<T>

A trainer for SequenceModels which use an underlying TensorFlow graph.

TensorFlowSequenceTrainer.TensorFlowSequenceTrainerProvenance

Provenance for TensorFlowSequenceTrainer.

TensorFlowTrainer<T>

Trainer for TensorFlow.

TensorFlowTrainer.TensorFlowTrainerProvenance

Provenance for TensorFlowTrainer.

TensorFlowTrainer.TFModelFormat

The model format to emit.

TensorFlowUtil

Helper functions for working with TensorFlow.

TensorFlowUtil.TensorTuple

A serializable tuple containing the tensor class name, the shape and the data.

TensorMap

A map of names and tensors to feed into a session.

Test

Test a classifier for a standard dataset.

Test.ConfigurableTestOptions

Command line options.

TestingDetails

TestingDetails section of a ModelCard.

TextDataSource<T>

A base class for textual data sets.

TextExplainer<T>

An explainer for text data.

TextFeatureExtractor<T>

An interface for things that take text and turn them into examples that we can use to train or evaluate a classifier.

TextFeatureExtractorImpl<T>

An implementation of TextFeatureExtractor that takes a TextPipeline and generates ArrayExample.

TextFieldProcessor

A FieldProcessor which takes a text field and runs a TextPipeline on it to generate features.

TextPipeline

A pipeline that takes a String and returns a List of Features.

TextProcessingException

An exception thrown by the text processing system.

TextProcessor

A TextProcessor takes some text and optionally a feature tag and generates a list of Features from that text.

TimestampedTrainerProvenance

A TrainerProvenance with a timestamp, used when there was no trainer involved in model construction (e.g., creating an EnsembleModel from existing models).

Token

A single token extracted from a String.

Token.TokenType

Tokenizers may product multiple kinds of tokens, depending on the application to which they're being put.

TokenizationException

Wraps exceptions thrown by tokenizers.

Tokenizer

An interface for things that tokenize text: breaking it into words according to some set of rules.

TokenizerOptions

CLI Options for creating a tokenizer.

TokenPipeline

A pipeline for generating ngram features.

Trainer<T>

An interface for things that can train predictive models.

TrainerProvenance

A tag interface for trainer provenances.

TrainerProvenanceImpl

An implementation of TrainerProvenance that delegates everything to SkeletalTrainerProvenance.

TrainingDetails

TrainingDetails section of a ModelCard.

TrainTest

Build and run a decision tree classifier for a standard dataset.

TrainTest

Build and run a classifier for a standard dataset.

TrainTest

Build and run a liblinear-java classifier for a standard dataset.

TrainTest

Build and run a LibSVM classifier for a standard dataset.

TrainTest

Build and run a multinomial naive bayes classifier for a standard dataset.

TrainTest

Build and run a classifier for a standard dataset using FMClassificationTrainer.

TrainTest

Build and run a kernel SVM classifier for a standard dataset.

TrainTest

Build and run a classifier for a standard dataset using LinearSGDTrainer.

TrainTest

Build and run an XGBoost classifier for a standard dataset.

TrainTest

Build and run a HDBSCAN* clustering model for a standard dataset.

TrainTest

Build and run a k-means clustering model for a standard dataset.

TrainTest

Build and run a Tensorflow multi-class classifier for a standard dataset.

TrainTest

Build and run a LibLinear regressor for a standard dataset.

TrainTest

Build and run a LibSVM regressor for a standard dataset.

TrainTest

Build and run a regression tree for a standard dataset.

TrainTest

Build and run a regression factorization machine for a standard dataset.

TrainTest

Build and run a linear regression for a standard dataset.

TrainTest

Build and run a sparse linear regression model for a standard dataset.

TrainTest

Build and run an XGBoost regressor for a standard dataset.

TrainTest.AllClassificationOptions

Command line options.

TrainTest.FMRegressionOptions

Command line options.

TrainTest.HdbscanCLIOptions

Options for the HDBSCAN* CLI.

TrainTest.ImpurityType

Impurity function.

TrainTest.InputType

Type of feature extractor.

TrainTest.KMeansOptions

Options for the K-Means CLI.

TrainTest.LibLinearOptions

Command line options.

TrainTest.LibSVMOptions

Command line options.

TrainTest.LossEnum

Loss function.

TrainTest.LossEnum

Loss function.

TrainTest.RegressionTreeOptions

Command line options.

TrainTest.SGDOptions

Command line options.

TrainTest.SLMOptions

Command line options.

TrainTest.SLMType

Type of sparse linear model.

TrainTest.TensorflowOptions

Options for training a model in TensorFlow.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TrainTestOptions

Command line options.

TrainTest.TreeType

Type of tree trainer.

TrainTest.XGBoostOptions

Command line options.

TrainTestHelper

This class provides static methods used by the demo classes in each classification backend.

TrainTestSplitter<T>

Splits data into training and testing sets.

TrainTestSplitter.SplitDataSourceProvenance

Provenance for a split data source.

Transformation

An interface representing a class of transformations which can be applied to a feature.

TransformationMap

A carrier type for a set of transformations to be applied to a Dataset.

TransformationMap.TransformationList

A carrier type as OLCUT does not support nested generics.

TransformationProvenance

A tag interface for provenances in the transformation system.

TransformedModel<T>

Wraps a Model with it's TransformerMap so all Examples are transformed appropriately before the model makes predictions.

Transformer

A fitted Transformation which can apply a transform to the input value.

TransformerMap

A collection of Transformers which can be applied to a Dataset or Example.

TransformerMap.TransformerMapProvenance

Provenance for TransformerMap.

TransformStatistics

An interface for the statistics that need to be collected for a specific Transformation on a single feature.

TransformTrainer<T>

A Trainer which encapsulates another trainer plus a TransformationMap object to apply to each Dataset before training each Model.

TreeFeature

An inverted feature, which stores a reference to all the values of this feature.

TreeModel<T>

A Model wrapped around a decision tree root Node.

Tribuo

This class stores the current Tribuo version, along with other compile time information.

TripleDistribution<T1,T2,T3>

Generates the counts for a triplet of vectors.

UniqueAggregator

Aggregates feature tokens, generating unique features.

UniqueProcessor

Processes a feature list, aggregating all the feature values with the same name.

UniqueProcessor.UniqueType

The type of reduction operation to perform.

UniversalTokenizer

This class was originally written for the purpose of document indexing in an information retrieval context (principally used in Sun Labs' Minion search engine).

UsageDetails

UsageDetails section of a ModelCard.

UsageDetailsBuilder

A builder class for creating an instance of UsageDetails.

SGD utilities.

Utilities.

Ye olde util class.

A nominal tuple.

Util.SequenceExampleArray

A nominal tuple.

VariableIDInfo

Adds an id number to a VariableInfo.

VariableInfo

A VariableInfo subclass contains information about a feature and its observed values.

VectorIterator

A Comparable Iterator over VectorTuples.

VectorNormalizer

A functional interface that generates a normalized version of a double array.

VectorTuple

A mutable tuple used to avoid allocation when iterating a vector.

ViterbiModel

An implementation of a viterbi model.

ViterbiModel.ScoreAggregation

Types of label score aggregation.

ViterbiTrainer

Builds a Viterbi model using the supplied Trainer.

ViterbiTrainerOptions

Options for building a viterbi trainer.

ViterbiTrainerOptions.ViterbiLabelFeatures

Type of label features to include.

VotingCombiner

A combiner which performs a weighted or unweighted vote across the predicted labels.

WeightCountTuple

A mutable tuple of a double and a long.

WeightedEnsembleModel<T>

An ensemble model that uses weights to combine the ensemble member predictions.

WeightedExamples

Tag interface denoting that a Trainer can use example weights.

WeightedInformationTheory

A class of (discrete) weighted information theoretic functions.

WeightedInformationTheory.VariableSelector

Chooses which variable is the one with associated weights.

WeightedLabels

Tag interface denoting the Trainer can use label weights.

WeightedPairDistribution<T1,T2>

Generates the counts for a pair of vectors.

WeightedTripleDistribution<T1,T2,T3>

Generates the counts for a triplet of vectors.

WhitespaceTokenizer

A simple tokenizer that splits on whitespace.

Wordpiece

This is vanilla implementation of the Wordpiece algorithm as found here: https://github.com/huggingface/transformers/blob/master/src/transformers/models/bert/tokenization_bert.py

WordpieceBasicTokenizer

This is a tokenizer that is used "upstream" of WordpieceTokenizer and implements much of the functionality of the 'BasicTokenizer' implementation in huggingface.

WordpieceTokenizer

This Tokenizer is meant to be a reasonable approximation of the BertTokenizer defined here.

XGBoostClassificationConverter

Converts XGBoost outputs into Label Predictions.

XGBoostClassificationTrainer

A Trainer which wraps the XGBoost training procedure.

XGBoostExternalModel<T>

A Model which wraps around a XGBoost.Booster which was trained by a system other than Tribuo.

XGBoostFeatureImportance

Generate and collate feature importance information from the XGBoost model.

XGBoostFeatureImportance.XGBoostFeatureImportanceInstance

An instance of feature importance values for a single feature.