Class RegressionDataGenerator
java.lang.Object
org.tribuo.regression.example.RegressionDataGenerator
Generates two example train and test datasets, used for unit testing.
They don't necessarily have linear regressed values,
it's for testing the machinery rather than accuracy.
Also can generate a variety of single dimensional gaussian datasets.
-
Field Summary
Fields -
Method Summary
Modifier and TypeMethodDescriptiondenseTrainTest
(double negate) Generates a train/test dataset pair which is dense in the features, each example has 4 features,{A,B,C,D}.Generates an example with no features.Generates an example with no features.Generates an example with the feature ids 1,5,8, which does not intersect with the ids used elsewhere in this class.Generates an example with the feature ids 1,5,8, which does not intersect with the ids used elsewhere in this class.multiDimDenseTrainTest
(double negate) Generates a train/test dataset pair which is dense in the features, each example has 4 features,{A,B,C,D}.Generates a pair of datasets, where the features are sparse, and unknown features appear in the test data.multiDimSparseTrainTest
(double negate) Generates a pair of datasets, where the features are sparse, and unknown features appear in the test data.sparseTrainTest
(double negate) Generates a pair of datasets, where the features are sparse, and unknown features appear in the test data.threeDimDenseTrainTest
(double negate, boolean remapIndices) Generates a train/test dataset pair which is dense in the features, each example has 4 features,{A,B,C,D}.
-
Field Details
-
firstDimensionName
-
secondDimensionName
-
thirdDimensionName
-
SINGLE_DIM_NAME
- See Also:
-
-
Method Details
-
multiDimDenseTrainTest
-
multiDimDenseTrainTest
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Regressor>, Dataset<Regressor>> multiDimDenseTrainTest(double negate) Generates a train/test dataset pair which is dense in the features, each example has 4 features,{A,B,C,D}.- Parameters:
negate
- Supply -1.0 to negate some features.- Returns:
- A pair of datasets.
-
threeDimDenseTrainTest
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Regressor>, Dataset<Regressor>> threeDimDenseTrainTest(double negate, boolean remapIndices) Generates a train/test dataset pair which is dense in the features, each example has 4 features,{A,B,C,D}.- Parameters:
negate
- Supply -1.0 to negate some features.remapIndices
- If true invert the indices of the output features. Warning: this should only be used as part of unit testing, it is not expected from standard datasets.- Returns:
- A pair of datasets.
-
multiDimSparseTrainTest
-
multiDimSparseTrainTest
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Regressor>, Dataset<Regressor>> multiDimSparseTrainTest(double negate) Generates a pair of datasets, where the features are sparse, and unknown features appear in the test data.- Parameters:
negate
- Supply -1.0 to negate some features.- Returns:
- A pair of datasets.
-
invalidMultiDimSparseExample
-
emptyMultiDimExample
-
denseTrainTest
-
denseTrainTest
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Regressor>, Dataset<Regressor>> denseTrainTest(double negate) Generates a train/test dataset pair which is dense in the features, each example has 4 features,{A,B,C,D}.- Parameters:
negate
- Supply -1.0 to negate some values in this dataset.- Returns:
- A pair of datasets.
-
sparseTrainTest
-
sparseTrainTest
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Regressor>, Dataset<Regressor>> sparseTrainTest(double negate) Generates a pair of datasets, where the features are sparse, and unknown features appear in the test data.- Parameters:
negate
- Supply -1.0 to negate some values in this dataset.- Returns:
- A pair of datasets.
-
invalidSparseExample
-
emptyExample
-