public abstract class AnomalyDataGenerator extends Object
Also provides a dataset generator that returns a training dataset with no anomalies, sampled from a single Gaussian, and a test dataset sampled from two Gaussians, where points drawn from the second are labelled anomalous.
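The sampling scheme described above can be illustrated without the library. The following is a minimal sketch, not the class's actual implementation: the `GaussianAnomalySketch` class, its `Point` and `Label` types, the means, the standard deviation, the explicit `seed` argument, and the argument checks are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Random;

class GaussianAnomalySketch {

    /** Label attached to each sampled point. */
    public enum Label { EXPECTED, ANOMALOUS }

    /** A dense feature vector together with its label. */
    public static final class Point {
        public final double[] features;
        public final Label label;

        Point(double[] features, Label label) {
            this.features = features;
            this.label = label;
        }
    }

    // Hypothetical distribution parameters, chosen only for illustration.
    private static final double[] EXPECTED_MEANS = {1.0, 2.0, 1.0, 2.0, 5.0};
    private static final double[] ANOMALOUS_MEANS = {-2.0, 2.0, -2.0, 2.0, -10.0};
    private static final double STD_DEV = 1.0;

    /** Draws one point from an axis-aligned Gaussian centred on the given means. */
    private static Point sample(Random rng, double[] means, Label label) {
        double[] features = new double[means.length];
        for (int i = 0; i < means.length; i++) {
            features[i] = means[i] + STD_DEV * rng.nextGaussian();
        }
        return new Point(features, label);
    }

    /** Training data: every point comes from the expected Gaussian. */
    public static List<Point> train(long size, long seed) {
        checkSize(size);
        Random rng = new Random(seed);
        List<Point> points = new ArrayList<>();
        for (long i = 0; i < size; i++) {
            points.add(sample(rng, EXPECTED_MEANS, Label.EXPECTED));
        }
        return points;
    }

    /** Test data: a two-Gaussian mixture, with the second component tagged anomalous. */
    public static List<Point> test(long size, double fractionAnomalous, long seed) {
        checkSize(size);
        if (fractionAnomalous <= 0.0 || fractionAnomalous >= 1.0) {
            throw new IllegalArgumentException("fractionAnomalous must be in (0, 1)");
        }
        Random rng = new Random(seed);
        List<Point> points = new ArrayList<>();
        for (long i = 0; i < size; i++) {
            // Choose the mixture component first, then sample from it.
            boolean anomalous = rng.nextDouble() < fractionAnomalous;
            points.add(anomalous
                    ? sample(rng, ANOMALOUS_MEANS, Label.ANOMALOUS)
                    : sample(rng, EXPECTED_MEANS, Label.EXPECTED));
        }
        return points;
    }

    private static void checkSize(long size) {
        if (size < 1) {
            throw new IllegalArgumentException("size must be at least 1");
        }
    }
}
```

In this sketch, `GaussianAnomalySketch.train(500, 1L)` returns 500 points all labelled `EXPECTED`, while `GaussianAnomalySketch.test(500, 0.3, 2L)` returns 500 points of which roughly 30% are labelled `ANOMALOUS`. Fixing the seed keeps the data reproducible, which is usually what a unit test needs.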
Constructor and Description |
---|
AnomalyDataGenerator() |
Modifier and Type | Method and Description |
---|---|
static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> | denseTrainTest(): Makes a simple dataset for training and testing. |
static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> | denseTrainTest(double negate): Generates a train/test dataset pair which is dense in the features. Each example has 4 features, {A,B,C,D}, and there are 4 clusters, {0,1,2,3}. |
static Example<Event> | emptyExample(): Generates an example with no features. |
static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> | gaussianAnomaly(): Generates two datasets, one without anomalies drawn from a single Gaussian and the second drawn from a mixture of two Gaussians, with the second tagged anomalous. |
static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> | gaussianAnomaly(long size, double fractionAnomalous): Generates two datasets, one without anomalies drawn from a single Gaussian and the second drawn from a mixture of two Gaussians, with the second tagged anomalous. |
static Example<Event> | invalidSparseExample(): Generates an example with the feature ids 1,5,8, which does not intersect with the ids used elsewhere in this class. |
static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> | sparseTrainTest(): Makes a simple dataset for training and testing. |
static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> | sparseTrainTest(double negate): Generates a pair of datasets, where the features are sparse, and unknown features appear in the test data. |
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> gaussianAnomaly()
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> gaussianAnomaly(long size, double fractionAnomalous)
size - The number of points to sample for each dataset.
fractionAnomalous - The fraction of anomalous data to generate.
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> denseTrainTest()
Used for smoke testing, doesn't have a real boundary.
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> denseTrainTest(double negate)
negate - Supply -1.0 to negate some feature values.
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> sparseTrainTest()
Used for smoke testing, doesn't have a real boundary.
public static com.oracle.labs.mlrg.olcut.util.Pair<Dataset<Event>,Dataset<Event>> sparseTrainTest(double negate)
negate - Supply -1.0 to negate some feature values.
public static Example<Event> invalidSparseExample()
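An example whose feature ids share nothing with the training features is useful for checking how a model handles input it cannot score. The following is a minimal, library-free sketch of that disjointness check. The `FeatureOverlapSketch` class and its methods are hypothetical, and treating {A,B,C,D} as the training feature names is an assumption borrowed from the dense generator's description.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

class FeatureOverlapSketch {

    /** Returns true when the example shares no feature name with the training features. */
    static boolean hasNoKnownFeatures(Set<String> trainingFeatures, Set<String> exampleFeatures) {
        return Collections.disjoint(trainingFeatures, exampleFeatures);
    }

    /** Small helper that builds a set of feature names. */
    static Set<String> setOf(String... names) {
        return new HashSet<>(Arrays.asList(names));
    }
}
```

With training features `setOf("A", "B", "C", "D")`, an example carrying the ids `setOf("1", "5", "8")` has no known features, whereas `setOf("A", "8")` still overlaps on `A`.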
Copyright © 2015–2021 Oracle and/or its affiliates. All rights reserved.