Package org.tribuo.classification.fs
Class JMI
java.lang.Object
org.tribuo.classification.fs.JMI
- All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Configurable
,com.oracle.labs.mlrg.olcut.provenance.Provenancable<FeatureSelectorProvenance>
,FeatureSelector<Label>
Selects features according to the Joint Mutual Information algorithm.
Uses equal width binning for the feature values.
See:
Yang H, Moody J. "Data Visualization and Feature Selection: New Algorithms for Non-Gaussian Data" Advances in Neural Information Processing Systems (NIPS), 1999.and
Brown G, Pocock A, Zhao M-J, Lujan M. "Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection" Journal of Machine Learning Research (JMLR), 2012, PDF.
-
Field Summary
Fields inherited from interface org.tribuo.FeatureSelector
SELECT_ALL
-
Constructor Summary
ConstructorDescriptionJMI
(int k, int numBins, int numThreads) Constructs a JMI feature selector that ranks the topk
features. -
Method Summary
Modifier and TypeMethodDescriptionboolean
Does this feature selection algorithm return an ordered feature set?void
Used by the OLCUT configuration system, and should not be called by external code.Selects features according to this selection algorithm from the specified dataset.
-
Constructor Details
-
JMI
public JMI(int k, int numBins, int numThreads) Constructs a JMI feature selector that ranks the topk
features.Continuous features are binned into
numBins
equal width bins.- Parameters:
numBins
- The number of bins, must be greater than 1.k
- The number of features to rank.numThreads
- The number of computation threads to use.
-
-
Method Details
-
postConfig
public void postConfig()Used by the OLCUT configuration system, and should not be called by external code.- Specified by:
postConfig
in interfacecom.oracle.labs.mlrg.olcut.config.Configurable
-
isOrdered
public boolean isOrdered()Description copied from interface:FeatureSelector
Does this feature selection algorithm return an ordered feature set?- Specified by:
isOrdered
in interfaceFeatureSelector<Label>
- Returns:
- True if the set is ordered.
-
select
Description copied from interface:FeatureSelector
Selects features according to this selection algorithm from the specified dataset.- Specified by:
select
in interfaceFeatureSelector<Label>
- Parameters:
dataset
- The dataset to use.- Returns:
- A selected feature set.
-
getProvenance
- Specified by:
getProvenance
in interfacecom.oracle.labs.mlrg.olcut.provenance.Provenancable<FeatureSelectorProvenance>
-