Class JMI
java.lang.Object
org.tribuo.classification.fs.JMI
- All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Configurable,com.oracle.labs.mlrg.olcut.provenance.Provenancable<FeatureSelectorProvenance>,FeatureSelector<Label>
Selects features according to the Joint Mutual Information algorithm.
Uses equal width binning for the feature values.
See:
Yang H, Moody J. "Data Visualization and Feature Selection: New Algorithms for Non-Gaussian Data" Advances in Neural Information Processing Systems (NIPS), 1999.and
Brown G, Pocock A, Zhao M-J, Lujan M. "Conditional Likelihood Maximisation: A Unifying Framework for Information Theoretic Feature Selection" Journal of Machine Learning Research (JMLR), 2012, PDF.
-
Field Summary
Fields inherited from interface org.tribuo.FeatureSelector
SELECT_ALL -
Constructor Summary
ConstructorsConstructorDescriptionJMI(int k, int numBins, int numThreads) Constructs a JMI feature selector that ranks the topkfeatures. -
Method Summary
Modifier and TypeMethodDescriptionbooleanDoes this feature selection algorithm return an ordered feature set?voidUsed by the OLCUT configuration system, and should not be called by external code.Selects features according to this selection algorithm from the specified dataset.
-
Constructor Details
-
JMI
public JMI(int k, int numBins, int numThreads) Constructs a JMI feature selector that ranks the topkfeatures.Continuous features are binned into
numBinsequal width bins.- Parameters:
k- The number of features to rank.numBins- The number of bins, must be greater than 1.numThreads- The number of computation threads to use.
-
-
Method Details
-
postConfig
public void postConfig()Used by the OLCUT configuration system, and should not be called by external code.- Specified by:
postConfigin interfacecom.oracle.labs.mlrg.olcut.config.Configurable
-
isOrdered
public boolean isOrdered()Description copied from interface:FeatureSelectorDoes this feature selection algorithm return an ordered feature set?- Specified by:
isOrderedin interfaceFeatureSelector<Label>- Returns:
- True if the set is ordered.
-
select
Description copied from interface:FeatureSelectorSelects features according to this selection algorithm from the specified dataset.- Specified by:
selectin interfaceFeatureSelector<Label>- Parameters:
dataset- The dataset to use.- Returns:
- A selected feature set.
-
getProvenance
- Specified by:
getProvenancein interfacecom.oracle.labs.mlrg.olcut.provenance.Provenancable<FeatureSelectorProvenance>
-