Interface TextProcessor
- All Superinterfaces:
com.oracle.labs.mlrg.olcut.config.Configurable,com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
- All Known Implementing Classes:
NgramProcessor
public interface TextProcessor
extends com.oracle.labs.mlrg.olcut.config.Configurable, com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
A TextProcessor takes some text and optionally a feature tag and generates a list of
Features from that text.-
Method Summary
Methods inherited from interface com.oracle.labs.mlrg.olcut.config.Configurable
postConfigMethods inherited from interface com.oracle.labs.mlrg.olcut.provenance.Provenancable
getProvenance
-
Method Details
-
process
Extracts features from the supplied text.- Parameters:
text- The text to extract.- Returns:
- The extracted features.
- Throws:
TextProcessingException- If an error occurred during extraction (usually from tokenization).
-
process
Extracts features from the supplied text.- Parameters:
tag- The feature name tag.text- The text to extract.- Returns:
- The extracted features.
- Throws:
TextProcessingException- If an error occurred during extraction (usually from tokenization).
-