Package org.tribuo.data.text
Interface TextProcessor
- All Superinterfaces:
com.oracle.labs.mlrg.olcut.config.Configurable, com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
- All Known Implementing Classes:
NgramProcessor
public interface TextProcessor
extends com.oracle.labs.mlrg.olcut.config.Configurable, com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
A TextProcessor takes some text and optionally a feature tag and generates a list of Features from that text.
Method Summary
Methods inherited from interface com.oracle.labs.mlrg.olcut.config.Configurable
postConfig
Methods inherited from interface com.oracle.labs.mlrg.olcut.provenance.Provenancable
getProvenance
Method Details
process
Extracts features from the supplied text.
- Parameters:
text - The text to extract features from.
- Returns:
The extracted features.
- Throws:
TextProcessingException - If an error occurred during extraction (usually from tokenization).
process
Extracts features from the supplied text.
- Parameters:
tag - The feature name tag.
text - The text to extract features from.
- Returns:
The extracted features.
- Throws:
TextProcessingException - If an error occurred during extraction (usually from tokenization).
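
The two process overloads can be illustrated with a minimal, self-contained sketch. It does not use the Tribuo classes: NamedValue, SketchProcessingException, SketchTextProcessor and WhitespaceUnigramProcessor are hypothetical stand-ins written for this example, and the "<tag>-<token>" naming scheme and whitespace tokenization are assumptions, not necessarily what NgramProcessor does.

```java
import java.util.ArrayList;
import java.util.List;

public class TextProcessorSketch {

    /** Hypothetical stand-in for a feature: a name paired with a value. */
    static final class NamedValue {
        final String name;
        final double value;

        NamedValue(String name, double value) {
            this.name = name;
            this.value = value;
        }
    }

    /** Hypothetical stand-in for the checked exception thrown on extraction failure. */
    static final class SketchProcessingException extends Exception {
        SketchProcessingException(String message) {
            super(message);
        }
    }

    /** Mirrors the two documented overloads, using the stand-in types above. */
    interface SketchTextProcessor {
        List<NamedValue> process(String text) throws SketchProcessingException;

        List<NamedValue> process(String tag, String text) throws SketchProcessingException;
    }

    /** Emits one feature per whitespace-separated token, each with value 1.0. */
    static final class WhitespaceUnigramProcessor implements SketchTextProcessor {

        @Override
        public List<NamedValue> process(String text) throws SketchProcessingException {
            // The untagged overload delegates to the tagged one with an empty tag.
            return process("", text);
        }

        @Override
        public List<NamedValue> process(String tag, String text) throws SketchProcessingException {
            if (text == null) {
                throw new SketchProcessingException("text must not be null");
            }
            // Assumed naming scheme: "<tag>-<token>" when a tag is given, else the bare token.
            String prefix = (tag == null || tag.isEmpty()) ? "" : tag + "-";
            List<NamedValue> features = new ArrayList<>();
            for (String token : text.trim().split("\\s+")) {
                if (!token.isEmpty()) {
                    features.add(new NamedValue(prefix + token, 1.0));
                }
            }
            return features;
        }
    }

    public static void main(String[] args) throws SketchProcessingException {
        SketchTextProcessor processor = new WhitespaceUnigramProcessor();
        // Tagging keeps features from different text fields (e.g. title vs. body) distinct.
        for (NamedValue feature : processor.process("title", "hello tribuo world")) {
            System.out.println(feature.name + " = " + feature.value);
        }
    }
}
```

In this sketch the tag only prefixes the feature name, so the same token extracted from two differently tagged fields yields two distinct features. A real implementation would return Tribuo Feature objects and throw TextProcessingException instead of the stand-ins.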