Package org.tribuo.data.text
Interface TextPipeline
- All Superinterfaces:
com.oracle.labs.mlrg.olcut.config.Configurable
,com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
- All Known Implementing Classes:
BasicPipeline
,BERTFeatureExtractor
,TokenPipeline
public interface TextPipeline
extends com.oracle.labs.mlrg.olcut.config.Configurable, com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
A pipeline that takes a String and returns a List of
Feature
s.
This list is not guaranteed to have unique elements.-
Method Summary
Methods inherited from interface com.oracle.labs.mlrg.olcut.config.Configurable
postConfig
Methods inherited from interface com.oracle.labs.mlrg.olcut.provenance.Provenancable
getProvenance
-
Method Details
-
process
Extracts a list of features from the supplied text, using the tag to prepend the feature names.- Parameters:
tag
- The feature name tag.data
- The text to extract.- Returns:
- The extracted features.
-