public class BasicPipeline extends Object implements TextPipeline
A TextPipeline implementation. Generates unique ngrams.

Constructor and Description |
---|
BasicPipeline(Tokenizer tokenizer, int ngram) |

Modifier and Type | Method and Description |
---|---|
com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance | getProvenance() |
void | postConfig() Used by the OLCUT configuration system, and should not be called by external code. |
List<Feature> | process(String tag, String data) Extracts a list of features from the supplied text, using the tag to prepend the feature names. |
String | toString() |

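public BasicPipeline(Tokenizer tokenizer, int ngram)

public void postConfig()
Used by the OLCUT configuration system, and should not be called by external code.
Specified by: postConfig in interface com.oracle.labs.mlrg.olcut.config.Configurable

public List<Feature> process(String tag, String data)
Description copied from interface: TextPipeline
Extracts a list of features from the supplied text, using the tag to prepend the feature names.
Specified by: process in interface TextPipeline
Parameters:
tag - The feature name tag.
data - The text to extract.

public com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance getProvenance()
Specified by: getProvenance in interface com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
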
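
To make the description of process(String tag, String data) concrete, here is a minimal, library-free sketch of what "unique ngrams with tag-prefixed feature names" could look like. It is not the source of BasicPipeline. The class UniqueNgramSketch, the helper uniqueNgrams, the whitespace tokenization, the "tag-n=tok/tok" name format, and the choice to emit every size from 1 up to the requested ngram are all assumptions made for illustration. The real class delegates tokenization to the supplied Tokenizer and returns Feature objects rather than strings.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch only: this is NOT the BasicPipeline implementation.
class UniqueNgramSketch {

    // Hypothetical helper: returns one tag-prefixed name for every distinct
    // n-gram of size 1..maxN found in the text. The name format and the
    // whitespace tokenization are assumptions made for this example.
    static List<String> uniqueNgrams(String tag, String data, int maxN) {
        String trimmed = data.trim();
        String[] tokens = trimmed.isEmpty() ? new String[0] : trimmed.split("\\s+");
        // LinkedHashSet keeps first-seen order and silently drops repeats.
        Set<String> names = new LinkedHashSet<>();
        for (int n = 1; n <= maxN; n++) {
            for (int start = 0; start + n <= tokens.length; start++) {
                StringBuilder sb = new StringBuilder(tag).append('-').append(n).append('=');
                for (int k = 0; k < n; k++) {
                    if (k > 0) {
                        sb.append('/');
                    }
                    sb.append(tokens[start + k]);
                }
                names.add(sb.toString());
            }
        }
        return new ArrayList<>(names);
    }

    public static void main(String[] args) {
        // "the" occurs twice in the input but yields a single unigram name.
        for (String name : uniqueNgrams("body", "the cat saw the dog", 2)) {
            System.out.println(name);
        }
    }
}
```

In this sketch the tag plays the role the Javadoc describes: it is prepended to each feature name so that features extracted from different text fields do not collide.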
Copyright © 2015–2021 Oracle and/or its affiliates. All rights reserved.