Class RegexFieldProcessor
java.lang.Object
org.tribuo.data.columnar.processors.field.RegexFieldProcessor
- All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Configurable
,com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
,FieldProcessor
A
FieldProcessor
which applies a regex to a field and generates ColumnarFeature
s based on the matches.-
Nested Class Summary
Nested classes/interfaces inherited from interface org.tribuo.data.columnar.FieldProcessor
FieldProcessor.GeneratedFeatureType
-
Field Summary
Fields inherited from interface org.tribuo.data.columnar.FieldProcessor
NAMESPACE
-
Constructor Summary
ConstructorDescriptionRegexFieldProcessor
(String fieldName, String regex, EnumSet<RegexFieldProcessor.Mode> modes) Constructs a field processor which emits features when the field value matches the supplied regex.RegexFieldProcessor
(String fieldName, Pattern regex, EnumSet<RegexFieldProcessor.Mode> modes) Constructs a field processor which emits features when the field value matches the supplied regex. -
Method Summary
Modifier and TypeMethodDescriptionReturns a copy of this FieldProcessor bound to the supplied newFieldName.Returns the feature type this FieldProcessor generates.Gets the field name this FieldProcessor uses.com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance
void
Used by the OLCUT configuration system, and should not be called by external code.Processes the field value and generates a (possibly empty) list ofColumnarFeature
s.toString()
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface org.tribuo.data.columnar.FieldProcessor
getNumNamespaces
-
Constructor Details
-
RegexFieldProcessor
public RegexFieldProcessor(String fieldName, Pattern regex, EnumSet<RegexFieldProcessor.Mode> modes) Constructs a field processor which emits features when the field value matches the supplied regex.- Parameters:
fieldName
- The field name to read.regex
- The regex to use for matching.modes
- The matching mode.
-
RegexFieldProcessor
Constructs a field processor which emits features when the field value matches the supplied regex.The regex is compiled on construction.
- Parameters:
fieldName
- The field name to read.regex
- The regex to use for matching.modes
- The matching mode.
-
-
Method Details
-
postConfig
public void postConfig()Used by the OLCUT configuration system, and should not be called by external code.- Specified by:
postConfig
in interfacecom.oracle.labs.mlrg.olcut.config.Configurable
-
getFieldName
Description copied from interface:FieldProcessor
Gets the field name this FieldProcessor uses.- Specified by:
getFieldName
in interfaceFieldProcessor
- Returns:
- The field name.
-
process
Description copied from interface:FieldProcessor
Processes the field value and generates a (possibly empty) list ofColumnarFeature
s.- Specified by:
process
in interfaceFieldProcessor
- Parameters:
value
- The field value to process.- Returns:
- A list of
ColumnarFeature
s.
-
getFeatureType
Description copied from interface:FieldProcessor
Returns the feature type this FieldProcessor generates.- Specified by:
getFeatureType
in interfaceFieldProcessor
- Returns:
- The feature type.
-
copy
Description copied from interface:FieldProcessor
Returns a copy of this FieldProcessor bound to the supplied newFieldName.- Specified by:
copy
in interfaceFieldProcessor
- Parameters:
newFieldName
- The new field name for the copy.- Returns:
- A copy of this FieldProcessor.
-
toString
-
getProvenance
public com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance getProvenance()- Specified by:
getProvenance
in interfacecom.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
-