Class RegexFieldProcessor
java.lang.Object
org.tribuo.data.columnar.processors.field.RegexFieldProcessor
- All Implemented Interfaces:
com.oracle.labs.mlrg.olcut.config.Configurable,com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>,FieldProcessor
A FieldProcessor which applies a regex to a field and generates ColumnarFeatures based on the matches.
-
Nested Class Summary
Nested classes/interfaces inherited from interface org.tribuo.data.columnar.FieldProcessor
FieldProcessor.GeneratedFeatureType
-
Field Summary
Fields inherited from interface org.tribuo.data.columnar.FieldProcessor
NAMESPACE
-
Constructor Summary
Constructor and Description
RegexFieldProcessor(String fieldName, String regex, EnumSet<RegexFieldProcessor.Mode> modes)
    Constructs a field processor which emits features when the field value matches the supplied regex.
RegexFieldProcessor(String fieldName, Pattern regex, EnumSet<RegexFieldProcessor.Mode> modes)
    Constructs a field processor which emits features when the field value matches the supplied regex.
-
Method Summary
Modifier and Type, Method, Description
copy
    Returns a copy of this FieldProcessor bound to the supplied newFieldName.
getFeatureType
    Returns the feature type this FieldProcessor generates.
getFieldName
    Gets the field name this FieldProcessor uses.
com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance getProvenance
void postConfig
    Used by the OLCUT configuration system, and should not be called by external code.
process
    Processes the field value and generates a (possibly empty) list of ColumnarFeatures.
toString()
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait
Methods inherited from interface org.tribuo.data.columnar.FieldProcessor
getNumNamespaces
-
Constructor Details
-
RegexFieldProcessor
public RegexFieldProcessor(String fieldName, Pattern regex, EnumSet<RegexFieldProcessor.Mode> modes)
Constructs a field processor which emits features when the field value matches the supplied regex.
- Parameters:
fieldName - The field name to read.
regex - The regex to use for matching.
modes - The matching mode.
-
RegexFieldProcessor
public RegexFieldProcessor(String fieldName, String regex, EnumSet<RegexFieldProcessor.Mode> modes)
Constructs a field processor which emits features when the field value matches the supplied regex.
The regex is compiled on construction.
- Parameters:
fieldName - The field name to read.
regex - The regex to use for matching.
modes - The matching mode.
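Because the String overload compiles its regex on construction, passing a pattern string should be interchangeable with passing a precompiled Pattern, and an invalid pattern string would be expected to fail at construction time rather than when a field is processed. The sketch below illustrates this using only java.util.regex. It assumes the String constructor compiles with default flags, which this page does not state; the class RegexCompileSketch and its methods are hypothetical and not part of Tribuo.

```java
import java.util.regex.Pattern;
import java.util.regex.PatternSyntaxException;

public class RegexCompileSketch {

    // Hypothetical helper: compiles a regex string with default flags,
    // mirroring what a String-accepting constructor is assumed to do.
    public static Pattern compile(String regex) {
        return Pattern.compile(regex);
    }

    // Returns true if the string is a syntactically valid regex,
    // false if compiling it throws PatternSyntaxException.
    public static boolean isValidRegex(String regex) {
        try {
            Pattern.compile(regex);
            return true;
        } catch (PatternSyntaxException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        Pattern fromString = compile("[0-9]+");
        Pattern precompiled = Pattern.compile("[0-9]+");

        // Both routes yield a Pattern with the same source expression.
        System.out.println(fromString.pattern().equals(precompiled.pattern()));

        // An unclosed character class is rejected at compile time.
        System.out.println(isValidRegex("[0-9"));
    }
}
```

Passing a precompiled Pattern is the natural choice when non-default flags such as Pattern.CASE_INSENSITIVE are needed, since a plain string cannot carry them except through inline flag syntax.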
-
-
Method Details
-
postConfig
Used by the OLCUT configuration system, and should not be called by external code.
- Specified by:
postConfig in interface com.oracle.labs.mlrg.olcut.config.Configurable
-
getFieldName
Description copied from interface: FieldProcessor
Gets the field name this FieldProcessor uses.
- Specified by:
getFieldName in interface FieldProcessor
- Returns:
The field name.
-
process
Description copied from interface: FieldProcessor
Processes the field value and generates a (possibly empty) list of ColumnarFeatures.
- Specified by:
process in interface FieldProcessor
- Parameters:
value - The field value to process.
- Returns:
A list of ColumnarFeatures.
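What counts as a match depends on the modes supplied at construction, which this page does not enumerate. As an illustration only, the sketch below uses java.util.regex to contrast three ways a regex can match a field value: against the whole value, against any substring, and through capture groups. It does not reproduce Tribuo's feature generation; the class RegexMatchSketch and its methods are hypothetical, and the empty list returned on no match is only an analogy for the "possibly empty" result of process.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

public class RegexMatchSketch {

    // True only if the entire value matches the pattern.
    public static boolean matchesWhole(Pattern p, String value) {
        return p.matcher(value).matches();
    }

    // True if any substring of the value matches the pattern.
    public static boolean containsMatch(Pattern p, String value) {
        return p.matcher(value).find();
    }

    // Capture groups from a whole-value match, or an empty list if
    // the value does not match. An optional group that did not
    // participate in the match appears as null.
    public static List<String> groups(Pattern p, String value) {
        List<String> out = new ArrayList<>();
        Matcher m = p.matcher(value);
        if (m.matches()) {
            for (int i = 1; i <= m.groupCount(); i++) {
                out.add(m.group(i));
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Pattern range = Pattern.compile("(\\d+)-(\\d+)");
        System.out.println(matchesWhole(range, "12-34"));
        System.out.println(matchesWhole(range, "id 12-34"));
        System.out.println(containsMatch(range, "id 12-34"));
        System.out.println(groups(range, "12-34"));
        System.out.println(groups(range, "no digits here"));
    }
}
```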
-
getFeatureType
Description copied from interface: FieldProcessor
Returns the feature type this FieldProcessor generates.
- Specified by:
getFeatureType in interface FieldProcessor
- Returns:
The feature type.
-
copy
Description copied from interface: FieldProcessor
Returns a copy of this FieldProcessor bound to the supplied newFieldName.
- Specified by:
copy in interface FieldProcessor
- Parameters:
newFieldName - The new field name for the copy.
- Returns:
A copy of this FieldProcessor.
-
toString
-
getProvenance
- Specified by:
getProvenance in interface com.oracle.labs.mlrg.olcut.provenance.Provenancable<com.oracle.labs.mlrg.olcut.provenance.ConfiguredObjectProvenance>
-