Uses of Interface
org.tribuo.data.text.DocumentPreprocessor
Packages that use DocumentPreprocessor
Package
Description
Provides implementations of text data processors.
Uses of DocumentPreprocessor in org.tribuo.data.text
Fields in org.tribuo.data.text with type parameters of type DocumentPreprocessor

Modifier and Type: protected List<DocumentPreprocessor>
Field: DirectoryFileSource.preprocessors
Description: Document preprocessors that should be run on the documents that make up this data set.

Modifier and Type: protected List<DocumentPreprocessor>
Field: TextDataSource.preprocessors
Description: Document preprocessors that should be run on the documents that make up this data set.

Constructors in org.tribuo.data.text with parameters of type DocumentPreprocessor

Constructor: DirectoryFileSource(Path dataDir, OutputFactory<T> outputFactory, TextFeatureExtractor<T> extractor, DocumentPreprocessor... preprocessors)
Description: Creates a data source that will use the given feature extractor and document preprocessors on the data read from the files in the directories representing classes.

Constructor: TextDataSource(File file, OutputFactory<T> outputFactory, TextFeatureExtractor<T> extractor, DocumentPreprocessor... preprocessors)
Description: Creates a text data set by reading it from a file.

Constructor: TextDataSource(Path path, OutputFactory<T> outputFactory, TextFeatureExtractor<T> extractor, DocumentPreprocessor... preprocessors)
Description: Creates a text data set by reading it from a path.
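The fields and constructors above describe a data source that holds a list of document preprocessors and runs them on each document. The following is a minimal sketch of that chaining idea, not Tribuo's implementation. It assumes the interface exposes a single String-to-String method (called processDoc here) and that preprocessors run in list order. DocPreprocessor is a hypothetical local stand-in so the example compiles without Tribuo on the classpath.

```java
import java.util.List;
import java.util.Locale;

public class PreprocessorChainSketch {

    /** Hypothetical stand-in for DocumentPreprocessor; the method name is an assumption. */
    interface DocPreprocessor {
        String processDoc(String doc);
    }

    /** Applies each preprocessor in list order, feeding each output into the next. */
    static String applyAll(List<DocPreprocessor> preprocessors, String doc) {
        String current = doc;
        for (DocPreprocessor p : preprocessors) {
            current = p.processDoc(current);
        }
        return current;
    }

    public static void main(String[] args) {
        // Two illustrative preprocessors: trim whitespace, then lowercase.
        List<DocPreprocessor> preprocessors = List.of(
            d -> d.trim(),
            d -> d.toLowerCase(Locale.ROOT)
        );
        System.out.println(applyAll(preprocessors, "  Hello World  ")); // prints "hello world"
    }
}
```

With the real library, the varargs DocumentPreprocessor... parameter on the constructors listed above would presumably play the role of the list passed to applyAll in this sketch.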
Uses of DocumentPreprocessor in org.tribuo.data.text.impl
Classes in org.tribuo.data.text.impl that implement DocumentPreprocessor

Modifier and Type: class
Description: A document preprocessor which uppercases or lowercases the input.

Modifier and Type: class
Description: A document preprocessor for 20 newsgroup data.

Modifier and Type: final class
Description: A simple document preprocessor which applies regular expressions to the input.
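To illustrate the kinds of implementations described above, here is a hedged sketch of a casing preprocessor and a regular-expression preprocessor. These are not the Tribuo classes: the names CasingSketch, RegexSketch, Casing, and DocPreprocessor are hypothetical, the processDoc method name is an assumption, and replacing each match with a fixed string is only one plausible reading of "applies regular expressions to the input".

```java
import java.util.Locale;
import java.util.regex.Pattern;

public class PreprocessorImplSketch {

    /** Hypothetical stand-in for DocumentPreprocessor; the method name is an assumption. */
    interface DocPreprocessor {
        String processDoc(String doc);
    }

    /** Hypothetical option selecting the casing operation. */
    enum Casing { LOWER, UPPER }

    /** Uppercases or lowercases the whole document. */
    static final class CasingSketch implements DocPreprocessor {
        private final Casing casing;

        CasingSketch(Casing casing) {
            this.casing = casing;
        }

        @Override
        public String processDoc(String doc) {
            return casing == Casing.LOWER
                ? doc.toLowerCase(Locale.ROOT)
                : doc.toUpperCase(Locale.ROOT);
        }
    }

    /** Replaces every match of one regular expression with a fixed string (assumed behavior). */
    static final class RegexSketch implements DocPreprocessor {
        private final Pattern pattern;
        private final String replacement;

        RegexSketch(String regex, String replacement) {
            this.pattern = Pattern.compile(regex);
            this.replacement = replacement;
        }

        @Override
        public String processDoc(String doc) {
            return pattern.matcher(doc).replaceAll(replacement);
        }
    }

    public static void main(String[] args) {
        DocPreprocessor lower = new CasingSketch(Casing.LOWER);
        DocPreprocessor digits = new RegexSketch("[0-9]+", "<NUM>");
        System.out.println(digits.processDoc(lower.processDoc("Call 555 NOW"))); // prints "call <NUM> now"
    }
}
```

Precompiling the Pattern in the constructor avoids recompiling the expression for every document, which matters when a data source runs the preprocessor over a large corpus.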