Class SimpleTextDataSource<T extends Output<T>>

All Implemented Interfaces:,<DataSourceProvenance>, Iterable<Example<T>>, ConfigurableDataSource<T>, DataSource<T>
Direct Known Subclasses:

public class SimpleTextDataSource<T extends Output<T>> extends TextDataSource<T>
A dataset for a simple data format for text classification experiments. A line in the file looks like:
 OUTPUT##Document text
Each line in the file specifies a single output and document pair. Leading and trailing spaces will be trimmed from outputs and documents. Outputs will be converted to upper case.

As with all of our text data, the file should be in UTF-8.