See: Description
Class | Description |
---|---|
Range |
A range currently being segmented.
|
UniversalTokenizer |
This class was originally written for the purpose of document indexing in an
information retrieval context (principally used in Sun Labs' Minion search
engine).
|
It was originally developed to support information retrieval and forms a useful baseline tokenizer for generating features for machine learning.
Copyright © 2015–2021 Oracle and/or its affiliates. All rights reserved.