com.aliasi.tokenizer
Class NormalizeWhiteSpaceFilterTokenizer
java.lang.Object
com.aliasi.tokenizer.Tokenizer
com.aliasi.tokenizer.FilterTokenizer
com.aliasi.tokenizer.NormalizeWhiteSpaceFilterTokenizer
- All Implemented Interfaces:
- Iterable<String>
Deprecated. Use WhitespaceNormTokenizerFactory instead.
@Deprecated
public class NormalizeWhiteSpaceFilterTokenizer
- extends FilterTokenizer
A NormalizeWhiteSpaceFilterTokenizer reduces each non-empty
whitespace string returned by the contained tokenizer to a single
space, leaving empty whitespace strings unchanged.
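The normalization rule can be illustrated without LingPipe on the classpath. The following is a minimal standalone sketch, not LingPipe source code; the class name WhitespaceNormSketch and the method normalize are hypothetical names chosen for illustration.

```java
// Standalone illustration of the normalization rule; not LingPipe source code.
final class WhitespaceNormSketch {

    // Hypothetical helper: maps a whitespace string to its normalized form.
    static String normalize(String whitespace) {
        // Empty whitespace stays empty; any non-empty whitespace becomes one space.
        return whitespace.isEmpty() ? "" : " ";
    }

    public static void main(String[] args) {
        String[] samples = { "", " ", "   ", "\t\n  " };
        for (String ws : samples) {
            // Brackets make the (possibly empty) result visible.
            System.out.println("[" + normalize(ws) + "]");
        }
    }
}
```

Only the length of the input matters here: the sketch assumes the argument is already a whitespace string, as produced by a tokenizer, and does not inspect individual characters.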
- Since:
- LingPipe 1.0
- Version:
- 3.8
- Author:
- Bob Carpenter
NormalizeWhiteSpaceFilterTokenizer
@Deprecated
public NormalizeWhiteSpaceFilterTokenizer(Tokenizer tokenizer)
- Deprecated. Use WhitespaceNormTokenizerFactory or
ModifyTokenTokenizerFactory.modify(Tokenizer) instead.
- Construct a filter tokenizer that normalizes whitespace,
using the specified contained tokenizer.
- Parameters:
tokenizer - Contained tokenizer.
nextWhitespace
public String nextWhitespace()
- Deprecated.
- Returns the next whitespace, which is either the single-space
string Strings.SINGLE_SPACE_STRING (if the contained tokenizer's
whitespace is non-empty) or the empty string Strings.EMPTY_STRING.
- Overrides:
nextWhitespace in class FilterTokenizer
- Returns:
- Next whitespace.
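The filter pattern behind this method can be sketched in plain Java. This is an illustration under stated assumptions, not the LingPipe implementation: SketchTokenizer, NonSpaceTokenizer, Filter, and rebuild are hypothetical stand-ins, and the contained tokenizer here simply treats maximal runs of non-whitespace characters as tokens.

```java
// Standalone sketch of a whitespace-normalizing filter; all names are hypothetical.
final class NormalizingFilterSketch {

    // Hypothetical stand-in for a tokenizer; not the LingPipe Tokenizer class.
    abstract static class SketchTokenizer {
        abstract String nextToken();       // returns null when no tokens remain
        abstract String nextWhitespace();  // whitespace before the next token
    }

    // Hypothetical contained tokenizer: tokens are runs of non-whitespace.
    static final class NonSpaceTokenizer extends SketchTokenizer {
        private final String text;
        private int pos = 0;

        NonSpaceTokenizer(String text) { this.text = text; }

        @Override
        String nextWhitespace() {
            int start = pos;
            while (pos < text.length() && Character.isWhitespace(text.charAt(pos))) pos++;
            return text.substring(start, pos);
        }

        @Override
        String nextToken() {
            nextWhitespace(); // skip any whitespace not yet consumed
            if (pos == text.length()) return null;
            int start = pos;
            while (pos < text.length() && !Character.isWhitespace(text.charAt(pos))) pos++;
            return text.substring(start, pos);
        }
    }

    // The filter: tokens pass through, whitespace is normalized.
    static final class Filter extends SketchTokenizer {
        private final SketchTokenizer contained;

        Filter(SketchTokenizer contained) { this.contained = contained; }

        @Override
        String nextToken() { return contained.nextToken(); }

        @Override
        String nextWhitespace() {
            // Non-empty whitespace collapses to one space; empty stays empty.
            return contained.nextWhitespace().isEmpty() ? "" : " ";
        }
    }

    // Reassembles the text from the filtered whitespace and token stream.
    static String rebuild(String text) {
        SketchTokenizer t = new Filter(new NonSpaceTokenizer(text));
        StringBuilder sb = new StringBuilder(t.nextWhitespace());
        String token;
        while ((token = t.nextToken()) != null) {
            sb.append(token).append(t.nextWhitespace());
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        System.out.println("[" + rebuild("  a \t b\n\nc") + "]"); // prints "[ a b c]"
    }
}
```

Delegating nextToken unchanged and overriding only nextWhitespace keeps the filter independent of how the contained tokenizer defines its tokens.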