|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectcom.aliasi.tokenizer.TokenFeatureExtractor
public class TokenFeatureExtractor
A TokenFeatureExtractor produces feature vectors from
character sequences representing token counts.
The token feature extractors implement the Serializable
interface. A token feature extractor will actually be serializable
if the underlying tokenizer factory is serializable, either by
implementing the Serializable interface or the Compilable interface. If it is not, attempting to serialize the
feature extractor will throw an exception.
| Constructor Summary | |
|---|---|
TokenFeatureExtractor(TokenizerFactory factory)
Construct a token-based feature extractor from the specified tokenizer factory. |
|
| Method Summary | |
|---|---|
Map<String,Counter> |
features(CharSequence in)
Return the feature vector for the specified character sequence. |
String |
toString()
Returns a description of this token feature extractor including its contained tokenizer factory. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait |
| Constructor Detail |
|---|
public TokenFeatureExtractor(TokenizerFactory factory)
factory - Tokenizer factory to use for tokenization.| Method Detail |
|---|
public Map<String,Counter> features(CharSequence in)
features in interface FeatureExtractor<CharSequence>in - Character sequence from which to extract features.
public String toString()
toString() method of the contained tokenizer factory.
toString in class Object
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||