|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectcom.aliasi.chunk.TokenShapeChunker
public class TokenShapeChunker
A TokenShapeChunker uses a named-entity
TokenShapeDecoder and tokenizer factory to implement
entity detection through the chunk.Chunker interface.
A named-entity chunker is constructed from a tokenizer factory and
decoder. The tokenizer factory creates the tokens that are sent to
the decoder. The chunks have types derived from the named-entity
types found.
The tokens and whitespaces returned by the tokenizer are concatenated to form the underlying text slice of the chunks returned by the chunker. Thus a tokenizer like the stop list tokenizer or Porter stemmer tokenizer will create a character slice that does not match the input. A whitespace-normalizing tokenizer filter can be used, for example, to produce normalized text for the basis of the chunks.
| Method Summary | |
|---|---|
Chunking |
chunk(char[] cs,
int start,
int end)
Return the set of named-entity chunks derived from the underlying decoder over the tokenization of the specified character slice. |
Chunking |
chunk(CharSequence cSeq)
Return the set of named-entity chunks derived from the uderlying decoder over the tokenization of the specified character sequence. |
void |
setLog2Beam(double beamWidth)
Sets the log (base 2) beam width for the decoder. |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method Detail |
|---|
public Chunking chunk(CharSequence cSeq)
For more information on return results, see chunk(char[],int,int).
chunk in interface ChunkercSeq - Character sequence to chunk.
public Chunking chunk(char[] cs,
int start,
int end)
chunk in interface Chunkercs - Characters underlying slice.start - Index of first character in slice.end - Index of one past the last character in the slice.
public void setLog2Beam(double beamWidth)
beamWidth - Width of beam.
IllegalArgumentException - If the beam width is not
positive.
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||