|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||
java.lang.Objectcom.aliasi.sentences.AbstractSentenceModel
com.aliasi.sentences.HeuristicSentenceModel
com.aliasi.sentences.IndoEuropeanSentenceModel
public class IndoEuropeanSentenceModel
An IndoEuropeanSentenceModel is a heuristic sentence
designed primarily for English. Whehter or not it balances
parentheses or forces the last token to be a boundary may be
set in the constructor. It uses the default implementation of
possible sentence starts and the following token sets:
Note that all of these sets are case insensitive.
Possible Stops ...!?"'').
Impossible Penultimates any single letter personal and professional titles, ranks, etc. commas, colon, and quotes common abbreviations directions corporate designators times, months, etc. U.S. political parties U.S. states (not ME or IN) shipping terms address abbreviations
Impossible Starts possible stops (see above) close parentheses ,;:------%
IndoEuropeanSentenceModel
with the same behavior as the model that was written.
| Constructor Summary | |
|---|---|
IndoEuropeanSentenceModel()
Construct an Indo-European sentence model that does not force the final token to be a stop and does not balance parentheses. |
|
IndoEuropeanSentenceModel(boolean forceFinalToken,
boolean balanceParentheses)
Construct an Indo-European sentence model that forces final tokens and balances parentheses according to the specified flags. |
|
| Method Summary |
|---|
| Methods inherited from class com.aliasi.sentences.HeuristicSentenceModel |
|---|
balanceParens, boundaryIndices, forceFinalStop, possibleStart |
| Methods inherited from class com.aliasi.sentences.AbstractSentenceModel |
|---|
boundaryIndices, boundaryIndices, verifyBounds, verifyTokensWhitespaces |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Constructor Detail |
|---|
public IndoEuropeanSentenceModel()
public IndoEuropeanSentenceModel(boolean forceFinalToken,
boolean balanceParentheses)
forceFinalToken - Whether the final token is always a
sentence stop.balanceParentheses - Whether sentences can stop if not all
open parentheses have been closed.
|
|||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | ||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | ||||||||