Package | Description |
---|---|
edu.illinois.cs.cogcomp.lbjava.nlp | |
edu.illinois.cs.cogcomp.lbjava.nlp.seg | |
edu.illinois.cs.cogcomp.ner.LbjTagger | |
edu.illinois.cs.cogcomp.pos |
Modifier and Type | Method and Description |
---|---|
static Word |
POSBracketToVector.parsePOSBracketForm(String text,
Word previous)
Given textual input in the format shown below, this method parses and
returns the
Word that the text represents. |
Modifier and Type | Method and Description |
---|---|
protected boolean |
SentenceSplitter.boundary(int index,
Word word,
Word next1,
Word next2)
Determines whether the given punctuation represents the end of a
sentence based on elements of the paragraph immediately surrounding the
punctuation.
|
protected boolean |
SentenceSplitter.endsWithCloseBracket(Word w)
Determines whether the argument ends with any of the following
varieties of open bracket: ) } ] -RBR- .
|
protected boolean |
SentenceSplitter.endsWithQuote(Word w)
Determines whether the argument ends with any of the following varieties
of closing quote: ' '' ''' " '" .
|
protected boolean |
SentenceSplitter.hasStartMarker(Word w)
Determines whether the argument contains any of the following varieties
of "start marker" at its beginning: an open quote, and open bracket, or
a capital letter.
|
protected boolean |
SentenceSplitter.isClose(Word w)
Determines whether the argument represents a closing bracket or a
closing quote.
|
protected boolean |
SentenceSplitter.isClosingBracket(Word w)
Determines whether the argument is exactly equal to any of the following
varieties of closing bracket: ) } ] -RBR- .
|
protected boolean |
SentenceSplitter.isClosingQuote(Word w)
Determines whether the argument is exactly equal to any of the following
varieties of closing quote: ' '' ''' " '" .
|
protected boolean |
SentenceSplitter.isHonorific(Word w)
Determines whether the argument is exactly equal to any of the honorifics
listed below.
|
protected boolean |
SentenceSplitter.isTerminal(Word w)
Determines whether the argument is exactly equal to any of the following
terminal abbreviations: Esq Jr Sr M.D Ph.D .
|
protected boolean |
SentenceSplitter.isTimeZone(Word w)
Determines whether the argument is a United States time zone
abbreviation (AST, CST, EST, HST, MST, PST, ADT, CDT, EDT, HDT, MDT,
PDT, or UTC-11).
|
static Word |
POSBracketToVector.parsePOSBracketForm(String text,
Word previous)
Given textual input in the format shown below, this method parses and
returns the
Word that the text represents. |
protected boolean |
SentenceSplitter.sentenceBeginner(Word word)
Simple check to see if the given word can reliably be identified as the
first word of a sentence.
|
protected boolean |
SentenceSplitter.startsWithOpenBracket(Word w)
Determines whether the argument starts with any of the following
varieties of open bracket: ( { [ -LBR- .
|
protected boolean |
SentenceSplitter.startsWithOpenQuote(Word w)
Determines whether the argument starts with any of the following
varieties of open quote: ` `` ``` " "` .
|
protected boolean |
SentenceSplitter.startsWithQuote(Word w)
Determines whether the first character of the argument is any of the
three varieties of quotes: ' " `.
|
Constructor and Description |
---|
Word(String f,
String pos,
String l,
String sense,
Word p,
int start,
int end)
This constructor is useful when the sentence is being parsed forwards.
|
Word(String f,
String pos,
Word p)
This constructor is useful when the sentence is being parsed forwards.
|
Word(String f,
String pos,
Word p,
int start,
int end)
This constructor is useful when the sentence is being parsed forwards.
|
Word(String f,
Word p)
This constructor is useful when the sentence is being parsed forwards.
|
Word(String f,
Word p,
int start,
int end)
This constructor is useful when the sentence is being parsed forwards.
|
Modifier and Type | Class and Description |
---|---|
class |
Token
Simple extension of the
Word class from LBJava's NLP
library. |
Constructor and Description |
---|
Token(Word w,
Token p,
String type)
A
Token can be constructed from a Word
object representing the same word, a Token representing
the previous word in the sentence, and the type label found in the data. |
Modifier and Type | Class and Description |
---|---|
class |
NEWord |
Constructor and Description |
---|
NEWord(Word w,
NEWord p,
String type)
An
NEWord can be constructed from a Word object representing the
same word, an NEWord representing the previous word in the sentence, and the
named entity type label found in the data. |
Modifier and Type | Method and Description |
---|---|
Set<String> |
MikheevLearner.allowableTags(Word word)
Returns the set of tags that the given word's suffix has been observed with, or a reasonable
default if the suffix has never been observed.
|
Copyright © 2017. All rights reserved.