| Class | Description |
|---|---|
| Affixes |
This class implements a classifier that takes a
Word as input and
generates features representing the prefixes and suffixes of the input
word. |
| Capitalization |
This class implements a classifier that takes a
Word as input and
generates Boolean features representing the capitalizations of the words
in a [-2, +2] window around the input word. |
| ColumnFormat |
This parser returns arrays of
Strings representing the rows
of a file in column format. |
| DataReader |
A generic data reader class that can be used to interface with the `LBJava`
Parser code. |
| Forms |
This class implements a classifier that takes a
Word as input and
generates features representing the forms of the words in a [-2, +2]
window around the input word. |
| NLDocument |
Use this class to represent a natural language document.
|
| POS |
This class converts the
String names of POS tags into
discrete integer values. |
| POSBracketToVector |
Use this parser to return
LinkedVector objects representing
sentences given file names of POS bracket form files to parse. |
| POSBracketToWord | Deprecated
As of LBJava release 2.0.4, the functionality of this class has
been superceded by the
ChildrenFromVectors
parser used in conjunction with POSBracketToVector. |
| Sentence |
This representation of a sentence simply stores the entire text of the
sentence in a string.
|
| SentenceSplitter |
Use this class to extract sentences from plain text.
|
| StringArraysToWords | |
| Word |
Implementation of a word for natural language processing.
|
| WordsInDocumentByDirectory |
This parser creates and returns labeled arrays of
Strings,
each representing all the words in a document. |
| WordSplitter |
This parser takes the plain, unannotated
Sentences returned by
another parser (e.g., SentenceSplitter) and splits them into
Word objects. |
| WordTypeInformation |
This class implements a classifier that takes a
Word as input and
generates Boolean features representing interesting information about the
forms of the words in a [-2, +2] window around the input word. |
Copyright © 2017. All rights reserved.