Word (illinois-cogcomp-nlp 3.1.29 API)

java.lang.Object
- edu.illinois.cs.cogcomp.lbjava.parse.LinkedChild
- - edu.illinois.cs.cogcomp.lbjava.nlp.Word

All Implemented Interfaces:

Serializable, Cloneable

Direct Known Subclasses:

NEWord, Token
```
public class Word
extends edu.illinois.cs.cogcomp.lbjava.parse.LinkedChild
```
Implementation of a word for natural language processing. Please note that in general, one can only count on the form and capitalized fields described below having meaningful values. The form field can be assumed to be filled in because it's hard to imagine a situation in which a Word object should be created without any knowledge of how that word appeared in text. The capitalized field is computed from the form by this class' constructor.

All other fields must be obtained or computed externally. Space is provided for them in this class' implementation as a convenience, since we expect the user will make frequent use of these fields.

This class extends from LinkedChild. Of course, this means that objects of this class contain references to both the previous and the next word in the sentence. Constructors are available that take the previous word as an argument, setting that reference. Thus, a useful technique for constructing all the words in a sentence will involve code that looks like this (where form is a String):

Word current = new Word(form); a loop of some sort { current.next = new Word(form, current); current = current.next; }
Author: Nick Rizzolo See Also: Serialized Form










Field Summary

Fields 

Modifier and Type
Field and Description


boolean
capitalized
Whether or not the word is capitalized is determined automatically by
 the constructor.



String
form
The actual text from the corpus that represents the word.



String
lemma
The base form of the word.



String
partOfSpeech
Names the part of speech of this word.



String
wordSense
An indication of the meaning or usage of this instance of this word.







Fields inherited from class edu.illinois.cs.cogcomp.lbjava.parse.LinkedChild
end, label, next, parent, previous, start








Constructor Summary

Constructors 

Constructor and Description


Word(String f)
When all that is known is the spelling of the word.



Word(String f,
    int start,
    int end)
When you have offset information.



Word(String f,
    String pos)
Sets the actual text and the part of speech.



Word(String f,
    String pos,
    int start,
    int end)
When you have offset information.



Word(String f,
    String pos,
    String l,
    String sense,
    Word p,
    int start,
    int end)
This constructor is useful when the sentence is being parsed forwards.



Word(String f,
    String pos,
    Word p)
This constructor is useful when the sentence is being parsed forwards.



Word(String f,
    String pos,
    Word p,
    int start,
    int end)
This constructor is useful when the sentence is being parsed forwards.



Word(String f,
    Word p)
This constructor is useful when the sentence is being parsed forwards.



Word(String f,
    Word p,
    int start,
    int end)
This constructor is useful when the sentence is being parsed forwards.










Method Summary

All Methods Instance Methods Concrete Methods 

Modifier and Type
Method and Description


String
toString()
The string representation of a word is its POS bracket form, or, if the
 part of speech is not available, it is just the spelling of the word.







Methods inherited from class edu.illinois.cs.cogcomp.lbjava.parse.LinkedChild
clone





Methods inherited from class java.lang.Object
equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, wait














Field Detail





form
public String form
The actual text from the corpus that represents the word.







capitalized
public boolean capitalized
Whether or not the word is capitalized is determined automatically by
 the constructor.







partOfSpeech
public String partOfSpeech
Names the part of speech of this word.







lemma
public String lemma
The base form of the word.







wordSense
public String wordSense
An indication of the meaning or usage of this instance of this word.









Constructor Detail





Word
public Word(String f)
When all that is known is the spelling of the word.

Parameters:
f - The actual text of the word.








Word
public Word(String f,
            String pos)
Sets the actual text and the part of speech.

Parameters:
f - The actual text of the word.
pos - A token representing the word's part of speech.








Word
public Word(String f,
            Word p)
This constructor is useful when the sentence is being parsed forwards.

Parameters:
f - The actual text of the word.
p - The word that came before this one in the sentence.








Word
public Word(String f,
            String pos,
            Word p)
This constructor is useful when the sentence is being parsed forwards.

Parameters:
f - The actual text of the word.
pos - A token representing the word's part of speech.
p - The word that came before this one in the sentence.








Word
public Word(String f,
            int start,
            int end)
When you have offset information.

Parameters:
f - The actual text of the word.
start - The offset into the parent document at which the first
              character of this word is found.
end - The offset into the parent document at which the last
              character of this word is found.








Word
public Word(String f,
            String pos,
            int start,
            int end)
When you have offset information.

Parameters:
f - The actual text of the word.
pos - A token representing the word's part of speech.
start - The offset into the parent document at which the first
              character of this word is found.
end - The offset into the parent document at which the last
              character of this word is found.








Word
public Word(String f,
            Word p,
            int start,
            int end)
This constructor is useful when the sentence is being parsed forwards.

Parameters:
f - The actual text of the word.
p - The word that came before this one in the sentence.
start - The offset into the parent document at which the first
              character of this word is found.
end - The offset into the parent document at which the last
              character of this word is found.








Word
public Word(String f,
            String pos,
            Word p,
            int start,
            int end)
This constructor is useful when the sentence is being parsed forwards.

Parameters:
f - The actual text of the word.
pos - A token representing the word's part of speech.
p - The word that came before this one in the sentence.
start - The offset into the parent document at which the first
              character of this word is found.
end - The offset into the parent document at which the last
              character of this word is found.








Word
public Word(String f,
            String pos,
            String l,
            String sense,
            Word p,
            int start,
            int end)
This constructor is useful when the sentence is being parsed forwards.

Parameters:
f - The actual text of the word.
pos - A token representing the word's part of speech.
l - The base form of the word.
sense - The sense of the word.
p - The word that came before this one in the sentence.
start - The offset into the parent document at which the first
              character of this word is found.
end - The offset into the parent document at which the last
              character of this word is found.










Method Detail





toString
public String toString()
The string representation of a word is its POS bracket form, or, if the
 part of speech is not available, it is just the spelling of the word.
 Note that the POS bracket form of a word also entails displaying left
 brackets ("(", "[", and "{") as
 "-LRB-" and right brackets (")",
 "]", "}") as "-RRB-".

Overrides:
toString in class Object
Returns:
The POS bracket form of this word, or just the spelling of the
 word if the part of speech is not available.

Modifier and Type	Field and Description
`boolean`	`capitalized` Whether or not the word is capitalized is determined automatically by the constructor.
`String`	`form` The actual text from the corpus that represents the word.
`String`	`lemma` The base form of the word.
`String`	`partOfSpeech` Names the part of speech of this word.
`String`	`wordSense` An indication of the meaning or usage of this instance of this word.

Class Word

Field Summary

Fields inherited from class edu.illinois.cs.cogcomp.lbjava.parse.LinkedChild

Constructor Summary

Method Summary

Methods inherited from class edu.illinois.cs.cogcomp.lbjava.parse.LinkedChild

Methods inherited from class java.lang.Object

Field Detail

form

capitalized

partOfSpeech

lemma

wordSense

Constructor Detail

Word

Word

Word

Word

Word

Word

Word

Word

Word

Method Detail

toString