public class CoNLL2000Parser extends ColumnFormat
LBJ2.parse.LinkedVectors populated by LBJ2.nlp.seg.Tokens
representing words. Each LinkedVector represents a sentence from the input. More
information about LinkedVector can be found in LBJ's
online Javadoc.
A line of the input file parsed by this parser contains information about a single word of natural language text. The word itself appears first, followed by a part of speech tag and a BIO chunk tag, all separated by whitespace. If a part of speech tag is unknown, a single dash may appear instead. Words appear in the order they were found in the plain text, and sentences are separated by newlines.
| Constructor and Description |
|---|
CoNLL2000Parser(String file)
Constructs this parser to parse the given file.
|
| Modifier and Type | Method and Description |
|---|---|
Object |
next()
Produces the next object parsed from the input file; in this case, that object is guaranteed
to be a
LinkedVector populated by Tokens representing a sentence. |
public CoNLL2000Parser(String file)
file - The name of the file to parse.public Object next()
LinkedVector populated by Tokens representing a sentence.next in interface edu.illinois.cs.cogcomp.lbjava.parse.Parsernext in class ColumnFormatCopyright © 2017. All rights reserved.