public class CoNLL2000Parser extends ColumnFormat
LBJ2.parse.LinkedVector
s populated by LBJ2.nlp.seg.Token
s
representing words. Each LinkedVector
represents a sentence from the input. More
information about LinkedVector
can be found in LBJ's
online Javadoc.
A line of the input file parsed by this parser contains information about a single word of natural language text. The word itself appears first, followed by a part of speech tag and a BIO chunk tag, all separated by whitespace. If a part of speech tag is unknown, a single dash may appear instead. Words appear in the order they were found in the plain text, and sentences are separated by newlines.
Constructor and Description |
---|
CoNLL2000Parser(String file)
Constructs this parser to parse the given file.
|
Modifier and Type | Method and Description |
---|---|
Object |
next()
Produces the next object parsed from the input file; in this case, that object is guaranteed
to be a
LinkedVector populated by Token s representing a sentence. |
public CoNLL2000Parser(String file)
file
- The name of the file to parse.public Object next()
LinkedVector
populated by Token
s representing a sentence.next
in interface edu.illinois.cs.cogcomp.lbjava.parse.Parser
next
in class ColumnFormat
Copyright © 2017. All rights reserved.