public class XmlTextAnnotationMaker extends Object
Constructor and Description |
---|
XmlTextAnnotationMaker(TextAnnotationBuilder taBuilder, XmlDocumentProcessor xmlProcessor) Specifies the behavior of the XmlTextAnnotationMaker: tokenization (via the TextAnnotationBuilder), and which XML tags to use for body text and for retained attributes. |
Modifier and Type | Method and Description |
---|---|
XmlTextAnnotation | createTextAnnotation(String xmlText, String corpusId, String docId) A method for creating a TextAnnotation by tokenizing the given text string. |
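
The constructor and method summarized above suggest a two-step calling pattern: build the maker from a tokenizing builder and an XML processor, then call createTextAnnotation with the raw XML, a corpus id, and a document id. The following is only a minimal sketch of that pattern. So that it compiles without the library, every type is declared locally as a hypothetical stand-in; the method names `tokenize` and `extractText`, the fields on the stub `XmlTextAnnotation`, and the tag-stripping and whitespace-splitting logic are all assumptions made for illustration, not the library's actual API or behavior.

```java
import java.util.Arrays;
import java.util.List;

public class XmlTextAnnotationMakerSketch {

    // Hypothetical stand-in for the library's TextAnnotationBuilder.
    interface TextAnnotationBuilder {
        List<String> tokenize(String text); // assumed method name, illustration only
    }

    // Hypothetical stand-in for the library's XmlDocumentProcessor.
    interface XmlDocumentProcessor {
        String extractText(String xmlText); // assumed method name, illustration only
    }

    // Hypothetical stand-in for XmlTextAnnotation; the fields are assumptions.
    static final class XmlTextAnnotation {
        final String corpusId;
        final String docId;
        final List<String> tokens;

        XmlTextAnnotation(String corpusId, String docId, List<String> tokens) {
            this.corpusId = corpusId;
            this.docId = docId;
            this.tokens = tokens;
        }
    }

    // Stub that mirrors only the documented constructor and method signatures.
    static final class XmlTextAnnotationMaker {
        private final TextAnnotationBuilder taBuilder;
        private final XmlDocumentProcessor xmlProcessor;

        XmlTextAnnotationMaker(TextAnnotationBuilder taBuilder, XmlDocumentProcessor xmlProcessor) {
            this.taBuilder = taBuilder;
            this.xmlProcessor = xmlProcessor;
        }

        XmlTextAnnotation createTextAnnotation(String xmlText, String corpusId, String docId) {
            // Step 1: the XML processor extracts the processable text.
            String cleanText = xmlProcessor.extractText(xmlText);
            // Step 2: the builder tokenizes that text.
            return new XmlTextAnnotation(corpusId, docId, taBuilder.tokenize(cleanText));
        }
    }

    static XmlTextAnnotation demo() {
        // Toy components: whitespace tokenization and regex tag stripping.
        TextAnnotationBuilder whitespaceBuilder = text -> Arrays.asList(text.trim().split("\\s+"));
        XmlDocumentProcessor tagStripper = xml -> xml.replaceAll("<[^>]+>", " ");

        XmlTextAnnotationMaker maker = new XmlTextAnnotationMaker(whitespaceBuilder, tagStripper);
        return maker.createTextAnnotation(
                "<doc><text>Hello XML world</text></doc>", "demoCorpus", "doc-001");
    }

    public static void main(String[] args) {
        XmlTextAnnotation xta = demo();
        System.out.println(xta.docId + ": " + xta.tokens);
    }
}
```

In real use the builder and processor would be the library's own implementations, configured with the tags to treat as body text and the attributes to retain; the stubs here only show the order in which the two collaborators are consulted.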
public XmlTextAnnotationMaker(TextAnnotationBuilder taBuilder, XmlDocumentProcessor xmlProcessor)
taBuilder - generates the sentence split and tokenized text for further processing
xmlProcessor - responsible for parsing XML, extracting processable text and relevant markup info

public XmlTextAnnotation createTextAnnotation(String xmlText, String corpusId, String docId)
A method for creating a TextAnnotation by tokenizing the given text string.
xmlText - raw XML text from the corpus document
corpusId - corpus identifier
docId - text identifier

Copyright © 2017. All rights reserved.