public class XmlTextAnnotationMaker extends Object
| Constructor and Description |
|---|
XmlTextAnnotationMaker(TextAnnotationBuilder taBuilder,
XmlDocumentProcessor xmlProcessor)
Specifies the behavior of the XmlTextAnnotationMaker: tokenization (via the TextAnnotationBuilder),
which xml tags to use for body text and for retained attributes
|
| Modifier and Type | Method and Description |
|---|---|
XmlTextAnnotation |
createTextAnnotation(String xmlText,
String corpusId,
String docId)
A method for creating
TextAnnotation by
tokenizing the given text string. |
public XmlTextAnnotationMaker(TextAnnotationBuilder taBuilder, XmlDocumentProcessor xmlProcessor)
taBuilder - generates the sentence split and tokenized text for further processingxmlProcessor - responsible for parsing xml, extracting processable text and relevant markup infopublic XmlTextAnnotation createTextAnnotation(String xmlText, String corpusId, String docId)
TextAnnotation by
tokenizing the given text string.xmlText - Raw xml text from corpus documentcorpusId - corpus identifierdocId - text identifierCopyright © 2017. All rights reserved.