Represent a document that contains xml markup. Another class --
XmlTextAnnotationMaker
, extracts a subset
of the xml source text that will be processed by NLP components and creates a TextAnnotation from it.
It also extracts xml markup that contains relevant information for use by applications. Examples of possible
text fields are "" and "
"; possible supplementary info could be the "author" attribute
in a tag such as "". Finally, it provides a StringTransformation that maps between
the xml source and the cleaned NLP-processable text. These elements comprise the information for a
XmlTextAnnotation
object.