Safari Books Online is a digital library providing on-demand subscription access to thousands of learning resources.
As discussed in chapter 5, the document content extracted by a parser is returned as XHTML SAX events to the client application. Handling these events can be complicated at times, so Tika provides a number of utility classes in the org.apache.tika.sax package for various different purposes. Table A.4 summarizes the most commonly used utility classes.