Adding Additional Event Handlers
In addition to
ignorableWhitespace, there are two otherContentHandlermethods that can find uses in even simple applications:setDocumentLocatorandprocessingInstruction. In this section, you'll implement those two event handlers.Identifying the Document's Location
A locator is an object that contains the information necessary to find a document. The
Locatorclass encapsulates a system ID (URL) or a public identifier (URN) or both. You would need that information if you wanted to find something relative to the current document--in the same way, for example, that an HTML browser processes anhref="anotherFile"attribute in an anchor tag. The browser uses the location of the current document to findanotherFile.You could also use the locator to print good diagnostic messages. In addition to the document's location and public identifier, the locator contains methods that give the column and line number of the most recently processed event. The
setDocumentLocatormethod, however, is called only once: at the beginning of the parse. To get the current line or column number, you would save the locator whensetDocumentLocatoris invoked and then use it in the other event-handling methods.
Note: The code discussed in this section is in
Echo04.java. Its output is inEcho04-01.txt. (The browsable version isEcho04-01.html.)
Start by removing the extra character-echoing code you added for the last example:
public void characters(char buf[], int offset, int len) throws SAXException {if (textBuffer != null) { echoText(); textBuffer = null; }String s = new String(buf, offset, len); ... }Next, add the following highlighted method to the Echo program to get the document locator and use it to echo the document's system ID.
... private String indentString = " "; // Amount to indent private int indentLevel = 0;public void setDocumentLocator(Locator l) { try { out.write("LOCATOR"); out.write("SYS ID: " + l.getSystemId() ); out.flush(); } catch (IOException e) { // Ignore errors } }public void startDocument() ...Notes:
- This method, in contrast to every other
ContentHandlermethod, does not return aSAXException. So rather than useemitfor output, this code writes directly toSystem.out. (This method is generally expected to simply save theLocatorfor later use rather than do the kind of processing that generates an exception, as here.)- The spelling of these methods is
Id, notID. So you havegetSystemIdandgetPublicId.When you compile and run the program on
slideSample01.xml, here is the significant part of the output:LOCATOR SYS ID: file:<path>/../samples/slideSample01.xml START DOCUMENT <?xml version='1.0' encoding='UTF-8'?> ...Here, it is apparent that
setDocumentLocatoris called beforestartDocument. That can make a difference if you do any initialization in the event-handling code.Handling Processing Instructions
It sometimes makes sense to code application-specific processing instructions in the XML data. In this exercise, you'll modify the Echo program to display a processing instruction contained in
slideSample02.xml.
Note: The code discussed in this section is in
Echo05.java. The file it operates on isslideSample02.xml, as described in Writing Processing Instructions . The output is inEcho05-02.txt. (The browsable versions areslideSample02-xml.htmlandEcho05-02.html.)
As you saw in Writing Processing Instructions, the format for a processing instruction is
<?targetdata?>, wheretargetis the application that is expected to do the processing, anddatais the instruction or information for it to process. The sample fileslideSample02.xmlcontains a processing instruction for a mythical slide presentation program that queries the user to find out which slides to display (technical, executive-level, or all):<slideshow ... ><!-- PROCESSING INSTRUCTION --> <?my.presentation.Program QUERY="exec, tech, all"?><!-- TITLE SLIDE -->To display that processing instruction, add the following highlighted code to the Echo application:
public void characters(char buf[], int offset, int len) ... }public void processingInstruction(String target, String data) throws SAXException { nl(); emit("PROCESS: "); emit("<?"+target+" "+data+"?>"); }private void echoText() ...When your edits are complete, compile and run the program. The relevant part of the output should look like this:
Summary
With the minor exception of
ignorableWhitespace, you have used most of theContentHandlermethods that you need to handle the most commonly useful SAX events. You'll seeignorableWhitespacea little later. Next, though, you'll get deeper insight into how you handle errors in the SAX parsing process.