Link to home
Start Free TrialLog in
Avatar of Vakils
VakilsFlag for United States of America

asked on

Parse XML using Java Dom Parser

How do I parser the XML below using DOM Parser: I need to capture Nodes between <DATA RECORD>
<?xml version="1.0" ?>
<!DOCTYPE main [
  <!ELEMENT main (DATA_RECORD*)>
  <!ELEMENT DATA_RECORD (NEW_UPDATE?,GUILD_STUDIO_CODE?,STUDIO_NAME,CONTRACT_CODE?,CONTRACT_YEAR?,NEGOTIATED_PNH_RATE?,
  EFFECTIVE_DATE?,EXPIRATION_DATE?,SIGN_DATE?,AMPTP?,JPC?,AAAA?,ANA?,DATE_OF_UPDATE?)+>
  <!ELEMENT NEW_UPDATE (#PCDATA)>
  <!ELEMENT GUILD_STUDIO_CODE (#PCDATA)>
  <!ELEMENT STUDIO_NAME (#PCDATA)>
  <!ELEMENT CONTRACT_CODE (#PCDATA)>
  <!ELEMENT CONTRACT_YEAR (#PCDATA)>
  <!ELEMENT NEGOTIATED_PNH_RATE (#PCDATA)>
  <!ELEMENT EFFECTIVE_DATE (#PCDATA)>
  <!ELEMENT EXPIRATION_DATE (#PCDATA)>
  <!ELEMENT SIGN_DATE (#PCDATA)>
  <!ELEMENT AMPTP (#PCDATA)>
  <!ELEMENT JPC (#PCDATA)>
  <!ELEMENT AAAA (#PCDATA)>
  <!ELEMENT ANA (#PCDATA)>
  <!ELEMENT DATE_OF_UPDATE (#PCDATA)>
]>
<main>
  <DATA_RECORD>
    <NEW_UPDATE>NEW</NEW_UPDATE>
    <GUILD_STUDIO_CODE>007005852</GUILD_STUDIO_CODE>
    <STUDIO_NAME>New Concepts Ltd</STUDIO_NAME>
    <CONTRACT_CODE>I</CONTRACT_CODE>
    <CONTRACT_YEAR>1985</CONTRACT_YEAR>
    <NEGOTIATED_PNH_RATE></NEGOTIATED_PNH_RATE>
    <EFFECTIVE_DATE>01-JUL-1985</EFFECTIVE_DATE>
    <EXPIRATION_DATE>30-JUN-1987</EXPIRATION_DATE>
    <SIGN_DATE>1986/03/31 00:00:00</SIGN_DATE>
    <AMPTP>N</AMPTP>
    <JPC>N</JPC>
    <AAAA>N</AAAA>
    <ANA>N</ANA>
    <DATE_OF_UPDATE>16-NOV-1999</DATE_OF_UPDATE>
  </DATA_RECORD>
  <DATA_RECORD>
    <NEW_UPDATE>NEW</NEW_UPDATE>
    <GUILD_STUDIO_CODE>00700652</GUILD_STUDIO_CODE>
    <STUDIO_NAME>Old Concepts Ltd</STUDIO_NAME>
    <CONTRACT_CODE>CM</CONTRACT_CODE>
    <CONTRACT_YEAR>1993</CONTRACT_YEAR>
    <NEGOTIATED_PNH_RATE></NEGOTIATED_PNH_RATE>
    <EFFECTIVE_DATE>01-MAY-1993</EFFECTIVE_DATE>
    <EXPIRATION_DATE>30-APR-1996</EXPIRATION_DATE>
    <SIGN_DATE>1993/07/22 00:00:00</SIGN_DATE>
    <AMPTP>N</AMPTP>
    <JPC>N</JPC>
    <AAAA>N</AAAA>
    <ANA>N</ANA>
    <DATE_OF_UPDATE>27-JUL-1993</DATE_OF_UPDATE>
  </DATA_RECORD>
ASKER CERTIFIED SOLUTION
Avatar of CEHJ
CEHJ
Flag of United Kingdom of Great Britain and Northern Ireland image

Link to home
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
Start Free Trial
Avatar of Vakils

ASKER

Hi,
Can you please explain why I need pattern? All I need is nodes and their values.
A pattern is to get nodes
Avatar of Vakils

ASKER

Thanks. Does xpath have advantage over DOM parser?  It does get node value. I need node name too. Let's see if I can tweak code.
Does xpath have advantage over DOM parser?
Yes. It requires a lot less code. Of course, they are not mutually exclusive - XPath requires a DOM parser. The point is that getting nodes from that DOM without XPath requires more code
Avatar of Vakils

ASKER

I tweaked the code a bit and got foll. result which I wanted. XPath is really powerful for extracting xml data using expressions. Thanks for solution and introducing me to XPath. If you can optimize on the tweak below, pl do post.
Thanks!
      
                 if(nod.getNodeType() == Node.ELEMENT_NODE)
                {
                    System.out.print(nodeList.item(i).getNodeName() + " : "); 
                    if(nod.getFirstChild() != null)
                    {
                        System.out.println(nod.getFirstChild().getNodeValue());
                    }
                    else
                    {
                        System.out.println();
                    }
                }

Open in new window

The result:
run:
CONTRACT_CODE : 2
CONTRACT_YEAR :
CONTRACT_TITLE : 1061
CONTRACT_DESCRIPTION : L-U NON-BRDCST(KMBZ/KLTH/KCMO
EFFECTIVE_DATE : 31-JUL-1992
EXPIRATION_DATE : 30-APR-1996
EXTENSION_OF_END_DATE :
CONTRACT_CODE : 1
CONTRACT_YEAR :
CONTRACT_TITLE : 1062
CONTRACT_DESCRIPTION : L-U NATL COMML(KMBZ/KLTH/KCMO)
EFFECTIVE_DATE : 31-JUL-1992
EXPIRATION_DATE : 30-APR-1996
EXTENSION_OF_END_DATE :
BUILD SUCCESSFUL (total time: 30 seconds)
Avatar of Vakils

ASKER

Thanks.
Avatar of Vakils

ASKER

Oh, and changed arg too, 2nd one   '//DATA_RECORD/*'
Yes, that's OK if for some reason you want the Node instances instead of the content