Vakils
asked on
Parse XML using Java Dom Parser
How do I parser the XML below using DOM Parser: I need to capture Nodes between <DATA RECORD>
<?xml version="1.0" ?>
<!DOCTYPE main [
<!ELEMENT main (DATA_RECORD*)>
<!ELEMENT DATA_RECORD (NEW_UPDATE?,GUILD_STUDIO_ CODE?,STUD IO_NAME,CO NTRACT_COD E?,CONTRAC T_YEAR?,NE GOTIATED_P NH_RATE?,
EFFECTIVE_DATE?,EXPIRATION _DATE?,SIG N_DATE?,AM PTP?,JPC?, AAAA?,ANA? ,DATE_OF_U PDATE?)+>
<!ELEMENT NEW_UPDATE (#PCDATA)>
<!ELEMENT GUILD_STUDIO_CODE (#PCDATA)>
<!ELEMENT STUDIO_NAME (#PCDATA)>
<!ELEMENT CONTRACT_CODE (#PCDATA)>
<!ELEMENT CONTRACT_YEAR (#PCDATA)>
<!ELEMENT NEGOTIATED_PNH_RATE (#PCDATA)>
<!ELEMENT EFFECTIVE_DATE (#PCDATA)>
<!ELEMENT EXPIRATION_DATE (#PCDATA)>
<!ELEMENT SIGN_DATE (#PCDATA)>
<!ELEMENT AMPTP (#PCDATA)>
<!ELEMENT JPC (#PCDATA)>
<!ELEMENT AAAA (#PCDATA)>
<!ELEMENT ANA (#PCDATA)>
<!ELEMENT DATE_OF_UPDATE (#PCDATA)>
]>
<main>
<DATA_RECORD>
<NEW_UPDATE>NEW</NEW_UPDAT E>
<GUILD_STUDIO_CODE>0070058 52</GUILD_ STUDIO_COD E>
<STUDIO_NAME>New Concepts Ltd</STUDIO_NAME>
<CONTRACT_CODE>I</CONTRACT _CODE>
<CONTRACT_YEAR>1985</CONTR ACT_YEAR>
<NEGOTIATED_PNH_RATE></NEG OTIATED_PN H_RATE>
<EFFECTIVE_DATE>01-JUL-198 5</EFFECTI VE_DATE>
<EXPIRATION_DATE>30-JUN-19 87</EXPIRA TION_DATE>
<SIGN_DATE>1986/03/31 00:00:00</SIGN_DATE>
<AMPTP>N</AMPTP>
<JPC>N</JPC>
<AAAA>N</AAAA>
<ANA>N</ANA>
<DATE_OF_UPDATE>16-NOV-199 9</DATE_OF _UPDATE>
</DATA_RECORD>
<DATA_RECORD>
<NEW_UPDATE>NEW</NEW_UPDAT E>
<GUILD_STUDIO_CODE>0070065 2</GUILD_S TUDIO_CODE >
<STUDIO_NAME>Old Concepts Ltd</STUDIO_NAME>
<CONTRACT_CODE>CM</CONTRAC T_CODE>
<CONTRACT_YEAR>1993</CONTR ACT_YEAR>
<NEGOTIATED_PNH_RATE></NEG OTIATED_PN H_RATE>
<EFFECTIVE_DATE>01-MAY-199 3</EFFECTI VE_DATE>
<EXPIRATION_DATE>30-APR-19 96</EXPIRA TION_DATE>
<SIGN_DATE>1993/07/22 00:00:00</SIGN_DATE>
<AMPTP>N</AMPTP>
<JPC>N</JPC>
<AAAA>N</AAAA>
<ANA>N</ANA>
<DATE_OF_UPDATE>27-JUL-199 3</DATE_OF _UPDATE>
</DATA_RECORD>
<?xml version="1.0" ?>
<!DOCTYPE main [
<!ELEMENT main (DATA_RECORD*)>
<!ELEMENT DATA_RECORD (NEW_UPDATE?,GUILD_STUDIO_
EFFECTIVE_DATE?,EXPIRATION
<!ELEMENT NEW_UPDATE (#PCDATA)>
<!ELEMENT GUILD_STUDIO_CODE (#PCDATA)>
<!ELEMENT STUDIO_NAME (#PCDATA)>
<!ELEMENT CONTRACT_CODE (#PCDATA)>
<!ELEMENT CONTRACT_YEAR (#PCDATA)>
<!ELEMENT NEGOTIATED_PNH_RATE (#PCDATA)>
<!ELEMENT EFFECTIVE_DATE (#PCDATA)>
<!ELEMENT EXPIRATION_DATE (#PCDATA)>
<!ELEMENT SIGN_DATE (#PCDATA)>
<!ELEMENT AMPTP (#PCDATA)>
<!ELEMENT JPC (#PCDATA)>
<!ELEMENT AAAA (#PCDATA)>
<!ELEMENT ANA (#PCDATA)>
<!ELEMENT DATE_OF_UPDATE (#PCDATA)>
]>
<main>
<DATA_RECORD>
<NEW_UPDATE>NEW</NEW_UPDAT
<GUILD_STUDIO_CODE>0070058
<STUDIO_NAME>New Concepts Ltd</STUDIO_NAME>
<CONTRACT_CODE>I</CONTRACT
<CONTRACT_YEAR>1985</CONTR
<NEGOTIATED_PNH_RATE></NEG
<EFFECTIVE_DATE>01-JUL-198
<EXPIRATION_DATE>30-JUN-19
<SIGN_DATE>1986/03/31 00:00:00</SIGN_DATE>
<AMPTP>N</AMPTP>
<JPC>N</JPC>
<AAAA>N</AAAA>
<ANA>N</ANA>
<DATE_OF_UPDATE>16-NOV-199
</DATA_RECORD>
<DATA_RECORD>
<NEW_UPDATE>NEW</NEW_UPDAT
<GUILD_STUDIO_CODE>0070065
<STUDIO_NAME>Old Concepts Ltd</STUDIO_NAME>
<CONTRACT_CODE>CM</CONTRAC
<CONTRACT_YEAR>1993</CONTR
<NEGOTIATED_PNH_RATE></NEG
<EFFECTIVE_DATE>01-MAY-199
<EXPIRATION_DATE>30-APR-19
<SIGN_DATE>1993/07/22 00:00:00</SIGN_DATE>
<AMPTP>N</AMPTP>
<JPC>N</JPC>
<AAAA>N</AAAA>
<ANA>N</ANA>
<DATE_OF_UPDATE>27-JUL-199
</DATA_RECORD>
ASKER CERTIFIED SOLUTION
membership
This solution is only available to members.
To access this solution, you must be a member of Experts Exchange.
A pattern is to get nodes
ASKER
Thanks. Does xpath have advantage over DOM parser? It does get node value. I need node name too. Let's see if I can tweak code.
Does xpath have advantage over DOM parser?Yes. It requires a lot less code. Of course, they are not mutually exclusive - XPath requires a DOM parser. The point is that getting nodes from that DOM without XPath requires more code
ASKER
I tweaked the code a bit and got foll. result which I wanted. XPath is really powerful for extracting xml data using expressions. Thanks for solution and introducing me to XPath. If you can optimize on the tweak below, pl do post.
Thanks!
run:
CONTRACT_CODE : 2
CONTRACT_YEAR :
CONTRACT_TITLE : 1061
CONTRACT_DESCRIPTION : L-U NON-BRDCST(KMBZ/KLTH/KCMO
EFFECTIVE_DATE : 31-JUL-1992
EXPIRATION_DATE : 30-APR-1996
EXTENSION_OF_END_DATE :
CONTRACT_CODE : 1
CONTRACT_YEAR :
CONTRACT_TITLE : 1062
CONTRACT_DESCRIPTION : L-U NATL COMML(KMBZ/KLTH/KCMO)
EFFECTIVE_DATE : 31-JUL-1992
EXPIRATION_DATE : 30-APR-1996
EXTENSION_OF_END_DATE :
BUILD SUCCESSFUL (total time: 30 seconds)
Thanks!
if(nod.getNodeType() == Node.ELEMENT_NODE)
{
System.out.print(nodeList.item(i).getNodeName() + " : ");
if(nod.getFirstChild() != null)
{
System.out.println(nod.getFirstChild().getNodeValue());
}
else
{
System.out.println();
}
}
The result:run:
CONTRACT_CODE : 2
CONTRACT_YEAR :
CONTRACT_TITLE : 1061
CONTRACT_DESCRIPTION : L-U NON-BRDCST(KMBZ/KLTH/KCMO
EFFECTIVE_DATE : 31-JUL-1992
EXPIRATION_DATE : 30-APR-1996
EXTENSION_OF_END_DATE :
CONTRACT_CODE : 1
CONTRACT_YEAR :
CONTRACT_TITLE : 1062
CONTRACT_DESCRIPTION : L-U NATL COMML(KMBZ/KLTH/KCMO)
EFFECTIVE_DATE : 31-JUL-1992
EXPIRATION_DATE : 30-APR-1996
EXTENSION_OF_END_DATE :
BUILD SUCCESSFUL (total time: 30 seconds)
ASKER
Thanks.
ASKER
Oh, and changed arg too, 2nd one '//DATA_RECORD/*'
Yes, that's OK if for some reason you want the Node instances instead of the content
ASKER
Can you please explain why I need pattern? All I need is nodes and their values.