Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people, just like you, are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
Solved

XMLHTTP - screen scrape

Posted on 2003-10-27
2
963 Views
Last Modified: 2007-12-19
Hi Experts,

I have the following xml file called "contacts.xml"

<xmp>
<?xml version="1.0"?>
<contacts>
     <contact>
          <field id="firstName" taborder="1">
               <field_value>richard</field_value>
          </field>
          <field id="lastName" taborder="2">
               <field_value>jones</field_value>
          </field>
          <field id="address1" taborder="3">
               <field_value>16 some street</field_value>
          </field>
          <field id="address2" taborder="4">
               <field_value>london</field_value>
          </field>
          <field id="phone" taborder="5">
               <field_value>123456</field_value>
          </field>
          <field id="email" taborder="6">
               <field_value>someemail</field_value>
          </field>
     </contact>
</contacts>
</xmp>


And I am loading it into a page called "scrape1.asp" which uses the "microsoft.XMLHTTP" object to retieve the XML data as follows:


<%@ Language = VBScript %>
<%
Response.Buffer = True
Dim objXMLHTTP, xml

Set xml = Server.CreateObject("Microsoft.XMLHTTP")
' Or, for version 3.0 of XMLHTTP, use:
' Set xml = Server.CreateObject("MSXML2.ServerXMLHTTP")

xml.Open "GET", "http://myurl/xmlTests/contact.xml", False

xml.Send

'Display the HTML as text...
'Response.Write "<xmp>"
'Response.Write xml.responseText
'Response.Write "</xmp>"

'Or, render the HTML...
Response.Write xml.responseText
 
Set xml = Nothing

%>


The xml page is being rendered to the screen within scrape1.asp as expected. I have a couple of questions...


1.

How do I grab information from within the object based on the xml field name and stuff the results into a set of variables?


2.

I would also like to use this object to retrieve pure HTML pages. I would then like to grab information from the HTML pages using the regular expression object. Can you please give me an example of how to do this? An example of what I am tring to do would be:

HMTL = <td>price = £29.30</td>

VALUE IN VARIABLE = <% price %> (where price would = "29.30")




Thanks for you help...




PJORDANNA

0
Comment
Question by:pjordanna
2 Comments
 
LVL 15

Accepted Solution

by:
deighc earned 250 total points
ID: 9658082
>> 1. How do I grab information from within the object based on the xml field name and stuff the results into a set of variables?

You mean you want to loop thru the XML and read out the attributes and/or node text??? Something like this:

<%
' Add this code after xml.send and before set xml = nothing
' Dim these variable names: xmlNodeList, xmlNode

' Select a list of XML nodes
set xmlNodeList = xml.documentElement.selectNodes("/contacts/contact/field")
' Loop thru the node list
for each xmlNode in xmlNodeList
  ' Now you have access to the attributes collection of the <field> node
  Response.write "ID = " & xmlNode.attributes.getNamedItem("id").text
  Response.write "taborder = " & xmlNode.attributes.getNamedItem("id").text
  ' If you want access the text value of <field_value> node....
  Response.write "Value of field_value = " & xmlNode.childNodes(0).text
next
set xmlNodeList = nothing
%>

As for your second question, well, that's an entirely seperate problem and you really ought to post it as a different question. And my reg exp skills are crap.....  ;-)
0
 

Author Comment

by:pjordanna
ID: 9677339
deighc,

Cheers for that...works a treat.



pjordanna
0

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Suggested Solutions

Title # Comments Views Activity
Adding Datediff to staistics page 2 62
format nvarchar field as mm/dd/yyyy 4 78
Voice recognition ASP or ASP.NET or JavaScript 2 70
IIS components 2 14
I recently decide that I needed a way to make my pages scream on the net.   While searching around how I can accomplish this I stumbled across a great article that stated "minimize the server requests." I got to thinking, hey, I use more than one…
I would like to start this tip/trick by saying Thank You, to all who said that this could not be done, as it forced me to make sure that it could be accomplished. :) To start, I want to make sure everyone understands the importance of utilizing p…
Nobody understands Phishing better than an anti-spam company. That’s why we are providing Phishing Awareness Training to our customers. According to a report by Verizon, only 3% of targeted users report malicious emails to management. With compan…

809 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question