[Webinar] Streamline your web hosting managementRegister Today

x
?
Solved

Extracting Data from tables on HTML pages

Posted on 2006-10-30
3
Medium Priority
?
216 Views
Last Modified: 2010-04-23
I'm trying to write a sports betting application that will spider websites to get the upcoming fixtures and previous form and then apply various selection criteria to the data.  The data is found mainly in HTML table cells.  How do I write code to loop through all the rows in all of the tables in a HTML page and extract the rows and treat them as individual records, and then extract the data from the cells and treat them as the record fields?  I just want to extract each row so that I can stick the data into a database.

This is done in VB.Net.

Thanks
0
Comment
Question by:useless_eater
1 Comment
 
LVL 10

Accepted Solution

by:
Kinger247 earned 2000 total points
ID: 17854109
The trouble you have is that although this could be done, it may not be the best way.

For example, you could add the webbrowser control to a form and load a web page. Then loop through the elements on the document (html page), locate the tags your looking for that contain the data you need.

BUT, these pages are bound to be asp type pages and prone to change. Then your back to square one again.
This WILL happen frequently !

It sounds like your looking for a shortcut, but your not going to get it so easliy.

I was asked to a do a smiliar thing with energy proces, in the end I gave up.

You could search for some sites that offer web services ?
Could be your only possible solution.
0

Featured Post

Free Tool: Site Down Detector

Helpful to verify reports of your own downtime, or to double check a downed website you are trying to access.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

Article by: Jorge
XML Literals are a great way to handle XML files and the community doesn’t use it as much as it should.  An XML Literal is like a String (http://msdn.microsoft.com/en-us/library/system.string.aspx) Literal, only instead of starting and ending with w…
It was really hard time for me to get the understanding of Delegates in C#. I went through many websites and articles but I found them very clumsy. After going through those sites, I noted down the points in a easy way so here I am sharing that unde…
Planning to migrate your EDB file(s) to a new or an existing Outlook PST file? This video will guide you how to convert EDB file(s) to PST. Besides this, it also describes, how one can easily search any item(s) from multiple folders or mailboxes…
Hi, this video explains a free download that you can incorporate into your Access databases, or use stand-alone for contact management. Contacts -- Names, Addresses, Phone Numbers, eMail Addresses, Websites, Lists, Projects, Notes, Attachments…

612 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question