Solved

Extracting Data from tables on HTML pages

Posted on 2006-10-30
Last Modified: 2010-04-23
I'm trying to write a sports betting application that will spider websites to get the upcoming fixtures and previous form, and then apply various selection criteria to the data. The data is found mainly in HTML table cells. How do I write code to loop through all the rows in all of the tables in an HTML page, treat each row as an individual record, and treat the data in its cells as the record's fields? I just want to extract each row so that I can put the data into a database.

I'm doing this in VB.NET.

Thanks
Question by:useless_eater
1 Comment

Accepted Solution

by: Kinger247 (earned 2000 total points)
ID: 17854109
The trouble is that although this can be done, it may not be the best way.

For example, you could add the WebBrowser control to a form and load a web page. Then loop through the elements in the document (the HTML page) and locate the tags you're looking for that contain the data you need.
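To make the row-and-cell loop concrete, here is a rough sketch of the idea. It is written in Python with the standard library's `html.parser` rather than in VB.NET with the WebBrowser control, so treat it as an illustration of the logic only. The names `TableRowExtractor`, `extract_rows` and `SAMPLE` are made up for this example, the sample markup is invented, and the code assumes the tables are not nested and that cells have closing tags.

```python
from html.parser import HTMLParser


class TableRowExtractor(HTMLParser):
    """Collects every <tr> on the page as a list of cell strings.

    Assumes tables are not nested inside one another.
    """

    def __init__(self):
        super().__init__()
        self.rows = []     # finished records, one list of fields per row
        self._row = None   # fields of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Only keep text that sits inside a cell.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            # Join the fragments and collapse runs of whitespace.
            self._row.append(" ".join("".join(self._cell).split()))
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


def extract_rows(html):
    """Return all table rows in the page as lists of cell text."""
    parser = TableRowExtractor()
    parser.feed(html)
    parser.close()
    return parser.rows


# Invented sample page with two tables.
SAMPLE = """
<table>
  <tr><th>Home</th><th>Away</th><th>Date</th></tr>
  <tr><td>Team A</td><td>Team B</td><td>2006-11-04</td></tr>
</table>
<table>
  <tr><td>Team C</td><td>W W D L</td></tr>
</table>
"""

if __name__ == "__main__":
    for record in extract_rows(SAMPLE):
        print(record)
```

Each inner list is one record whose items are the fields, so it can be passed more or less directly to a parameterised database insert. Which table a row came from is not tracked here; if that matters, you would also need to watch for the `table` start tag.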

BUT these pages are bound to be ASP-type pages whose layout is prone to change. When it changes, your extraction code stops working and you're back to square one again.
This WILL happen frequently!

It sounds like you're looking for a shortcut, but you're not going to get it so easily.

I was asked to do a similar thing with energy prices; in the end I gave up.

You could search for sites that offer the same data through web services instead.
That could be your only workable solution.