Solved

Parsing HTML with Visual C# 2005

Posted on 2008-10-22
Last Modified: 2013-12-17
Hello,
I'm a real beginner with Visual C#. So far I've only written simple task programs and a text parser for my servers.
I would like to parse an HTML web page:
get the links from the first page, then follow each link and fetch its data, compare that data with the database, and store it if it does not exist yet or has changed.
Is there an easy (dumb :)) way to get started with this in Visual C# (a book, a web page, MSDN explanations)?
Other possible approaches are welcome. (I don't know whether I could do this with SQL Server 2005 Analysis Services or with other data-extraction software.)
Thank you
Question by:TAI-

Accepted Solution

by: openshac (earned 500 total points)
ID: 22774493
The following links might be helpful:

http://www.experts-exchange.com/Programming/Languages/.NET/ASP.NET/Q_23493776.html?sfQueryTermInfo=1+c+screenscrap
http://www.experts-exchange.com/Internet/Web_Development/Q_23417990.html?sfQueryTermInfo=1+c+screenscrap
http://community.screen-scraper.com/Tutorial_4_page_3_Invoking%20screen-scraper%20from%20C%2523.NET

I've written a screen scraper before, and once you have the HTML you can easily search for the part of the page you want. With any luck that part of the page is XHTML (i.e. HTML that is also well-formed XML), in which case you can load it into an XML document and parse it to your heart's content. Otherwise you may need to resort to regular expressions to find the parts of the page you are interested in.
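
To make the two routes above concrete, here is a minimal sketch of pulling the links out of a page. It is only an illustration under some assumptions: the class and method names (`LinkExtractor`, `FromXhtml`, `FromHtml`, `Extract`) are made up for this example, the regular expression is deliberately loose and will miss unquoted `href` values, and the XML route only works when the page really is well-formed and uses no undeclared entities such as `&nbsp;`.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;
using System.Xml;

// Hypothetical helper for this thread, not part of any library.
public static class LinkExtractor
{
    // Loose pattern: an <a ...> tag with a quoted href attribute.
    // Assumption: href values are wrapped in single or double quotes.
    private static readonly Regex HrefPattern = new Regex(
        "<a\\b[^>]*?\\bhref\\s*=\\s*[\"']([^\"']*)[\"']",
        RegexOptions.IgnoreCase);

    // Route 1: the page is well-formed XHTML, so treat it as XML.
    public static List<string> FromXhtml(string xhtml)
    {
        XmlDocument doc = new XmlDocument();
        doc.XmlResolver = null;   // do not try to download an external DTD
        doc.LoadXml(xhtml);       // throws XmlException if not well-formed

        List<string> links = new List<string>();
        // GetElementsByTagName matches on the tag name as written,
        // so it works whether or not the page declares the XHTML namespace.
        foreach (XmlNode node in doc.GetElementsByTagName("a"))
        {
            XmlElement anchor = (XmlElement)node;
            if (anchor.HasAttribute("href"))
            {
                links.Add(anchor.GetAttribute("href"));
            }
        }
        return links;
    }

    // Route 2: ordinary (possibly malformed) HTML, searched with a regex.
    public static List<string> FromHtml(string html)
    {
        List<string> links = new List<string>();
        foreach (Match match in HrefPattern.Matches(html))
        {
            links.Add(match.Groups[1].Value);
        }
        return links;
    }

    // Try the XML route first and fall back to the regex route.
    public static List<string> Extract(string page)
    {
        try
        {
            return FromXhtml(page);
        }
        catch (XmlException)
        {
            return FromHtml(page);
        }
    }
}
```

To get the page text in the first place, `new System.Net.WebClient().DownloadString(url)` (available since .NET 2.0) should be enough for simple pages. You would then call `LinkExtractor.Extract(pageText)`, loop over the returned links, download each one the same way, and do the database comparison on the result.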

Expert Comment

by:TechTiger007
ID: 22774513
I think this is what you are looking for:
http://www.developer.com/net/csharp/article.php/10918_2230091_1

Author Closing Comment

by:TAI-
ID: 31508663
Thank you for the scraper link.