Solved

Parsing HTML with Visual c# 2005

Posted on 2008-10-22
3
501 Views
Last Modified: 2013-12-17
Hello,
I'm a really visual c# beginner. I've only created simple task programs and a text parser form my servers.
I would like to parse a html web page :
get links from the first page then connect to these links get data, compare with database and if now exist or not changed put the data in database.
Is there an easy (dumb :))  way for visual c# to begin with (a book, web page, msdn for explanations)
Othe possible ways are welcome. (Don't know if I can do it with sql server 2005 analysis service, or other softwares to extract data)
Thank you
0
Comment
Question by:TAI-
3 Comments
 
LVL 6

Accepted Solution

by:
openshac earned 500 total points
Comment Utility
The following links might be helpful:

http://www.experts-exchange.com/Programming/Languages/.NET/ASP.NET/Q_23493776.html?sfQueryTermInfo=1+c+screenscrap
http://www.experts-exchange.com/Internet/Web_Development/Q_23417990.html?sfQueryTermInfo=1+c+screenscrap
http://community.screen-scraper.com/Tutorial_4_page_3_Invoking%20screen-scraper%20from%20C%2523.NET

I've written a scrrenscraper before and once you've got the HTML you can easily search for the part of the page you want.  With any luck that part of the page may be in XHTML (i.e. HTML in strict XML format) in which case you can load it into an XML document and parse to your hearts content.  Otherwise you may need to resort to regular expressions to find the parts of the page you are interested in.
0
 
LVL 13

Expert Comment

by:TechTiger007
Comment Utility
I think this is what you are looking for
http://www.developer.com/net/csharp/article.php/10918_2230091_1
0
 

Author Closing Comment

by:TAI-
Comment Utility
Thank you for the scraper link.
0

Featured Post

How your wiki can always stay up-to-date

Quip doubles as a “living” wiki and a project management tool that evolves with your organization. As you finish projects in Quip, the work remains, easily accessible to all team members, new and old.
- Increase transparency
- Onboard new hires faster
- Access from mobile/offline

Join & Write a Comment

APEX (Application Express) is used to develop a web application from Oracle. SQL Workshop is one of the tools that comes with Oracle APEX to query or modify the database objects or to make any changes to the structure.
Use this article to create a batch file to backup a Microsoft SQL Server database to a Windows folder.  The folder can be on the local hard drive or on a network share.  This batch file will query the SQL server to get the current date & time and wi…
Video by: Steve
Using examples as well as descriptions, step through each of the common simple join types, explaining differences in syntax, differences in expected outputs and showing how the queries run along with the actual outputs based upon a simple set of dem…
Polish reports in Access so they look terrific. Take yourself to another level. Equations, Back Color, Alternate Back Color. Write easy VBA Code. Tighten space to use less pages. Launch report from a menu, considering criteria only when it is filled…

744 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question

Need Help in Real-Time?

Connect with top rated Experts

9 Experts available now in Live!

Get 1:1 Help Now