Solved

HTML Parser in VB .NET

Posted on 2004-09-26
4
916 Views
Last Modified: 2012-05-05
I have to write a program in VB .NET that would look at a html and text files, parse out specific information and then deposit it in a database. This would not be a problem if it was something specific like an e-mail address that you could look at the @ sign for example. Not all the pages look exactly the same, not all have the same format, and the data that I am looking for is just numbers. An example would be to find a person's salary on the page.

As a human, I would look around on the page, look for references of "salary", then reference it that way.

Any ideas ? Information ?

Where to start?

Regex maybe?
0
Comment
Question by:waterzap
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
4 Comments
 
LVL 70

Accepted Solution

by:
Éric Moreau earned 500 total points
ID: 12156043
0
 
LVL 96

Expert Comment

by:Bob Learned
ID: 12157036
If it is simple, you might also be able to get the HTML text, and use simple Regular Expressions to parse.  The HTML Document class is a fairly hefty chunk of real estate that is like squirrel hunting with an elephant rifle.

Bob
0
 
LVL 18

Expert Comment

by:armoghan
ID: 12160371
If you need to find MSHTML.. Its not 2005
Add Reference -> .NET -> Microsoft MSHTML -> Select
0
 
LVL 18

Expert Comment

by:armoghan
ID: 12160380
opps ... sorry wrote in the wrong window
0

Featured Post

Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

IP addresses can be stored in a database in any of several ways.  These ways may vary based on the volume of the data.  I was dealing with quite a large amount of data for user authentication purpose, and needed a way to minimize the storage.   …
Flash (http://en.wikipedia.org/wiki/Adobe_Flash) has evolved over the years to where it has become a masterful tool for displaying content screen.  It has excellent layout placement, UI precision as well as rendering capabilities. This, along with t…
If you're a developer or IT admin, you’re probably tasked with managing multiple websites, servers, applications, and levels of security on a daily basis. While this can be extremely time consuming, it can also be frustrating when systems aren't wor…
Add bar graphs to Access queries using Unicode block characters. Graphs appear on every record in the color you want. Give life to numbers. Hopes this gives you ideas on visualizing your data in new ways ~ Create a calculated field in a query: …

717 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question