Solved

HTML Parser in VB .NET

Posted on 2004-09-26
Last Modified: 2012-05-05
I have to write a program in VB .NET that looks at HTML and text files, parses out specific information, and deposits it in a database. This would not be a problem if the target were something distinctive, like an e-mail address, which you could find by looking for the @ sign. But the pages do not all look the same or share a format, and the data I am looking for is just numbers. An example would be finding a person's salary on the page.

As a human, I would look around the page for references to "salary" and find the number that way.

Any ideas? Any information?

Where to start?

Regex maybe?
Question by:waterzap
4 Comments
 
LVL 70

Accepted Solution

by:
Éric Moreau earned 500 total points
ID: 12156043
 
LVL 96

Expert Comment

by:Bob Learned
ID: 12157036
If it is simple, you might also be able to get the HTML text and parse it with simple regular expressions. The HTML Document class is a fairly hefty chunk of real estate; using it for this is like squirrel hunting with an elephant rifle.

Bob
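
A minimal sketch of the regular-expression approach suggested above, under some loud assumptions: the module name SalaryFinder, the helper FindSalary, and the sample markup are all hypothetical, and the pattern simply takes the first number that appears within 40 non-digit characters after the word "salary". Real pages will probably need a list of keywords and patterns tuned to each layout, and the tag stripping here is crude (it does not handle scripts, comments, or numeric entities such as &#36;).

```vb
Imports System
Imports System.Text.RegularExpressions

Module SalaryFinder

    ' Hypothetical helper: returns the first number that follows the word
    ' "salary" in the given HTML, or Nothing when no such number is found.
    Function FindSalary(ByVal html As String) As String
        ' Crude tag removal: replace anything that looks like <...> with a space.
        Dim plain As String = Regex.Replace(html, "<[^>]+>", " ")

        ' "salary", then up to 40 non-digit characters (lazy), then a number
        ' such as 52,000 or 52000.50 with an optional leading dollar sign.
        Dim pattern As String = "salary\D{0,40}?(\$?\d[\d,]*(?:\.\d+)?)"
        Dim m As Match = Regex.Match(plain, pattern, RegexOptions.IgnoreCase)

        If m.Success Then
            Return m.Groups(1).Value
        End If
        Return Nothing
    End Function

    Sub Main()
        ' Made-up sample row; real input would come from the HTML or text file.
        Dim sample As String = "<tr><td><b>Salary:</b></td><td>$52,000</td></tr>"
        Console.WriteLine(FindSalary(sample))   ' prints $52,000
    End Sub

End Module
```

This mirrors what the question describes a human doing: find the word first, then look nearby for the number. If the number can appear before the keyword, or more than 40 characters away, the pattern has to be adjusted to match.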
 
LVL 18

Expert Comment

by:armoghan
ID: 12160371
If you need to find MSHTML... It's not 2005
Add Reference -> .NET -> Microsoft MSHTML -> Select
 
LVL 18

Expert Comment

by:armoghan
ID: 12160380
Oops... sorry, wrote in the wrong window.

Question has a verified solution.