Ready-made classes needed to read HTML files

Hi there,
I want to read an HTML file and get the links in that file. Before implementing this myself, I wanted to know whether there are any ready-made classes for it. I have tried, with no success. Any info on this would be helpful.
cwrea commented:
Yes, there are ready-made classes, and chances are you already have them.

You can use Internet Explorer's Document Object Model.  Basically, this is a set of COM interfaces that let you load an HTML document (either from a file or from the web) and then, for example, iterate through the document's tags (by type, and so on).

Check out the following interfaces:  IWebBrowser2, IHTMLDocument2, IHTMLElement, etc.  These interfaces are available with Internet Explorer 4 and 5.
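
To make the "load a document, then iterate through its tags by type" idea concrete, here is a minimal sketch. Note that it does not use the Internet Explorer COM interfaces named above; it swaps in Python's standard-library html.parser module as a portable stand-in for the same pattern. The names LinkCollector, extract_links and extract_links_from_file are hypothetical helpers made up for this illustration, and the UTF-8 default encoding is an assumption.

```python
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collects the href value of every <a> tag the parser encounters."""

    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lower-cases tag and attribute names before calling this.
        # attrs is a list of (name, value) pairs; value may be None.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.links.append(value)


def extract_links(html_text):
    """Return the href values of all anchors in html_text, in document order."""
    parser = LinkCollector()
    parser.feed(html_text)
    parser.close()
    return parser.links


def extract_links_from_file(path, encoding="utf-8"):
    """Read an HTML file from disk (encoding is an assumption) and extract its links."""
    with open(path, encoding=encoding, errors="replace") as f:
        return extract_links(f.read())
```

With the IE DOM route, the equivalent step would be asking the loaded document for its anchor elements rather than filtering tags in a callback; the sketch above only shows the general shape of the task.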

At MSDN online there is a page called "Reusing Browser Technology" at:
