pulling information from html source file

I'm not sure how to describe this but here it goes.

i am going to remax's website and i get a ton of results based on my search parameters.

i get 30 houses per webpage.  I select view source on the webpage to see the html file.  I want to extract all the addresses without doing it manually.  is there a way to accomplish this.

attached is an html file that contains the information....
again i want to extract the information from the source code... addresses and populate an excel spreadsheet with the addresses
Who is Participating?
Scott Fell, EE MVEConnect With a Mentor Developer & EE ModeratorCommented:
The term you are looking for is called screen scraping.  You will use a scripting language such as javascript, php, asp, python or just about any scripting language to read the site's html and convert tags such as <tr><td> to line feeds and data delimiters.  You have to custom design your screen scrap for each site.

However, in this case, just about any real estate site where a REALTOR is involved, uses data from one or more local Multiple Listing Services (MLS).  The MLS is tightly guarded and you can not legally copy the data.  

See the TOS for remax under RESTRICTIONS, the first bullet point.
Unless explicitly specified or with separate, written permission from RE/MAX, LLC, you may not and agree that you will not:
copy, modify, distribute, transmit, display, reproduce, publish, license, create derivative works from, frame in another web site, use on any other web site, transfer or sell any information obtained from this Web Site or any part thereof;
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.