Solved

Need a good screen scraper

Posted on 2014-11-09
Last Modified: 2014-11-15
I need to scrape various news websites for articles on a specific topic.

What are some good programs? I want free ones, or moderately priced ones if their features are really good.

Thanks.
Question by:newbieweb
 
LVL 29

Assisted Solution

by:fibo
fibo earned 500 total points
ID: 40433189
Do you want to scrape the HTML source code of the pages, or a picture/screen capture of them (as rendered in a selected web browser, since the browser may affect the result)?
 

Author Comment

by:newbieweb
ID: 40433225
I only care about words. So HTML titles and text.
 
LVL 29

Assisted Solution

by:fibo
fibo earned 500 total points
ID: 40433471
For HTML, especially if you want to display your captures, I would suggest taking a look at Teleport Pro ($50, free trial version available), which I have been happily using for a long time.

I see that there are now more complete (and more expensive!) versions available, but they target much larger needs than you may have.
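
Since only the titles and text matter, a saved page can also be reduced to plain words with a short script. This is only a minimal sketch using Python's standard `html.parser` module, not what Teleport Pro does internally; the names `TextExtractor` and `extract_words` are hypothetical, and real news pages will usually need extra filtering (menus, footers, ads).

```python
from html.parser import HTMLParser


class TextExtractor(HTMLParser):
    """Collects the <title> and the visible text of one HTML page."""

    SKIP = {"script", "style"}  # tags whose contents are not readable text

    def __init__(self):
        super().__init__()
        self.title = ""
        self.chunks = []
        self._in_title = False
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag == "title":
            self._in_title = True
        elif tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag == "title":
            self._in_title = False
        elif tag in self.SKIP and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if not text or self._skip_depth:
            return  # ignore whitespace and script/style contents
        if self._in_title:
            self.title += text
        else:
            self.chunks.append(text)


def extract_words(html):
    """Return (title, body_text) for one HTML document given as a string."""
    parser = TextExtractor()
    parser.feed(html)
    parser.close()
    return parser.title, " ".join(parser.chunks)
```

Calling `extract_words(page_source)` on the HTML of a downloaded article returns the page title and the remaining text as two strings.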

 

Author Comment

by:newbieweb
ID: 40433797
I forgot to mention, I need it smart enough to:

- run on a cycle
- only flag the new articles

Is Teleport Pro still the right choice?

I have also heard ScrapeBox and Mozenda are good.
 
LVL 29

Accepted Solution

by:fibo
fibo earned 500 total points
ID: 40435605
They seem a good choice too, if they match your needs.
Teleport is more of a "site scraper" for making complete copies of sites; it is not especially suited to scraping pages in bulk, the way ScrapeBox is.
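
Whichever tool you pick, the "run on a cycle, only flag the new articles" requirement mostly comes down to remembering what earlier runs have already seen. Here is a minimal sketch of that idea in Python, assuming each run produces a list of (url, title) pairs; the function names and the JSON state file are hypothetical, and this is not how ScrapeBox or Mozenda work internally.

```python
import hashlib
import json
import os


def article_key(url, title):
    """Stable fingerprint for an article, built from its URL and title."""
    return hashlib.sha256(f"{url}\n{title}".encode("utf-8")).hexdigest()


def load_seen(path):
    """Load the fingerprints recorded by earlier runs (empty on first run)."""
    if not os.path.exists(path):
        return set()
    with open(path, encoding="utf-8") as fh:
        return set(json.load(fh))


def save_seen(path, seen):
    """Persist the fingerprints so the next cycle can compare against them."""
    with open(path, "w", encoding="utf-8") as fh:
        json.dump(sorted(seen), fh)


def flag_new(articles, seen):
    """Return the (url, title) pairs not seen before and record them in `seen`."""
    fresh = []
    for url, title in articles:
        key = article_key(url, title)
        if key not in seen:
            seen.add(key)
            fresh.append((url, title))
    return fresh
```

Each scheduled run (cron, Windows Task Scheduler, or similar) would call `load_seen`, pass the freshly scraped list through `flag_new`, then call `save_seen`. Only the articles returned by `flag_new` need attention.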
 

Author Closing Comment

by:newbieweb
ID: 40444015
thanks
 
LVL 29

Expert Comment

by:fibo
ID: 40444223
Glad it helped. Thanks for the points and the grade.
