Hi, I have two tasks I am trying to accomplish. Both involve I guess what is referred to as scraping.
First, I want to collect data from google groups related to illnesses. There are many groups where people discuss there illnesses and I want to build a database consisting of the words in these groups. By a database I only mean a spreadsheet with the columns being the words (every word appearing in the thread would be a column heading) and each row being a separate thread (or posting). I was told that using RSS would be a good idea because all google groups have rss feeds.
Can anyone give me a roadmap for how to go about doing this?
Thanks so much.