I want to create a gatherer of information that retrieve parts of a page from a partner site. The idea is to completely retrieve a text from the site, formatting the generated background, font, font-size and creating a zipfile from the result that is maintained in the server for an hour only. I'm having some problems with the retrieving part of the script. For the example of this script I'll use the www.fanfiction.net
site since my client has a confidentiality agreement. Imagine that I want to retrieve the contents of a particular story, that in this case is in a URL like: http://www.fanfiction.net/s/451545/1/
where the first part is the main site, the second part is the reference that it is a story, the third part the number of the story, and the last number is the chapter of the story. What I need is to create a single file or several files of this site capturing the text of the story itself, and ignoring the parts that are there only for the adds and for the management of the site. I'm having a complete block with it if someone could help me at the retrieving part at least, it would help immensely.