I have a large html file that I am trying to scrape the names of companies out of. The company names are always in the following format:
ome Place, Inc.</a>
I would want "Some Place, Inc." as the result here. The company names could be one or more words, they might even have special characters in the name. (@, -, etc) But they will always have "<a href="offsite_quotes.asp?c
ontent=" followed by a url and a "">", then the name of the company.
There might be more than one company name per line. If there is, it would be important to print each one per line. I don't know if doing this with a open file, and while loop would be the way to go or not.