We have thousands of files, not all of which contain the necessary defines for the different languages that our product needs to support. For example, we have PHP strings that contain language that should really be in a definition so that it is configurable:
"This is a test string"
define('This is a test string', TEST_STRING);
Typical problems are that of dealing with the start and end of the PHP tags i.e. <?php and <?=php so that the correct filtering can be used to detect english strings.
The end result of such a script would be to highlight all of the areas where english text is used, so that these areas could be visited directly and the prgrammers could make the necessary modifications to DEFINE the language so that correct language files can be built and the language of the product can be changed.
Not being an expert with regular expressions to start with, I feel that this is a topic much better suited to a site like Experts Exchange, rather than struggling to find the solution ourselves.
Thanks and Good Luck