parsing previous body

words words words<br>words.

On Mon, Jan 2, 2012 at 8:48 PM,
PREVIOUS BODY

Open in new window



words words words<br>words.

On Wed, Feb 12, 2012 at 10:48 PM,
PREVIOUS BODY

Open in new window


I am parsing gmail bodys

These are examples of the gmail body where the body ends and the previous body is displayed

I do not want to see the previous body, so I would like to strip everything after and including

[new line][new line]
On[space][day of week][comma][space][month][space][day of month][comma][space][year][space]at[time of day][space][am or pm]
LVL 1
rgb192Asked:
Who is Participating?
 
Terry WoodsConnect With a Mentor IT GuruCommented:
Try this:

$smaller_body=preg_replace("#^(.*?)\r?\n\r?\nOn \w{3}, \w{3} \d\d?, \d{4} at \d\d?:\d\d [AP]M.*#s","$1",$body)

Open in new window


(update: fixed bug)
0
 
Terry WoodsIT GuruCommented:
A regex for the pattern you're describing is:
\n\nOn \w{3}, \w{3} \d\d?, \d{4} at \d\d?:\d\d [AP]M

Open in new window

You missed a space before the time of day, so I've added it. You may like to add the comma to the end too for completeness.

In some cases I find there are also line feed characters in between newlines. They can be accounted for (without being required) like this:
\r?\n\r?\nOn \w{3}, \w{3} \d\d?, \d{4} at \d\d?:\d\d [AP]M

Open in new window

0
 
rgb192Author Commented:
$body='
words words words<br>words.

On Mon, Jan 2, 2012 at 8:48 PM,
PREVIOUS BODY
 ';

Open in new window


how can I write $smaller_body=replace(pattern,$body)
0
 
rgb192Author Commented:
I have a related question for html gmail body

http://www.experts-exchange.com/Programming/Languages/Regular_Expressions/Q_28020015.html

works thanks
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.