Python: how to check mail status

Hi experts,

I am going to write Python code to upload new e-mails from a specific user account into a MySQL database. The code is scheduled to run every minute.

This user account is generic, so nobody will check its e-mails except the running code. Whenever the code runs, it should upload only NEW e-mails into the database. In other words, the code must not upload an e-mail if it is already in the database. Therefore I need to know how to tell whether an e-mail has already been "read" by the code.

Comparing the "From", "To" and "Body" of an e-mail with the corresponding parts in the database is certainly one way to find out whether that e-mail has been loaded, but this method is too clumsy to use. Is there a smarter way to do this?

Thanks so much.
davidw88 asked:

ilalopoulos commented:
Use the database for the duplicate checking; database engines are optimised for this kind of work.

So:

1. Decide what you mean by unique. E-mail headers give you at minimum the from, to, subject and date fields, so some of the combinations you can use to define uniqueness are:

to, from, subject, date
to, from, subject, date, and part of the body
date and a hash of to, from, body
etc.
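
As a rough sketch of where those fields come from, the snippet below pulls them out of a raw message with Python's standard `email` package. The sample message, the `extract_fields` helper and the `e_*` key names are assumptions made for illustration; how the raw text is fetched from the mailbox is not covered here.

```python
from email import message_from_string
from email.utils import parsedate_to_datetime

# Hypothetical raw message, used only for illustration.
RAW = (
    "From: alice@example.com\n"
    "To: inbox@example.com\n"
    "Subject: Weekly report\n"
    "Date: Mon, 02 Jan 2012 10:15:00 +0000\n"
    "\n"
    "Report attached.\n"
)

def extract_fields(raw_text):
    """Return the header fields and body of a single-part message as a dict."""
    msg = message_from_string(raw_text)
    return {
        "e_to": msg["To"],
        "e_from": msg["From"],
        "e_subject": msg["Subject"],
        # Convert the RFC 2822 date string into a datetime object
        "e_date": parsedate_to_datetime(msg["Date"]),
        # For a multipart message get_payload() returns a list of parts instead
        "e_body": msg.get_payload(),
    }

fields = extract_fields(RAW)
```

Real mail is often multipart, so the body handling would probably need more care than this single-part example shows.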

2. Define the combination you chose as the primary key of the table. Take care, though: depending on the MySQL storage engine you use, keys have a size limit, so the most effective choice may be the third option (one field plus a hash of the rest).
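
One possible way to compute the hash for the third option is sketched below. SHA-256 and the `email_fingerprint` helper name are my assumptions, since the post does not name a hash function; any stable hash of fixed length should work the same way.

```python
import hashlib

def email_fingerprint(e_to, e_from, e_body):
    """Return a 64-character hex digest over the to, from and body fields."""
    h = hashlib.sha256()
    for part in (e_to, e_from, e_body):
        data = part.encode("utf-8")
        # Length prefix so that ("ab", "c") and ("a", "bc") hash differently
        h.update(str(len(data)).encode("ascii") + b":")
        h.update(data)
    return h.hexdigest()

fp = email_fingerprint("inbox@example.com", "alice@example.com", "Report attached.")
```

The digest always has the same length, so it could be stored in a fixed-width column (for example CHAR(64)) and combined with the date as the key, which keeps the key size small regardless of how long the body is.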

In this example I will use subject and date which will be defined as the primary key for the table:

CREATE TABLE `emails` (
  `to` varchar(255) NOT NULL,
  `from` varchar(255) NOT NULL,
  `subject` varchar(255) NOT NULL,
  `date` datetime NOT NULL,
  `body` text NOT NULL,
  PRIMARY KEY  (`subject`,`date`)
);

3. The test for duplicates is done by the database engine. Here are two approaches:

a) Use the INSERT query with ON DUPLICATE KEY and a dummy update part. If the entry already exists, the database will neither complain nor change anything. This is the silent approach.

q = """INSERT INTO emails (e_to, e_from, e_subject, e_date, e_body) VALUES (%s, %s, %s, %s, %s)"""
# Pass the values separately so the driver escapes quotes in the e-mail text
cursor.execute(q, (e_to, e_from, e_subject, e_date, e_body))

More info on ON DUPLICATE KEY: http://dev.mysql.com/doc/refman/5.0/en/insert-on-duplicate.html

b) Use a try/except clause to catch duplicate key errors from MySQL:

import MySQLdb

q = """INSERT INTO emails (e_to, e_from, e_subject, e_date, e_body) VALUES (%s, %s, %s, %s, %s)"""

try:
    # Pass the values separately so the driver escapes quotes in the e-mail text
    cursor.execute(q, (e_to, e_from, e_subject, e_date, e_body))
    conn.commit()
except MySQLdb.IntegrityError as err:
    if err.args[0] == 1062:  # MySQL error 1062: duplicate entry for a key
        # Do whatever you want here for the duplicate
        print("duplicate %s %s" % (e_date, e_subject))
    else:
        raise  # Not a duplicate key error
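
To try the try/except pattern without a MySQL server, here is a small self-contained sketch that swaps in Python's built-in `sqlite3` module. This is a stand-in, not the MySQL code itself: the placeholder style is `?` rather than `%s`, the `store_email` helper and sample row are made up for illustration, and `sqlite3` does not expose MySQL's error code 1062, so any integrity error is treated as a duplicate here.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE emails ("
    " e_to TEXT NOT NULL, e_from TEXT NOT NULL, e_subject TEXT NOT NULL,"
    " e_date TEXT NOT NULL, e_body TEXT NOT NULL,"
    " PRIMARY KEY (e_subject, e_date))"
)

def store_email(conn, row):
    """Insert one e-mail; return True if stored, False if rejected as a duplicate."""
    try:
        conn.execute(
            "INSERT INTO emails (e_to, e_from, e_subject, e_date, e_body)"
            " VALUES (?, ?, ?, ?, ?)",
            row,
        )
        conn.commit()
        return True
    except sqlite3.IntegrityError:
        # The primary key (e_subject, e_date) already exists
        conn.rollback()
        return False

row = ("inbox@example.com", "alice@example.com", "Weekly report",
       "2012-01-02 10:15:00", "Report attached.")
first = store_email(conn, row)    # new e-mail
second = store_email(conn, row)   # same subject and date again
count = conn.execute("SELECT COUNT(*) FROM emails").fetchone()[0]
```

Running the same insert twice leaves a single row in the table, which is the behaviour the scheduled job relies on when it sees an old e-mail again.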

ilalopoulos commented:
In the above post I omitted the ON DUPLICATE KEY part in the first approach (3a).

So the complete 3a answer is:

a) Use the INSERT query with ON DUPLICATE KEY and a dummy update part. If the entry already exists, the database will neither complain nor change anything. This is the silent approach.

q = """INSERT INTO emails (e_to, e_from, e_subject, e_date, e_body) VALUES (%s, %s, %s, %s, %s) ON DUPLICATE KEY UPDATE e_date = e_date"""
# Pass the values separately so the driver escapes quotes in the e-mail text
cursor.execute(q, (e_to, e_from, e_subject, e_date, e_body))

More info on ON DUPLICATE KEY: http://dev.mysql.com/doc/refman/5.0/en/insert-on-duplicate.html

*Also, the CREATE TABLE statement uses different column names (to, from, subject, date, body) from the ones I use later in the examples (e_to, e_from, e_subject, e_date, e_body), but this is trivial to change.
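
If the job still wants to know whether the silent insert actually stored something, one option is to look at the affected-row count. The sketch below assumes the driver reports it as `cursor.rowcount` and that the connection does not set the CLIENT_FOUND_ROWS flag; the MySQL manual describes the count as 1 for a newly inserted row and 0 when an existing row is set to its current values. `store_silently` and `FakeCursor` are hypothetical names, and the fake cursor exists only so the sketch runs without a server.

```python
UPSERT_SQL = (
    "INSERT INTO emails (e_to, e_from, e_subject, e_date, e_body) "
    "VALUES (%s, %s, %s, %s, %s) "
    "ON DUPLICATE KEY UPDATE e_date = e_date"
)

def store_silently(cursor, row):
    """Run the silent insert; return True if the cursor reports one new row."""
    cursor.execute(UPSERT_SQL, row)
    return cursor.rowcount == 1

class FakeCursor:
    """Stand-in for a real DB-API cursor; rowcount is fixed by the caller."""
    def __init__(self, rowcount):
        self.rowcount = rowcount
        self.calls = []

    def execute(self, sql, params):
        # Record the call instead of talking to a database
        self.calls.append((sql, params))

row = ("inbox@example.com", "alice@example.com", "Weekly report",
       "2012-01-02 10:15:00", "Report attached.")
new_cursor = FakeCursor(rowcount=1)   # simulates a freshly inserted row
dup_cursor = FakeCursor(rowcount=0)   # simulates an existing, unchanged row
was_new = store_silently(new_cursor, row)
was_dup_new = store_silently(dup_cursor, row)
```

With a real connection, `cursor` would come from `conn.cursor()` and the row count would be set by the server, so this behaviour is worth checking against your own MySQL and driver versions.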

davidw88 (author) commented:
I see. Thanks, ilalopoulos, for your two replies.

I will test your idea and let you know later how it works.

Thanks again.
Python