I recently upgraded Spamd (and relevant perl modules) on a CentOS host; Spamassassin went from version 3.1.9 to 3.2.4. Since the upgrade, spamd will occassionally freak out and start generating vast quantities of errors like this:
Malformed UTF-8 character (unexpected non-continuation byte 0x00, immediately after start byte 0xd2) in pattern match (m//) at /etc/mail/spamassassin/70_
sare_adult
.cf, rule SARE_OBFUHARDCORE, line 1, <GEN18> line 510.
I performed the same upgrade on other hosts (admittedly, not with identical distro/package configs) and those upgrades did not produce this error. I've tried specifying the LANG=en.US in the init.d spamd script, but that hasn't made any difference.
I am very knowledgeable about MTAs and general spam filtering operations and packages, but I don't know much of anything about character sets and/or encoding methods.
What do I need to do to resolve these errors?
The spamassassin and perl module RPMs are from RPMForge (DAG repository).
Start Free Trial