Is there a way to filter out these characters?
I believe there are some characters that cannot be processed due to security issues. I just don't know which ones.
This appears to happen only when an element within the XML file contains:
!DOCTYPE html PUBLIC