How can i identify UTF-7 and UTF-8 format by using TEC ver 1.4

How can i identify UTF-7 and UTF-8 format by using TEC ver 1.4
samir_ganuAsked:
Who is Participating?
 
roovCommented:
Samir,
Unicode files usually have a BOM (Byte Order Mark) sign at their start.
If the first three bytes of the stream are byte0=FF byte1=BB and byte2=BF, then the file is UTF8.

Optionally, using CarbonLib 1.2.5 you have GetTextAndEncodingFromCFString (declared in in Appearance.h). Then use the types in CFString.h for UTF8 and UTF7 (NonLossyASCII).

Within TEC, you can use Sniffers (TECSniffTextEncoding) - let me know if you want a code sample (reuven.sherwin@xmpie.com).

0
 
SpideyModCommented:
Force Accepted

SpideyMod
Community Support Moderator @Experts Exchange
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

All Courses

From novice to tech pro — start learning today.