Improve company productivity with a Business Account.Sign Up

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 532
  • Last Modified:

How can i identify UTF-7 and UTF-8 format by using TEC ver 1.4

How can i identify UTF-7 and UTF-8 format by using TEC ver 1.4
0
samir_ganu
Asked:
samir_ganu
1 Solution
 
roovCommented:
Samir,
Unicode files usually have a BOM (Byte Order Mark) sign at their start.
If the first three bytes of the stream are byte0=FF byte1=BB and byte2=BF, then the file is UTF8.

Optionally, using CarbonLib 1.2.5 you have GetTextAndEncodingFromCFString (declared in in Appearance.h). Then use the types in CFString.h for UTF8 and UTF7 (NonLossyASCII).

Within TEC, you can use Sniffers (TECSniffTextEncoding) - let me know if you want a code sample (reuven.sherwin@xmpie.com).

0
 
SpideyModCommented:
Force Accepted

SpideyMod
Community Support Moderator @Experts Exchange
0
Question has a verified solution.

Are you are experiencing a similar issue? Get a personalized answer when you ask a related question.

Have a better answer? Share it in a comment.

Join & Write a Comment

Featured Post

Free Tool: Port Scanner

Check which ports are open to the outside world. Helps make sure that your firewall rules are working as intended.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now