[Webinar] Streamline your web hosting managementRegister Today

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 527
  • Last Modified:

How can i identify UTF-7 and UTF-8 format by using TEC ver 1.4

How can i identify UTF-7 and UTF-8 format by using TEC ver 1.4
0
samir_ganu
Asked:
samir_ganu
1 Solution
 
roovCommented:
Samir,
Unicode files usually have a BOM (Byte Order Mark) sign at their start.
If the first three bytes of the stream are byte0=FF byte1=BB and byte2=BF, then the file is UTF8.

Optionally, using CarbonLib 1.2.5 you have GetTextAndEncodingFromCFString (declared in in Appearance.h). Then use the types in CFString.h for UTF8 and UTF7 (NonLossyASCII).

Within TEC, you can use Sniffers (TECSniffTextEncoding) - let me know if you want a code sample (reuven.sherwin@xmpie.com).

0
 
SpideyModCommented:
Force Accepted

SpideyMod
Community Support Moderator @Experts Exchange
0

Featured Post

Free Tool: ZipGrep

ZipGrep is a utility that can list and search zip (.war, .ear, .jar, etc) archives for text patterns, without the need to extract the archive's contents.

One of a set of tools we're offering as a way to say thank you for being a part of the community.

Tackle projects and never again get stuck behind a technical roadblock.
Join Now