Want to protect your cyber security and still get fast solutions? Ask a secure question today.Go Premium

x
  • Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 207
  • Last Modified:

UTF-8 and wstring

I've been thinking that UTF-8 is best represented in a wstring, but I've just come across a library that has turned my world upside down.  They use char* for UTF-8.  Apparently this is the way it's supposed to be?  If my existing libraries all use wstring for UTF-8, what are your recommendations for me?

Thanks!
Mike
0
thready
Asked:
thready
  • 2
  • 2
2 Solutions
 
threadyAuthor Commented:
ok wow.. jkr  has not answered my question yet.  It must be that I'm not the only one that finds this confusing...!  :)
0
 
jkrCommented:
Be careful what you are wishing for - UTF-8 (http://en.wikipedia.org/wiki/UTF-8) - as the name implies it *tries* to encode UNICODE to a 8-bit contained space, but that requires code points (http://en.wikipedia.org/wiki/Code_point) which are IMO basically a different expression for 'code pages'. So, in a nutshell: If you *need* UNICODE, there is no way around using UNICODE or at least UTF-16, then you *might* be sure that the character mappings will be somehow universal. If you can live with a subset of that (and UTF-8 does not even pretend to be anything else), it will be fine.
0
 
jkrCommented:
*LOL*
0
 
threadyAuthor Commented:
Thanks jkr  :-)
0

Featured Post

Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

  • 2
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now