Recognize Chinese Multibyte Character
Posted on 2004-11-01
I'd like to ask any of you the method of recognizing Chinese character (a multibyte character) in a passage containing both Chinese and some single byte characters, such as English and numbers.
When I use a pointer, it only points the passage byte by byte and it is not able to detect whether it is a multibyte character or not.
Is there a way to:
1. Extract these Chinese characters from the passage OR
2. Intelligently pointing character by character (not matter the character is multibyte or single byte) OR
3. Convert all of them to multibyte characters?
Your suggestions will be much appreciated! Thanks!