• Status: Solved
  • Priority: Medium
  • Security: Public
  • Views: 437
  • Last Modified:

gb2312 character set

I am trying to print some chinese characters within a static control.  The characters are in gb2312 format and are stored within a file.  The following code reads in the character but doesn't display the chinese character.

Anyone have any ideas ? I am using the win95 operating system.

     static CFont newfnt;
     CStatic* st=(CStatic*)GetDlgItem(IDC_MSG);
     
     CFont* fnt=st->GetFont();
     LOGFONT lf;

     fnt->GetLogFont(&lf);

     lf.lfCharSet=GB2312_CHARSET;
     strcpy(lf.lfFaceName ,"MS Song");

     newfnt.CreateFontIndirect(&lf);

     st->SetFont(&newfnt);

     FILE* fh;

     fh=fopen("c:\\temp\\test5.txt","rb");

     char buf[3];

     fread(buf,2,1,fh);
     
     buf[2]='\0';

     st->SetWindowText(buf);
0
ajh020797
Asked:
ajh020797
  • 3
  • 3
  • 2
1 Solution
 
BeyondWuCommented:
Have you installed the corresponding charset?
Which version OS are you using(English/Chinese)?
Have you tested st->SetWindowText("Chinese string") directly?
0
 
ajh020797Author Commented:
I am using an english version of windows 95, I have installed the microsoft simplified chinese language pack/ime.  I can view gb webpages ok, I can also enter chinese text using the ime within ie5 and word2000.  I have tried to enter chinese characters into visual studio directly but this doesn't seem to work.
0
 
jizhangCommented:
I do not know if your MS Song font on Win95 is the same font as I used
on Windows NT.  The MS Song font I use  is unicode TTF GB2312
Chinese font.  You need to convert GB2312 codes to unicode first and
then reder with MS Song.

GB2312 are 2 BYTE 8bit codes, you may need to use unsigned char
or int array to store.

SetWindowsTextA may not work for 2 BYTE codes.

Ji.Zhang@nrc.ca
0
Take Control of Web Hosting For Your Clients

As a web developer or IT admin, successfully managing multiple client accounts can be challenging. In this webinar we will look at the tools provided by Media Temple and Plesk to make managing your clients’ hosting easier.

 
BeyondWuCommented:
Try this:
lf.lfCharSet=GB2312_CHARSET;

lf.lfOutPrecision = OUT_TT_PRECIS;
lf.lfClipPrecision = CLIP_TT_ALWAYS;
lf.lfQuality = PROOF_QUALITY;
lf.lfPitchAndFamily = VARIABLE_PITCH | FF_DONTCARE | TMPF_TRUETYPE;

strcpy(lf.lfFaceName ,_T("MS Song"));
newfnt.CreateFontIndirect(&lf);

0
 
ajh020797Author Commented:
I tried the changes to the font but it doesn't seem to have worked.

I am led to believe that win95 is not a unicode based o/s so converting to unicode is not an option.  Win95 only supports MBCS.

MS Office, Outlook ecpress, and internet explorer all seem to be able to display gb2312 and big5 encoded text withuot any problem on my system.
0
 
jizhangCommented:

Hi there,

SetFont() should work.

The problem is in SetWindowsText(), which only supports single byte codes.

You may set big font size like this:
       LOGFONT   lf;
       lf.lfHeight = 24;
       lf.lfWidth = 12;
  ..
     strcpy(lf.lfFaceName, _T("MS Song"));
       newfnt.CreateFontIndirect(&lf);
  ..
 GetDlgItem(IDC_MSG)->SetWindowText("abcde");
You will see the SetFont() works (lookat  the  font  face is MS Song).

>MS Office, Outlook ecpress, and internet explorer all seem to be able to display
>gb2312 and big5 encoded ..

  You can draw gb2312 with TextOutW() -- that I used to use on WinNT.
  Win95 has TextOutW function in English Env as well.

  for example,
  wchar_t stru[] = {0x4e2d, 0x6587};  // unicode {0x4e2d 0x6587} are gb2312 codes {0xd6d0 0xcec4}
  TextOutW( pDC)->m_hDC, 50, 50, stru, 2);
  that works in OnDraw() of CxxView.

 But it seems not work in dialog control:
 TextOutW( GetDlgItem(IDC_MSG)->GetDC()->m_hDC, 0,0, stru, 2);

 There is no SetWindowsTextW() in MFC English Env.
0
 
ajh020797Author Commented:
Thanks !! Finally I have a way to display these characters, all I need to do now if work out how to convert into unicode !
0
 
jizhangCommented:

You need a table to convert from/to unicode/GB.

You may get my table gb2312.dat in uni03.zip at
http://www.zanmen.com/homepages/JiZhang/software/

gb2312.dat is ASCII file, which looks like:
....
f7e79e8b
f7e89e92
f7e993d6
f7ea9e9d
f7eb9e9f
f7ec9edb
f7ed9edc
...

read in  format   %4x%4x  to gb_tab(i), uni_tab(i)
translation:
   if you have a GB, then look up gb_tab() to get k, translate to the value in  uni_tab(k)

Besides, ASCII char set is a sub-set of GB, you may need to deal  with ascii
if you use mixed ascii+GB in your strings.


0

Featured Post

Get expert help—faster!

Need expert help—fast? Use the Help Bell for personalized assistance getting answers to your important questions.

  • 3
  • 3
  • 2
Tackle projects and never again get stuck behind a technical roadblock.
Join Now