Solved

Encodings on the Internet

Posted on 2003-10-28
3
196 Views
Last Modified: 2010-04-09
This question is about web pages encoded with UTF-8. If I use the tag
<meta http-equiv="content-type" content="type-html"; charset=utf-8">
everything is fine (IE displays the page correctly).
But is this the only form of Unicode IE accepts? I tried "charset=unicode-1-1" and it didn't work. What are the encodings today's Internet supports? Couldn't find this anywhere clearly explained...
0
Comment
Question by:campinas
[X]
Welcome to Experts Exchange

Add your voice to the tech community where 5M+ people just like you are talking about what matters.

  • Help others & share knowledge
  • Earn cash & points
  • Learn & ask questions
  • 2
3 Comments
 
LVL 31

Accepted Solution

by:
seanpowell earned 500 total points
ID: 9633740
Here's a couple of good links for you:

A list of the character sets that IE can use:
http://msdn.microsoft.com/library/default.asp?url=/workshop/author/dhtml/reference/charsets/charset4.asp#charsets4

An explanation (in traditional W3C style):
http://www.w3.org/TR/REC-html40/charset.html

There's some specific info available for the UTF-8 standard at - you guessed it:
http://www.utf-8.com/
0
 

Author Comment

by:campinas
ID: 9636827
Thank you!

I found the first link (Microsoft) most useful. However, I tried the charset label "unicode" for a UTF-16 page and it didn't work. Maybe the endianness was wrong.

Meanwhile I found this article

http://www-106.ibm.com/developerworks/library/utfencodingforms/ 

on forms of unicode, very good albeit rather old.

I'd conclude that the safest web page encoding for me is UTF-8, for these reasons:
- it covers all languages (like any form of Unicode)
- it is optimized for Roman based languages
- most text editors have an option for a default UTF-8 file save
- Word saves rtf and htm in UTF-8
- endianness is fixed (no worry about choosing that)
- it is supported by all browsers (I think), not like UTF-16
- it is definetely the dominant form of Unicode today, so why bother with what'll be tomorrow
- although it's slower with indexing, the difference may not be significant (?) with fast processors

If I was wrong or missed something, please correct.
0
 
LVL 31

Expert Comment

by:seanpowell
ID: 9650466
Jeez - sorry for the delay. You've pretty much nailed everything and absolutely UTF-8 is the best bet in your case - and nicely done.

Thanks,
GM
0

Featured Post

Free Tool: SSL Checker

Scans your site and returns information about your SSL implementation and certificate. Helpful for debugging and validating your SSL configuration.

One of a set of tools we are providing to everyone as a way of saying thank you for being a part of the community.

Question has a verified solution.

If you are experiencing a similar issue, please ask a related question

SASS allows you to treat your CSS code in a more OOP way. Let's have a look on how you can structure your code in order for it to be easily maintained and reused.
This article demonstrates how to create a simple responsive confirmation dialog with Ok and Cancel buttons using HTML, CSS, jQuery and Promises
In this tutorial viewers will learn how to position overlapping items using z-index in CSS. They will also learn the restrictions on the z-index property.  Create a new HTML document with an internal stylesheet.: Create a div in CSS and name it Red.…
HTML5 has deprecated a few of the older ways of showing media as well as offering up a new way to create games and animations. Audio, video, and canvas are just a few of the adjustments made between XHTML and HTML5. As we learned in our last micr…

739 members asked questions and received personalized solutions in the past 7 days.

Join the community of 500,000 technology professionals and ask your questions.

Join & Ask a Question