Encodings on the Internet

This question is about web pages encoded with UTF-8. If I use the tag
<meta http-equiv="content-type" content="type-html"; charset=utf-8">
everything is fine (IE displays the page correctly).
But is this the only form of Unicode IE accepts? I tried "charset=unicode-1-1" and it didn't work. What are the encodings today's Internet supports? Couldn't find this anywhere clearly explained...
campinasAsked:
Who is Participating?
I wear a lot of hats...

"The solutions and answers provided on Experts Exchange have been extremely helpful to me over the last few years. I wear a lot of hats - Developer, Database Administrator, Help Desk, etc., so I know a lot of things but not a lot about one thing. Experts Exchange gives me answers from people who do know a lot about one thing, in a easy to use platform." -Todd S.

seanpowellCommented:
Here's a couple of good links for you:

A list of the character sets that IE can use:
http://msdn.microsoft.com/library/default.asp?url=/workshop/author/dhtml/reference/charsets/charset4.asp#charsets4

An explanation (in traditional W3C style):
http://www.w3.org/TR/REC-html40/charset.html

There's some specific info available for the UTF-8 standard at - you guessed it:
http://www.utf-8.com/
0

Experts Exchange Solution brought to you by

Your issues matter to us.

Facing a tech roadblock? Get the help and guidance you need from experienced professionals who care. Ask your question anytime, anywhere, with no hassle.

Start your 7-day free trial
campinasAuthor Commented:
Thank you!

I found the first link (Microsoft) most useful. However, I tried the charset label "unicode" for a UTF-16 page and it didn't work. Maybe the endianness was wrong.

Meanwhile I found this article

http://www-106.ibm.com/developerworks/library/utfencodingforms/ 

on forms of unicode, very good albeit rather old.

I'd conclude that the safest web page encoding for me is UTF-8, for these reasons:
- it covers all languages (like any form of Unicode)
- it is optimized for Roman based languages
- most text editors have an option for a default UTF-8 file save
- Word saves rtf and htm in UTF-8
- endianness is fixed (no worry about choosing that)
- it is supported by all browsers (I think), not like UTF-16
- it is definetely the dominant form of Unicode today, so why bother with what'll be tomorrow
- although it's slower with indexing, the difference may not be significant (?) with fast processors

If I was wrong or missed something, please correct.
0
seanpowellCommented:
Jeez - sorry for the delay. You've pretty much nailed everything and absolutely UTF-8 is the best bet in your case - and nicely done.

Thanks,
GM
0
It's more than this solution.Get answers and train to solve all your tech problems - anytime, anywhere.Try it for free Edge Out The Competitionfor your dream job with proven skills and certifications.Get started today Stand Outas the employee with proven skills.Start learning today for free Move Your Career Forwardwith certification training in the latest technologies.Start your trial today
HTML

From novice to tech pro — start learning today.