Default character encoding on Windows versions

0

I know that in Western Europe and the US, CP-1252 is the default character encoding.

What is the default character encoding in other Windows versions?

In addition, I recently heard that Windows 8 uses UTF-8 as its default character encoding. Is this correct?

teddy

Posted 2014-07-18T02:28:09.373

Reputation: 103

1

The encoding is going to be appropriate for the default language. You're asking a very broad question that doesn't have a single answer. Where did you hear that Windows 8 uses UTF-8 character encoding? Unicode and Additional Information

– Ramhound – 2014-07-18T03:05:11.980

Is there anywhere that has this information, like an MSDN page? That would be enough for an answer. – teddy – 2014-07-18T03:07:20.650

Feel free to do your own searches. I had no problem finding the information myself. – Ramhound – 2014-07-18T03:08:08.940

Answers

3

Windows intrinsically uses the UTF-16 encoding. The “default encodings”, or “default codepages” in Microsoft terminology, are just legacy encodings that are used by many old programs. E.g., in Notepad, the default encoding when saving a file is “ANSI”, which (rather misleadingly) denotes the default codepage (but Notepad internally works with UTF-16).
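To make the difference concrete, here is a minimal sketch in Python that encodes one string both ways. The sample text is my own choice, and `cp1252` stands in for the "ANSI" codepage of a Western European or US system; other locales would use a different codec.

```python
# Sample text (an assumption for illustration, not from any Windows API).
text = "café €"

# What a UTF-16 based program holds in memory: two bytes per character here.
utf16 = text.encode("utf-16-le")

# What an "ANSI" save produces on a system whose default codepage is 1252:
# one byte per character.
ansi = text.encode("cp1252")

print(len(utf16), utf16)
print(len(ansi), ansi)

# A character outside the codepage cannot be represented in it at all.
try:
    "Ж".encode("cp1252")
    cyrillic_fits = True
except UnicodeEncodeError:
    cyrillic_fits = False
print("Cyrillic fits in cp1252:", cyrillic_fits)
```

The last part is why "ANSI" files are fragile: text that is fine in UTF-16 may have no representation in the default codepage.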

The system locale determines the default codepage. The page Windows Locale Codes - Sortable list shows the mapping and is probably reliable. (I was unable to find a Microsoft page on this.)
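As a rough illustration of that mapping, the sketch below hard-codes a few well-known locale-to-codepage pairs and shows that the same byte decodes differently under each. The table `ANSI_CODEPAGES` and the helpers `ansi_codec` and `decode_as` are hypothetical names of mine, and the table is far from complete. `active_codepage` calls the Win32 function `GetACP` and only does anything on Windows.

```python
import sys

# Illustrative subset only; the full mapping has many more locales.
ANSI_CODEPAGES = {
    "en-US": 1252, "de-DE": 1252, "pl-PL": 1250, "ru-RU": 1251,
    "el-GR": 1253, "tr-TR": 1254, "ja-JP": 932, "zh-CN": 936, "ko-KR": 949,
}

def ansi_codec(locale_name):
    """Return the Python codec name for a locale's default codepage."""
    return f"cp{ANSI_CODEPAGES[locale_name]}"

def decode_as(locale_name, raw):
    """Decode bytes the way a legacy program on that locale would."""
    return raw.decode(ansi_codec(locale_name))

def active_codepage():
    """Ask Windows for the active ANSI codepage; None on other platforms."""
    if sys.platform != "win32":
        return None
    import ctypes
    return ctypes.windll.kernel32.GetACP()

# The single byte 0xE9 is a different letter depending on the system locale.
for name in ("en-US", "ru-RU", "el-GR"):
    print(name, ansi_codec(name), decode_as(name, b"\xe9"))
print("active codepage:", active_codepage())
```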

In particular, web browsers tend to use the default codepage so determined as the encoding of a web page if encoding information is not available. In reality, it’s more complicated. In HTML5 LC, the section Determining the character encoding describes (and prescribes) the process, and it also contains a mapping table that largely reflects the Windows mappings from locales to codepages.
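A heavily simplified sketch of that fallback idea follows. It is not the HTML5 algorithm, which has more steps (such as prescanning for a meta element); it only shows the order "byte order mark, then declared encoding, then locale default". The function name `guess_encoding` and its parameters are hypothetical.

```python
import codecs
from typing import Optional

def guess_encoding(data: bytes, declared: Optional[str], locale_default: str) -> str:
    """Pick an encoding: BOM first, then a declared charset, then the locale default."""
    if data.startswith(codecs.BOM_UTF8):
        return "utf-8"
    if data.startswith(codecs.BOM_UTF16_LE):
        return "utf-16-le"
    if data.startswith(codecs.BOM_UTF16_BE):
        return "utf-16-be"
    if declared:
        # E.g. taken from an HTTP Content-Type header or a meta tag.
        return declared
    # Nothing else to go on: fall back to the locale's legacy codepage.
    return locale_default

# No BOM and no declaration, so a Western European system assumes cp1252.
page = b"caf\xe9"
encoding = guess_encoding(page, None, "cp1252")
print(encoding, page.decode(encoding))
```

With the same bytes and a `locale_default` of `cp1251`, the page would decode to different text, which is the classic source of mojibake on undeclared pages.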

Jukka K. Korpela

Posted 2014-07-18T02:28:09.373

Reputation: 4 475