Questions tagged [charset]

Character sets are used to represent a repertoire of characters by some kind of encoding system, such as US-ASCII, ISO-8859, or UTF-8.

Character encoding is used to represent a repertoire of characters by some kind of encoding system. Depending on the abstraction level and context, corresponding code points and the resulting code space may be regarded as bit patterns, octets, natural numbers, electrical pulses, etc. A character encoding is used in computation, data storage, and transmission of textual data. "Character set", "character map", "codeset" and "code page" are related, but not identical, terms.

Common character encodings include

  • ASCII
  • EBCDIC
  • ISO 8859
  • Windows-1250
  • Guobiao
  • Big5
  • Unicode

Source: https://en.wikipedia.org/wiki/Character_encoding

Related tags

67 questions
50
votes
4 answers

How to enable correct charset HTTP-header in NGINX

What is the right way to enable correct charset headers in NGINX? I'm analyzing my website with Google Page Speed. It says that I should specify the charset of HTML files in HTTP-headers. What is the right way to do this? I already tried to set…
pvorb
  • 980
  • 3
  • 10
  • 15
16
votes
2 answers

Command to create MySQL database with Character set UTF-8

I use create database dbname; to create database. but I want it to created with Character set UTF-8 Anyone know what is the command to use?
Komputer
16
votes
5 answers

How do I change my locale to UTF-8 in CentOS?

Here's the output of…
locale
  • 373
  • 2
  • 4
  • 10
8
votes
5 answers

Best way to make sure a MySQL database is fully in UTF8

After some problems with UTF8 and none-UTF8 strings, we're standardising on UTF8. One thing I need to do is check that everything is in UTF8 in the MySQL database? What do I need to check? Server default characterset Default character set of each…
Amandasaurus
  • 30,211
  • 62
  • 184
  • 246
7
votes
1 answer

autoindex list UTF-8 charset in Nginx

My nginx autoindex page does not display UTF-8 characters correctly, utf-8 problem I have set the charset utf-8; in my server block config section of nginx.conf file but that doesn't seem to fix the problem.
Dara Ardalan
  • 81
  • 1
  • 6
7
votes
5 answers

Are there any disadvantages of using UTF8 in an oracle database?

We are installing ordering a configured oracle database and they are asking us what character encoding we would like to have. The application (in Java) is in English only but users are from different parts of the world. Are there any motivations…
user22463
6
votes
2 answers

How do I change the default character encoding in IIS on Windows Server 2003?

When validating my web page using the W3C Validator, I noticed that although I specify ISO-8859-1 encoding in my HTML, the web server seems to be specifying UTF-8 encoding in the HTTP headers. I've looked everywhere but I can't seem to find a way to…
Mark Bell
  • 308
  • 2
  • 5
  • 18
5
votes
2 answers

Mount unix samba 4 share to osx client without mangled file names

I have a unix server (arch linux) with samba 4.1.12. The share has files inside it with utf-8 nfc encoding (standard utf-8). When I mount this samba share into an osx client (10.9.5), files with special names like File with "quotes" are displayed…
Markus
  • 151
  • 2
  • 10
5
votes
2 answers

Is there a difference between en_US.utf8 and en_US.UTF-8?

Server info (DNS and IPs removed): cat /proc/version && uname -a && java -version Linux version 2.6.16.33-xenU (*************) (gcc version 4.1.1 20070105 (Red Hat 4.1.1-52)) #2 SMP Wed Aug 15 17:27:36 SAST 2007 Linux *************…
Matthew Herbst
  • 157
  • 1
  • 7
5
votes
1 answer

Is there a legitimate reason to use an encoded word in any header other than the subject?

Encoded words allow non-US-ASCII characters to be encoded as US-ASCII and thus placed into email headers. http://en.wikipedia.org/wiki/MIME#Encoded-Word Encoded words are often used in the Subject header, especially when the Subject is in another…
james.garriss
  • 360
  • 6
  • 17
5
votes
3 answers

How to properly backup mediawiki database (mysql) without messing up the data?

I want to backup a mediawiki database stored in a MySQL server 5.1.36 using mysqldump. Most of the wiki articles are written in spanish and a don't want to mess up with it by creating the dump with the wrong character set. mysql>…
Toto
  • 401
  • 1
  • 5
  • 13
4
votes
1 answer

Is there a way to "replay" a packet capture of a smtp session for the purpose of debugging?

I'm troubleshooting a smtp issue involving character-set encoding and it's extremely difficult to involve the end user. I have a packet capture of an incoming smtp session that results in problematic behavior but if I try to copy/paste the mime…
Mike B
  • 11,570
  • 42
  • 106
  • 165
4
votes
1 answer

Broken characters in filenames only in some directories

We have a web server running CentOS 5.8 that uses SVN for version control. When trying to switch to the latest revision, we got an error about the filenames of files in an upload directory: svn: Error converting entry in directory…
4
votes
3 answers

Set character_set_results UTF8 in MySQL my.cnf

how can I set the Variable character_set_results from latin1 to uft8? I thought it would be enough to add the following variable in my.cnf: default-character-set=utf8 But it not seem so: mysql> SHOW VARIABLES LIKE…
Marc
  • 51
  • 1
  • 1
  • 3
3
votes
2 answers

UTF-8 Characters in Apache Access Log ✔

The issue I'm using PHP's apache_note() to log variables from web requests to a CustomLog format. However, try as I might, Apache doesn't want to log UTF-8 characters the way I'd like. In PHP, I have apache_note('some_value', '✔'); which corresponds…
Bill Huertas
  • 131
  • 1
  • 4
1
2 3 4 5