Indian Script Code for Information Interchange

Indian Script Code for Information Interchange (ISCII) is a coding scheme for representing various writing systems of India. It encodes the main Indic scripts and a Roman transliteration. The supported scripts are: Assamese, Bengali (Bangla), Devanagari, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu. ISCII does not encode the writing systems of India based on Persian, but its writing system switching codes nonetheless provide for Kashmiri, Sindhi, Urdu, Persian, Pashto and Arabic. The Persian-based writing systems were subsequently encoded in the PASCII encoding.

ISCII has not been widely used outside certain government institutions and has now been rendered largely obsolete by Unicode. Unicode uses a separate block for each Indic writing system, and largely preserves the ISCII layout within each block.

Background

The Brahmi-derived writing systems share a similar structure, so ISCII encodes letters with the same phonetic value at the same code point, overlaying the various scripts. For example, the ISCII codes 0xB3 0xDB represent [ki]. This is rendered as കി in Malayalam, कि in Devanagari, ਕਿ in Gurmukhi, and கி in Tamil. The writing system can be selected in rich text by markup, or in plain text by means of the ATR code described below.
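The overlay can be illustrated with a minimal Python sketch. It covers only the two bytes from the example above; the block offsets and block starting points are read off the Unicode values in the tables of this article, and the names `ISCII_TO_OFFSET`, `BLOCK_BASE` and `render` are hypothetical. A real converter would need the full table and the special code points described later.

```python
# Offset of each ISCII byte inside a Unicode Indic block.
# Only KA (0xB3) and vowel sign I (0xDB) are covered in this sketch.
ISCII_TO_OFFSET = {0xB3: 0x15, 0xDB: 0x3F}

# First code point of the Unicode block for each script.
BLOCK_BASE = {
    "Devanagari": 0x0900,
    "Gurmukhi": 0x0A00,
    "Tamil": 0x0B80,
    "Malayalam": 0x0D00,
}


def render(iscii: bytes, script: str) -> str:
    """Map ISCII bytes to Unicode characters of the chosen script."""
    base = BLOCK_BASE[script]
    return "".join(chr(base + ISCII_TO_OFFSET[b]) for b in iscii)


if __name__ == "__main__":
    # The same two bytes yield [ki] in each script.
    for script in BLOCK_BASE:
        print(script, render(b"\xb3\xdb", script))
```

The same byte pair produces a different Unicode string for each script, which is why ISCII text needs out-of-band or in-band script selection.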

One motivation for the use of a single encoding is the idea that it will allow easy transliteration from one writing system to another. However, there are enough incompatibilities between the scripts that this is not practical in general. See About ISCII.

ISCII is an 8-bit encoding. The lower 128 code points are plain ASCII; the upper 128 code points are ISCII-specific. In addition to the code points representing characters, ISCII uses a code point with the mnemonic ATR, which indicates that the following byte contains one of two kinds of information. One set of values changes the writing system until the next writing system indicator or the end of the line. Another set of values selects display modes such as bold and italic. ISCII does not provide a means of indicating the default writing system.
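As a rough sketch of how a reader of ISCII plain text might handle ATR, the following Python function splits one line into runs, each tagged with the attribute byte that introduced it. The ATR code point (0xEF) comes from the code chart below; the function name `split_atr_runs` is hypothetical, and the attribute value 0x42 in the usage example is only a placeholder, since this article does not list the attribute values. The sketch does not distinguish script switches from display modes.

```python
ATR = 0xEF  # attribute code point, per the code chart


def split_atr_runs(line: bytes):
    """Split one line of ISCII bytes into (attribute_byte, payload) runs.

    attribute_byte is None for text that precedes the first ATR sequence.
    A trailing ATR with no following byte is kept as an ordinary byte.
    """
    runs = []
    attr = None
    payload = bytearray()
    i = 0
    while i < len(line):
        b = line[i]
        if b == ATR and i + 1 < len(line):
            # Close the current run before starting a new one.
            if payload or attr is not None:
                runs.append((attr, bytes(payload)))
            attr = line[i + 1]
            payload = bytearray()
            i += 2
        else:
            payload.append(b)
            i += 1
    if payload or attr is not None:
        runs.append((attr, bytes(payload)))
    return runs


if __name__ == "__main__":
    # 0x42 is a placeholder attribute value for illustration only.
    print(split_atr_runs(b"ab\xef\x42\xb3\xdb"))
```

Because an ATR setting lasts only until the end of the line, a caller would apply the function to each line separately, for example to the pieces returned by `data.split(b"\n")`.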

Codepage layout

The following table shows the character set for Devanagari. The code sets for Assamese, Bengali, Gujarati, Gurmukhi, Kannada, Malayalam, Oriya, Tamil, and Telugu are similar, with each Devanagari form replaced by the equivalent form in each writing system. Each row is labelled with its high hexadecimal digit and the decimal code of its first cell, and each character is followed by its Unicode equivalent.

ISCII Devanagari
	_0	_1	_2	_3	_4	_5	_6	_7	_8	_9	_A	_B	_C	_D	_E	_F
0_ 0	NUL 0000	SOH 0001	STX 0002	ETX 0003	EOT 0004	ENQ 0005	ACK 0006	BEL 0007	BS 0008	HT 0009	LF 000A	VT 000B	FF 000C	CR 000D	SO 000E	SI 000F
1_ 16	DLE 0010	DC1 0011	DC2 0012	DC3 0013	DC4 0014	NAK 0015	SYN 0016	ETB 0017	CAN 0018	EM 0019	SUB 001A	ESC 001B	FS 001C	GS 001D	RS 001E	US 001F
2_ 32	SP 0020	! 0021	" 0022	# 0023	$ 0024	% 0025	& 0026	' 0027	( 0028	) 0029	* 002A	+ 002B	, 002C	- 002D	. 002E	/ 002F
3_ 48	0 0030	1 0031	2 0032	3 0033	4 0034	5 0035	6 0036	7 0037	8 0038	9 0039	: 003A	; 003B	< 003C	= 003D	> 003E	? 003F
4_ 64	@ 0040	A 0041	B 0042	C 0043	D 0044	E 0045	F 0046	G 0047	H 0048	I 0049	J 004A	K 004B	L 004C	M 004D	N 004E	O 004F
5_ 80	P 0050	Q 0051	R 0052	S 0053	T 0054	U 0055	V 0056	W 0057	X 0058	Y 0059	Z 005A	[ 005B	\ 005C	] 005D	^ 005E	_ 005F
6_ 96	` 0060	a 0061	b 0062	c 0063	d 0064	e 0065	f 0066	g 0067	h 0068	i 0069	j 006A	k 006B	l 006C	m 006D	n 006E	o 006F
7_ 112	p 0070	q 0071	r 0072	s 0073	t 0074	u 0075	v 0076	w 0077	x 0078	y 0079	z 007A	{ 007B	\| 007C	} 007D	~ 007E	DEL 007F
8_ 128
9_ 144
A_ 160		ँ 0901	ं 0902	ः 0903	अ 0905	आ 0906	इ 0907	ई 0908	उ 0909	ऊ 090A	ऋ 090B	ऎ 090E	ए 090F	ऐ 0910	ऍ 090D	ऒ 0912
B_ 176	ओ 0913	औ 0914	ऑ 0911	क 0915	ख 0916	ग 0917	घ 0918	ङ 0919	च 091A	छ 091B	ज 091C	झ 091D	ञ 091E	ट 091F	ठ 0920	ड 0921
C_ 192	ढ 0922	ण 0923	त 0924	थ 0925	द 0926	ध 0927	न 0928	ऩ 0929	प 092A	फ 092B	ब 092C	भ 092D	म 092E	य 092F	य़ 095F	र 0930
D_ 208	ऱ 0931	ल 0932	ळ 0933	ऴ 0934	व 0935	श 0936	ष 0937	स 0938	ह 0939	INV	ा 093E	ि 093F	ी 0940	ु 0941	ू 0942	ृ 0943
E_ 224	ॆ 0946	े 0947	ै 0948	ॅ 0945	ॊ 094A	ो 094B	ौ 094C	ॉ 0949	् 094D	़ 093C	। 0964					ATR
F_ 240	EXT	० 0966	१ 0967	२ 0968	३ 0969	४ 096A	५ 096B	६ 096C	७ 096D	८ 096E	९ 096F

Special code points

INV character—code point D9 (217): The INV character is used as a pseudo-consonant to display combining elements in isolation. For example, क (ka) + ् (halant) + INV = क्‍ (half ka). The Unicode equivalent is U+200D ZERO WIDTH JOINER.
ATR character—code point EF (239): The ATR character followed by a byte code is used to switch to a different font attribute (such as bold) or language (such as Bengali), up to the next ATR sequence or the end of the line. This has no direct Unicode equivalent, as font attributes are not part of Unicode, and each script has a distinct set of code points.
EXT character—code point F0 (240): The EXT character followed by a byte code indicates a Vedic accent. This has no direct Unicode equivalent, as Vedic accents are assigned to distinct code points.
Halant character ्—code point E8 (232): The halant character removes the implicit vowel from a consonant and is used between consonants to represent conjunct consonants. For example, क (ka) + ् (halant) + त (ta) = क्त (kta). The sequence ् (halant) + ् (halant) displays a conjunct with an explicit halant, for example क (ka) + ् (halant) + ् (halant) + त (ta) = क्‌त. The sequence ् (halant) + ़ (nukta) displays a conjunct with half consonants, if available, for example क (ka) + ् (halant) + ़ (nukta) + त (ta) = क्‍त.

ISCII		Unicode
single halant	`E8`	halant	`094D`
halant + halant	`E8 E8`	halant + ZWNJ	`094D 200C`
halant + nukta	`E8 E9`	halant + ZWJ	`094D 200D`
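The three halant sequences in the table can be handled with one byte of lookahead. The Python sketch below is illustrative only: the name `to_unicode` is hypothetical, and the Devanagari table contains just KA and TA so that the examples from this section can be reproduced.

```python
HALANT, NUKTA = 0xE8, 0xE9

# Tiny Devanagari table for the demo: KA and TA only.
DEVANAGARI = {0xB3: "\u0915", 0xC2: "\u0924"}


def to_unicode(data: bytes) -> str:
    """Convert ISCII bytes to Unicode, applying the halant rules above."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        nxt = data[i + 1] if i + 1 < len(data) else None
        if b == HALANT and nxt == HALANT:
            out.append("\u094d\u200c")  # halant + ZWNJ: explicit halant
            i += 2
        elif b == HALANT and nxt == NUKTA:
            out.append("\u094d\u200d")  # halant + ZWJ: half consonant
            i += 2
        elif b == HALANT:
            out.append("\u094d")        # plain halant
            i += 1
        else:
            out.append(DEVANAGARI[b])   # KeyError outside the demo table
            i += 1
    return "".join(out)


if __name__ == "__main__":
    for seq in (b"\xb3\xe8\xc2", b"\xb3\xe8\xe8\xc2", b"\xb3\xe8\xe9\xc2"):
        print(seq.hex(" "), [f"{ord(c):04X}" for c in to_unicode(seq)])
```

Checking the two-byte sequences before the single halant matters: testing for a lone halant first would consume the first byte and leave the second to be misread.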

Nukta character ़—code point E9 (233): The nukta character after another ISCII character is used for a number of rarer characters which don't exist in the main ISCII set. For example क (ka) + ़ (nukta) = क़ (qa). These characters have precomposed forms in Unicode, as shown in the following table.

ISCII code point	Original character	Character with nukta	Unicode code point
A1 (161)	ँ	ॐ	0950
A6 (166)	इ	ऌ	090C
A7 (167)	ई	ॡ	0961
AA (170)	ऋ	ॠ	0960
B3 (179)	क	क़	0958
B4 (180)	ख	ख़	0959
B5 (181)	ग	ग़	095A
BA (186)	ज	ज़	095B
BF (191)	ड	ड़	095C
C0 (192)	ढ	ढ़	095D
C9 (201)	फ	फ़	095E
DB (219)	ि	ॢ	0962
DC (220)	ी	ॣ	0963
DF (223)	ृ	ॄ	0944
EA (234)	।	ऽ	093D
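A converter can apply this table by looking one byte ahead for the nukta. The following Python sketch uses the Devanagari values from the table above and from the code chart; the names `WITH_NUKTA`, `PLAIN` and `decode` are hypothetical, and only the fifteen base bytes from the table are covered. It does not handle the halant + nukta sequence described earlier.

```python
NUKTA = 0xE9

# ISCII byte followed by nukta -> precomposed Unicode code point.
WITH_NUKTA = {
    0xA1: 0x0950, 0xA6: 0x090C, 0xA7: 0x0961, 0xAA: 0x0960,
    0xB3: 0x0958, 0xB4: 0x0959, 0xB5: 0x095A, 0xBA: 0x095B,
    0xBF: 0x095C, 0xC0: 0x095D, 0xC9: 0x095E, 0xDB: 0x0962,
    0xDC: 0x0963, 0xDF: 0x0944, 0xEA: 0x093D,
}

# The same bytes without nukta, from the Devanagari code chart.
PLAIN = {
    0xA1: 0x0901, 0xA6: 0x0907, 0xA7: 0x0908, 0xAA: 0x090B,
    0xB3: 0x0915, 0xB4: 0x0916, 0xB5: 0x0917, 0xBA: 0x091C,
    0xBF: 0x0921, 0xC0: 0x0922, 0xC9: 0x092B, 0xDB: 0x093F,
    0xDC: 0x0940, 0xDF: 0x0943, 0xEA: 0x0964,
}


def decode(data: bytes) -> str:
    """Decode ISCII Devanagari bytes, folding base + nukta into one character."""
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if i + 1 < len(data) and data[i + 1] == NUKTA and b in WITH_NUKTA:
            out.append(chr(WITH_NUKTA[b]))
            i += 2
        else:
            out.append(chr(PLAIN[b]))  # KeyError for bytes outside this demo
            i += 1
    return "".join(out)


if __name__ == "__main__":
    print(f"{ord(decode(bytes([0xB3, 0xE9]))):04X}")  # KA + nukta
```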

Code pages for ISCII conversion

To convert between Unicode (for example, UTF-8) and ISCII, the following Windows code pages may be used, one per script:

57002: Devanagari (Hindi, Marathi, Sanskrit, Konkani)
57003: Bengali
57004: Tamil
57005: Telugu
57006: Assamese
57007: Odia
57008: Kannada
57009: Malayalam
57010: Gujarati
57011: Punjabi (Gurmukhi)

Code points for all languages

Code-set for all abugidas using ISCII

Hex	Official Listing	ISO 15919	Devanagari		Bengali		Gurmukhi		Gujarati		Oriya		Tamil		Telugu		Kannada		Malayalam
A0	Sign OM		ॐ	0950					ૐ	0AD0
A1	Vowel-modifier CHANDRABINDU		ँ	0901	ঁ	0981	ਁ	0A01	ઁ	0A81	ଁ	0B01			ఁ	0C01
A2	Vowel-modifier ANUSWARAM	ṁ	ं	0902	ং	0982	ਂ	0A02	ં	0A82	ଂ	0B02	ஂ	0B82	ం	0C02	ಂ	0C82	ം	0D02
A3	Vowel-modifier VISARGAM	ḥ	ः	0903	ঃ	0983	ਃ	0A03	ઃ	0A83	ଃ	0B03	ஃ	0B83	ః	0C03	ಃ	0C83	ഃ	0D03
A4	Vowel A	a	अ	0905	অ	0985	ਅ	0A05	અ	0A85	ଅ	0B05	அ	0B85	అ	0C05	ಅ	0C85	അ	0D05
A5	Vowel AA	ā	आ	0906	আ	0986	ਆ	0A06	આ	0A86	ଆ	0B06	ஆ	0B86	ఆ	0C06	ಆ	0C86	ആ	0D06
A6	Vowel I	i	इ	0907	ই	0987	ਇ	0A07	ઇ	0A87	ଇ	0B07	இ	0B87	ఇ	0C07	ಇ	0C87	ഇ	0D07
A6*	Vowel LI (Sanskrit)	ḷ	ऌ	090C	ঌ	098C			ઌ	0A8C	ଌ	0B0C			ఌ	0C0C	ಌ	0C8C	ഌ	0D0C
A7	Vowel II	ī	ई	0908	ঈ	0988	ਈ	0A08	ઈ	0A88	ଈ	0B08	ஈ	0B88	ఈ	0C08	ಈ	0C88	ഈ	0D08
A7*	Vowel LII (Sanskrit)	ḹ	ॡ	0961	ৡ	09E1			ૡ	0AE1	ୡ	0B61			ౡ	0C61	ೡ	0CE1	ൡ	0D61
A8	Vowel U	u	उ	0909	উ	0989	ਉ	0A09	ઉ	0A89	ଉ	0B09	உ	0B89	ఉ	0C09	ಉ	0C89	ഉ	0D09
A9	Vowel UU	ū	ऊ	090A	ঊ	098A	ਊ	0A0A	ઊ	0A8A	ଊ	0B0A	ஊ	0B8A	ఊ	0C0A	ಊ	0C8A	ഊ	0D0A
AA	Vowel RI	r̥	ऋ	090B	ঋ	098B			ઋ	0A8B	ଋ	0B0B			ఋ	0C0B	ಋ	0C8B	ഋ	0D0B
AA*	Vowel RII (Sanskrit)	ṝ	ॠ	0960	ৠ	09E0			ૠ	0AE0	ୠ	0B60			ౠ	0C60	ೠ	0CE0	ൠ	0D60
AB	Vowel E (Southern Scripts)	e	ऎ	090E									எ	0B8E	ఎ	0C0E	ಎ	0C8E	എ	0D0E
AC	Vowel EY	ē	ए	090F	এ	098F	ਏ	0A0F	એ	0A8F	ଏ	0B0F	ஏ	0B8F	ఏ	0C0F	ಏ	0C8F	ഏ	0D0F
AD	Vowel AI	ai	ऐ	0910	ঐ	0990	ਐ	0A10	ઐ	0A90	ଐ	0B10	ஐ	0B90	ఐ	0C10	ಐ	0C90	ഐ	0D10
AE	Vowel AYE (Devanagari Script)	ê	ऍ	090D					ઍ	0A8D
AF	Vowel O (Southern Scripts)	o	ऒ	0912									ஒ	0B92	ఒ	0C12	ಒ	0C92	ഒ	0D12
B0	Vowel OW	ō	ओ	0913	ও	0993	ਓ	0A13	ઓ	0A93	ଓ	0B13	ஓ	0B93	ఓ	0C13	ಓ	0C93	ഓ	0D13
B1	Vowel AU	au	औ	0914	ঔ	0994	ਔ	0A14	ઔ	0A94	ଔ	0B14	ஔ	0B94	ఔ	0C14	ಔ	0C94	ഔ	0D14
B2	Vowel AWE (Devanagari Script)	ô	ऑ	0911					ઑ	0A91
B3	Consonant KA	k	क	0915	ক	0995	ਕ	0A15	ક	0A95	କ	0B15	க	0B95	క	0C15	ಕ	0C95	ക	0D15
B3*	Consonant QA (Urdu)	q	क़	0958
B4	Consonant KHA	kh	ख	0916	খ	0996	ਖ	0A16	ખ	0A96	ଖ	0B16			ఖ	0C16	ಖ	0C96	ഖ	0D16
B4*	Consonant KHHA (Urdu)	kh	ख़	0959			ਖ਼	0A59
B5	Consonant GA	g	ग	0917	গ	0997	ਗ	0A17	ગ	0A97	ଗ	0B17			గ	0C17	ಗ	0C97	ഗ	0D17
B5*	Consonant GHHA (Urdu)	ġ	ग़	095A			ਗ਼	0A5A
B6	Consonant GHA	gh	घ	0918	ঘ	0998	ਘ	0A18	ઘ	0A98	ଘ	0B18			ఘ	0C18	ಘ	0C98	ഘ	0D18
B7	Consonant NGA	ṅ	ङ	0919	ঙ	0999	ਙ	0A19	ઙ	0A99	ଙ	0B19	ங	0B99	ఙ	0C19	ಙ	0C99	ങ	0D19
B8	Consonant CHA	c	च	091A	চ	099A	ਚ	0A1A	ચ	0A9A	ଚ	0B1A	ச	0B9A	చ	0C1A	ಚ	0C9A	ച	0D1A
B9	Consonant CHHA	ch	छ	091B	ছ	099B	ਛ	0A1B	છ	0A9B	ଛ	0B1B			ఛ	0C1B	ಛ	0C9B	ഛ	0D1B
BA	Consonant JA	j	ज	091C	জ	099C	ਜ	0A1C	જ	0A9C	ଜ	0B1C	ஜ	0B9C	జ	0C1C	ಜ	0C9C	ജ	0D1C
BA*	Consonant ZA (Urdu)	z	ज़	095B			ਜ਼	0A5B
BB	Consonant JHA	jh	झ	091D	ঝ	099D	ਝ	0A1D	ઝ	0A9D	ଝ	0B1D			ఝ	0C1D	ಝ	0C9D	ഝ	0D1D
BC	Consonant JNA	ñ	ञ	091E	ঞ	099E	ਞ	0A1E	ઞ	0A9E	ଞ	0B1E	ஞ	0B9E	ఞ	0C1E	ಞ	0C9E	ഞ	0D1E
BD	Consonant Hard TA	ṭ	ट	091F	ট	099F	ਟ	0A1F	ટ	0A9F	ଟ	0B1F	ட	0B9F	ట	0C1F	ಟ	0C9F	ട	0D1F
BE	Consonant Hard THA	ṭh	ठ	0920	ঠ	09A0	ਠ	0A20	ઠ	0AA0	ଠ	0B20			ఠ	0C20	ಠ	0CA0	ഠ	0D20
BF	Consonant Hard DA	ḍ	ड	0921	ড	09A1	ਡ	0A21	ડ	0AA1	ଡ	0B21			డ	0C21	ಡ	0CA1	ഡ	0D21
BF*	Consonant Flapped DA	ṛ	ड़	095C	ড়	09DC	ੜ	0A5C			ଡ଼	0B5C
C0	Consonant Hard DHA	ḍh	ढ	0922	ঢ	09A2	ਢ	0A22	ઢ	0AA2	ଢ	0B22			ఢ	0C22	ಢ	0CA2	ഢ	0D22
C0*	Consonant Flapped DHA	ṛh	ढ़	095D	ঢ়	09DD					ଢ଼	0B5D
C1	Consonant Hard NA	ṇ	ण	0923	ণ	09A3	ਣ	0A23	ણ	0AA3	ଣ	0B23	ண	0BA3	ణ	0C23	ಣ	0CA3	ണ	0D23
C2	Consonant Soft TA	t	त	0924	ত	09A4	ਤ	0A24	ત	0AA4	ତ	0B24	த	0BA4	త	0C24	ತ	0CA4	ത	0D24
C3	Consonant Soft THA	th	थ	0925	থ	09A5	ਥ	0A25	થ	0AA5	ଥ	0B25			థ	0C25	ಥ	0CA5	ഥ	0D25
C4	Consonant Soft DA	d	द	0926	দ	09A6	ਦ	0A26	દ	0AA6	ଦ	0B26			ద	0C26	ದ	0CA6	ദ	0D26
C5	Consonant Soft DHA	dh	ध	0927	ধ	09A7	ਧ	0A27	ધ	0AA7	ଧ	0B27			ధ	0C27	ಧ	0CA7	ധ	0D27
C6	Consonant Soft NA	n	न	0928	ন	09A8	ਨ	0A28	ન	0AA8	ନ	0B28	ந	0BA8	న	0C28	ನ	0CA8	ന	0D28
C7	Consonant NA (Tamil)	ṉ	ऩ	0929									ன	0BA9
C8	Consonant PA	p	प	092A	প	09AA	ਪ	0A2A	પ	0AAA	ପ	0B2A	ப	0BAA	ప	0C2A	ಪ	0CAA	പ	0D2A
C9	Consonant PHA	ph	फ	092B	ফ	09AB	ਫ	0A2B	ફ	0AAB	ଫ	0B2B			ఫ	0C2B	ಫ	0CAB	ഫ	0D2B
C9*	Consonant FA (Urdu)	f	फ़	095E			ਫ਼	0A5E									ೞ	0CDE
CA	Consonant BA	b	ब	092C	ব	09AC	ਬ	0A2C	બ	0AAC	ବ	0B2C			బ	0C2C	ಬ	0CAC	ബ	0D2C
CB	Consonant BHA	bh	भ	092D	ভ	09AD	ਭ	0A2D	ભ	0AAD	ଭ	0B2D			భ	0C2D	ಭ	0CAD	ഭ	0D2D
CC	Consonant MA	m	म	092E	ম	09AE	ਮ	0A2E	મ	0AAE	ମ	0B2E	ம	0BAE	మ	0C2E	ಮ	0CAE	മ	0D2E
CD	Consonant YA	y	य	092F	য	09AF	ਯ	0A2F	ય	0AAF	ଯ	0B2F	ய	0BAF	య	0C2F	ಯ	0CAF	യ	0D2F
CE	Consonant JYA (Bengali, Assamese & Oriya)	ẏ	य़	095F	য়	09DF					ୟ	0B5F
CF	Consonant RA	r	र	0930	র	09B0	ਰ	0A30	ર	0AB0	ର	0B30	ர	0BB0	ర	0C30	ರ	0CB0	ര	0D30
D0	Consonant Hard RA (Southern Scripts)	ṟ	ऱ	0931									ற	0BB1	ఱ	0C31	ಱ	0CB1	റ	0D31
D1	Consonant LA	l	ल	0932	ল	09B2	ਲ	0A32	લ	0AB2	ଲ	0B32	ல	0BB2	ల	0C32	ಲ	0CB2	ല	0D32
D2	Consonant Hard LA	ḷ	ळ	0933			ਲ਼	0A33	ળ	0AB3	ଳ	0B33	ள	0BB3	ళ	0C33	ಳ	0CB3	ള	0D33
D3	Consonant ZHA (Tamil & Malayalam)	ḻ	ऴ	0934									ழ	0BB4					ഴ	0D34
D4	Consonant VA	v	व	0935	ৱ	09F1	ਵ	0A35	વ	0AB5	ଵ	0B35	வ	0BB5	వ	0C35	ವ	0CB5	വ	0D35
D5	Consonant SHA	ś	श	0936	শ	09B6	ਸ਼	0A36	શ	0AB6	ଶ	0B36	ஶ	0BB6	శ	0C36	ಶ	0CB6	ശ	0D36
D6	Consonant Hard SHA	ṣ	ष	0937	ষ	09B7			ષ	0AB7	ଷ	0B37	ஷ	0BB7	ష	0C37	ಷ	0CB7	ഷ	0D37
D7	Consonant SA	s	स	0938	স	09B8	ਸ	0A38	સ	0AB8	ସ	0B38	ஸ	0BB8	స	0C38	ಸ	0CB8	സ	0D38
D8	Consonant HA	h	ह	0939	হ	09B9	ਹ	0A39	હ	0AB9	ହ	0B39	ஹ	0BB9	హ	0C39	ಹ	0CB9	ഹ	0D39
D9	Consonant INVISIBLE
DA	Vowel Sign AA	ā	ा	093E	া	09BE	ਾ	0A3E	ા	0ABE	ା	0B3E	ா	0BBE	ా	0C3E	ಾ	0CBE	ാ	0D3E
DB	Vowel Sign I	i	ि	093F	ি	09BF	ਿ	0A3F	િ	0ABF	ି	0B3F	ி	0BBF	ి	0C3F	ಿ	0CBF	ി	0D3F
DB*	Vowel Sign LI (Sanskrit)	ḷ	ॢ	0962	ৢ	09E2			ૢ	0AE2	ୢ	0B62			ౢ	0C62	ೢ	0CE2	ൢ	0D62
DC	Vowel Sign II	ī	ी	0940	ী	09C0	ੀ	0A40	ી	0AC0	ୀ	0B40	ீ	0BC0	ీ	0C40	ೀ	0CC0	ീ	0D40
DC*	Vowel Sign LII (Sanskrit)	ḹ	ॣ	0963	ৣ	09E3			ૣ	0AE3	ୣ	0B63			ౣ	0C63	ೣ	0CE3	ൣ	0D63
DD	Vowel Sign U	u	ु	0941	ু	09C1	ੁ	0A41	ુ	0AC1	ୁ	0B41	ு	0BC1	ు	0C41	ು	0CC1	ു	0D41
DE	Vowel Sign UU	ū	ू	0942	ূ	09C2	ੂ	0A42	ૂ	0AC2	ୂ	0B42	ூ	0BC2	ూ	0C42	ೂ	0CC2	ൂ	0D42
DF	Vowel Sign RI	r̥	ृ	0943	ৃ	09C3			ૃ	0AC3	ୃ	0B43			ృ	0C43	ೃ	0CC3	ൃ	0D43
DF*	Vowel Sign RII (Sanskrit)	ṝ	ॄ	0944	ৄ	09C4			ૄ	0AC4	ୄ	0B44			ౄ	0C44	ೄ	0CC4	ൄ	0D44
E0	Vowel Sign E (Southern Scripts)	e	ॆ	0946									ெ	0BC6	ె	0C46	ೆ	0CC6	െ	0D46
E1	Vowel Sign EY	ē	े	0947	ে	09C7	ੇ	0A47	ે	0AC7	େ	0B47	ே	0BC7	ే	0C47	ೇ	0CC7	േ	0D47
E2	Vowel Sign AI	ai	ै	0948	ৈ	09C8	ੈ	0A48	ૈ	0AC8	ୈ	0B48	ை	0BC8	ై	0C48	ೈ	0CC8	ൈ	0D48
E3	Vowel Sign AYE (Devanagari Script)	ê	ॅ	0945					ૅ	0AC5
E4	Vowel Sign O (Southern Scripts)	o	ॊ	094A									ொ	0BCA	ొ	0C4A	ೊ	0CCA	ൊ	0D4A
E5	Vowel Sign OW	ō	ो	094B	ো	09CB	ੋ	0A4B	ો	0ACB	ୋ	0B4B	ோ	0BCB	ో	0C4B	ೋ	0CCB	ോ	0D4B
E6	Vowel Sign AU	au	ौ	094C	ৌ	09CC	ੌ	0A4C	ૌ	0ACC	ୌ	0B4C	ௌ	0BCC	ౌ	0C4C	ೌ	0CCC	ൌ	0D4C
E7	Vowel Sign AWE (Devanagari Script)	ô	ॉ	0949					ૉ	0AC9
E8	Vowel Omission Sign (Halant)		्	094D	্	09CD	੍	0A4D	્	0ACD	୍	0B4D	்	0BCD	్	0C4D	್	0CCD	്	0D4D
E9	Diacritic Sign (Nuktam)		़	093C	়	09BC	਼	0A3C	઼	0ABC	଼	0B3C					಼	0CBC
EA	Full Stop (Viram, Northern Scripts)		।	0964
EA*	Vowel Stress Sign AVAGRAH		ऽ	093D	ঽ	09BD			ઽ	0ABD	ଽ	0B3D			ఽ	0C3D	ಽ	0CBD	ഽ	0D3D
EB	Unused
EC	Unused
ED	Unused
EE	Unused
EF	Attribute Code
F0	Extension Code
F1	Digit 0		०	0966	০	09E6	੦	0A66	૦	0AE6	୦	0B66	௦	0BE6	౦	0C66	೦	0CE6	൦	0D66
F2	Digit 1		१	0967	১	09E7	੧	0A67	૧	0AE7	୧	0B67	௧	0BE7	౧	0C67	೧	0CE7	൧	0D67
F3	Digit 2		२	0968	২	09E8	੨	0A68	૨	0AE8	୨	0B68	௨	0BE8	౨	0C68	೨	0CE8	൨	0D68
F4	Digit 3		३	0969	৩	09E9	੩	0A69	૩	0AE9	୩	0B69	௩	0BE9	౩	0C69	೩	0CE9	൩	0D69
F5	Digit 4		४	096A	৪	09EA	੪	0A6A	૪	0AEA	୪	0B6A	௪	0BEA	౪	0C6A	೪	0CEA	൪	0D6A
F6	Digit 5		५	096B	৫	09EB	੫	0A6B	૫	0AEB	୫	0B6B	௫	0BEB	౫	0C6B	೫	0CEB	൫	0D6B
F7	Digit 6		६	096C	৬	09EC	੬	0A6C	૬	0AEC	୬	0B6C	௬	0BEC	౬	0C6C	೬	0CEC	൬	0D6C
F8	Digit 7		७	096D	৭	09ED	੭	0A6D	૭	0AED	୭	0B6D	௭	0BED	౭	0C6D	೭	0CED	൭	0D6D
F9	Digit 8		८	096E	৮	09EE	੮	0A6E	૮	0AEE	୮	0B6E	௮	0BEE	౮	0C6E	೮	0CEE	൮	0D6E
FA	Digit 9		९	096F	৯	09EF	੯	0A6F	૯	0AEF	୯	0B6F	௯	0BEF	౯	0C6F	೯	0CEF	൯	0D6F
FB	Unused
FC	Unused
FD	Unused
FE	Unused
FF	Unused

External links

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.
