ARIB STD B24 character set

Volume 1 of the Association of Radio Industries and Businesses (ARIB) STD-B24 standard for Broadcast Markup Language[2] specifies, amongst other details, a character encoding for use in Japanese-language broadcasting. It was introduced on 1999-10-26.[2] The latest revision is version 6.3 as of 2016-07-06.

ARIB STB-B24 encoding
StandardARIB STB-B24 Volume 1
ClassificationISO 2022 profile/extension
Transforms / EncodesARIB STB-B24 Kanji, Kana and mosaic sets,
JIS X 0201
ARIB STB-B24 Kanji set
Weather symbols: a few of the extended symbols included.
Language(s)Japanese, English, Russian
Partial support: Greek, Chinese
StandardARIB STB-B24 Volume 1
ClassificationISO-2022-structured CJK DBCS
ExtendsJIS X 0208
Encoding formats
  • ARIB STB-B24 encoding (ISO 2022 based)
  • Shift JIS (ARIB variant)[1]

It includes a number of ARIB extended characters (ARIB外字, ARIB gaiji) not found in the base standards (JIS X 0208 and JIS X 0201). It was the source standard for many symbol characters which were added to Unicode, including portions of the Miscellaneous Symbols, Enclosed Alphanumeric Supplement and Enclosed Ideographic Supplement blocks.[3] Its contributions partially overlap the Unicode emoji, but were added a year earlier, in Unicode 5.2.[4]

The ARIB STD-B62 standard, published in 2014, defines Unicode mappings for a selection of the B24 extended characters (excluding, for example, those duplicated by JIS X 0213), as well as a few extended Kanji.[5] It also includes a mapping of utilised characters outside the Basic Multilingual Plane to the BMP's private use area.

Sets and codes

The ARIB STD B24 standard defines multiple character sets and a method of switching between them. These include a Kanji set (an extension of JIS X 0208), an Alphanumeric set, a Hiragana set, Katakana sets of two distinct layouts and four mosaic sets.[6] The sets are selected using ISO 2022 mechanisms for 94-sets, using the following codes (proportional sets use the same layout as the corresponding non-proportional ones):[7]

SetTypeCode (column/line)Code (hexadecimal)Code (ASCII character)Comments
Kanji2-byte4/242BThe escape code B used for the ARIB Kanji set[7] is used for the 1983 version of JIS C 6226 (JIS X 0208, of which the ARIB Kanji set is an extension) in ISO-2022-JP.[8][9]
Alphanumeric1-byte4/104AJJIS_C6220-ro (ISO646-JP, JIS X 0201 Roman set). Similar to ASCII, with two assignments differing. Escape code J matches usage in ISO-2022-JP.[9]
Proportional alphanumeric1-byte3/6366
Hiragana1-byte3/0300Hiragana themselves follow the same layout as row 4 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Hiragana1-byte3/7377
Katakana1-byte3/1311Katakana themselves follow the same layout as row 5 of JIS X 0208, but without a lead byte. Also adds several additional assignments for punctuation.
Proportional Katakana1-byte3/8388
JIS X 0201 Katakana1-byte4/949IJIS_C6220-jp (JIS X 0201 Kana set). Escape code matches usage in ISO-2022-JP-3.
Mosaic A1-byte3/2322Pseudographics
Mosaic B1-byte3/3333
Mosaic C1-byte3/4344Non-spacing pseudographics
Mosaic D1-byte3/5355

Code charts

Kanji (double-byte) set

This is a double-byte character set extending JIS X 0208.

Lead byte

The encoding bytes correspond to the row or cell number plus 0x20, or 32 in decimal (see below). Hence, the code set starting with 0x21 has a row number of 1, and its cell 1 has a continuation byte of 0x21 (or 33), and so forth. Most of the code corresponds to JIS X 0208, exceptions are shown with a heavy border.

ARIB STD-B24 Kanji (double-byte) set (lead bytes)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_ SP
0020
 
Punct.
LEAD
1-_
Symbol
LEAD
2-_
Alnum.
LEAD
3-_
Hira.
LEAD
4-_
Kata.
LEAD
5-_
Greek
LEAD
6-_
Cyrillic
LEAD
7-_
Box
LEAD
8-_
 
 
9-_
 
 
10-_
 
 
11-_
 
 
12-_
 
 
13-_
 
 
14-_
 
 
15-_
3_ Kanji L1
LEAD
16-_
Kanji L1
LEAD
17-_
Kanji L1
LEAD
18-_
Kanji L1
LEAD
19-_
Kanji L1
LEAD
20-_
Kanji L1
LEAD
21-_
Kanji L1
LEAD
22-_
Kanji L1
LEAD
23-_
Kanji L1
LEAD
24-_
Kanji L1
LEAD
25-_
Kanji L1
LEAD
26-_
Kanji L1
LEAD
27-_
Kanji L1
LEAD
28-_
Kanji L1
LEAD
29-_
Kanji L1
LEAD
30-_
Kanji L1
LEAD
31-_
4_ Kanji L1
LEAD
32-_
Kanji L1
LEAD
33-_
Kanji L1
LEAD
34-_
Kanji L1
LEAD
35-_
Kanji L1
LEAD
36-_
Kanji L1
LEAD
37-_
Kanji L1
LEAD
38-_
Kanji L1
LEAD
39-_
Kanji L1
LEAD
40-_
Kanji L1
LEAD
41-_
Kanji L1
LEAD
42-_
Kanji L1
LEAD
43-_
Kanji L1
LEAD
44-_
Kanji L1
LEAD
45-_
Kanji L1
LEAD
46-_
Kanji L1
LEAD
47-_
5_ Kanji L2
LEAD
48-_
Kanji L2
LEAD
49-_
Kanji L2
LEAD
50-_
Kanji L2
LEAD
51-_
Kanji L2
LEAD
52-_
Kanji L2
LEAD
53-_
Kanji L2
LEAD
54-_
Kanji L2
LEAD
55-_
Kanji L2
LEAD
56-_
Kanji L2
LEAD
57-_
Kanji L2
LEAD
58-_
Kanji L2
LEAD
59-_
Kanji L2
LEAD
60-_
Kanji L2
LEAD
61-_
Kanji L2
LEAD
62-_
Kanji L2
LEAD
63-_
6_ Kanji L2
LEAD
64-_
Kanji L2
LEAD
65-_
Kanji L2
LEAD
66-_
Kanji L2
LEAD
67-_
Kanji L2
LEAD
68-_
Kanji L2
LEAD
69-_
Kanji L2
LEAD
70-_
Kanji L2
LEAD
71-_
Kanji L2
LEAD
72-_
Kanji L2
LEAD
73-_
Kanji L2
LEAD
74-_
Kanji L2
LEAD
75-_
Kanji L2
LEAD
76-_
Kanji L2
LEAD
77-_
Kanji L2
LEAD
78-_
Kanji L2
LEAD
79-_
7_ Kanji L2
LEAD
80-_
Kanji L2
LEAD
81-_
Kanji L2
LEAD
82-_
Kanji L2
LEAD
83-_
Kanji L2
LEAD
84-_
 
 
85-_
 
 
86-_
 
 
87-_
 
 
88-_
 
 
89-_
Traffic
LEAD
90-_
Map
LEAD
91-_
Misc.
LEAD
92-_
Misc.
LEAD
93-_
List
LEAD
94-_
DEL
007F
 

Character sets 0x21-0x74 (row numbers 1-84: punctuation, alphabets, numbers, Kana, Kanji)

Character set 0x7A (row number 90, traffic symbols)

Characters 90-45 through 90-63 and 90-66 through 90-84 (shown below with a heavy border) are listed in the B24 standard only in table 7-10 (the list of extension characters), and are also the only characters in rows 90 through 91 which are not transport-related symbols; this is noted in the B24 standard in an endnote to table 7-10.[10] The remainder of the extensions are listed in both table 7-4 (the double-byte code chart) and table 7-10.[10]

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7A)[5][11]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
26CC
90-1

26CD
90-2

2757
90-3

26CF
90-4

26D0
90-5

26D1
90-6

 
90-7

26D2
90-8

26D5
90-9

26D3
90-10

26D4
90-11

 
90-12

 
90-13

 
90-14

 
90-15
3_ 🅿
1F17F
90-16
🆊
1F18A
90-17

 
90-18

 
90-19

26D6
90-20

26D7
90-21

26D8
90-22

26D9
90-23

26DA
90-24

26DB
90-25

26DC
90-26

26DD
90-27

26DE
90-28

26DF
90-29

26E0
90-30

26E1
90-31
4_
2B55
90-32

3248
90-33

3249
90-34

324A
90-35

324B
90-36

324C
90-37

324D
90-38

324E
90-39

324F
90-40

 
90-41

 
90-42

 
90-43

 
90-44

2491
90-45

2492
90-46

2493
90-47
5_ 🅊
1F14A
90-48
🅌
1F14C
90-49
🄿
1F13F
90-50
🅆
1F146
90-51
🅋
1F14B
90-52
🈐
1F210
90-53
🈑
1F211
90-54
🈒
1F212
90-55
🈓
1F213
90-56
🅂
1F142
90-57
🈔
1F214
90-58
🈕
1F215
90-59
🈖
1F216
90-60
🅍
1F14D
90-61
🄱
1F131
90-62
🄽
1F13D
90-63
6_
2B1B
90-64

2B24
90-65
🈗
1F217
90-66
🈘
1F218
90-67
🈙
1F219
90-68
🈚
1F21A
90-69
🈛
1F21B
90-70

26BF
90-71
🈜
1F21C
90-72
🈝
1F21D
90-73
🈞
1F21E
90-74
🈟
1F21F
90-75
🈠
1F220
90-76
🈡
1F221
90-77
🈢
1F222
90-78
🈣
1F223
90-79
7_ 🈤
1F224
90-80
🈥
1F225
90-81
🅎
1F14E
90-82

3299
90-83
🈀
1F200
90-84

 
90-85

 
90-86

 
90-87

 
90-88

 
90-89

 
90-90

 
90-91

 
90-92

 
90-93

 
90-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x7B (row number 91, map symbols)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7B)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
26E3
91-1

2B56
91-2

2B57
91-3

2B58
91-4

2B59
91-5

2613
91-6

328B
91-7

3012
91-8

26E8
91-9

3246
91-10

3245
91-11

26E9
91-12
[lower-alpha 1]
0FD6
91-13

26EA
91-14

26EB
91-15
3_
26EC
91-16

2668
91-17

26ED
91-18

26EE
91-19

26EF
91-20

2693
91-21

2708
91-22

26F0
91-23

26F1
91-24

26F2
91-25

26F3
91-26

26F4
91-27

26F5
91-28
🅗
1F157
91-29

24B9
91-30

24C8
91-31
4_
26F6
91-32
🅟
1F15F
91-33
🆋
1F18B
91-34
🆍
1F18D
91-35
🆌
1F18C
91-36
🅹
1F179
91-37

26F7
91-38

26F8
91-39

26F9
91-40

26FA
91-41
🅻
1F17B
91-42

260E
91-43

26FB
91-44

26FC
91-45

26FD
91-46

26FE
91-47
5_ 🅼
1F17C
91-48

26FF
91-49

 
91-50

 
91-51

 
91-52

 
91-53

 
91-54

 
91-55

 
91-56

 
91-57

 
91-58

 
91-59

 
91-60

 
91-61

 
91-62

 
91-63
6_
 
91-64

 
91-65

 
91-66

 
91-67

 
91-68

 
91-69

 
91-70

 
91-71

 
91-72

 
91-73

 
91-74

 
91-75

 
91-76

 
91-77

 
91-78

 
91-79
7_
 
91-80

 
91-81

 
91-82

 
91-83

 
91-84

 
91-85

 
91-86

 
91-87

 
91-88

 
91-89

 
91-90

 
91-91

 
91-92

 
91-93

 
91-94

Character set 0x7C (row number 92, units, enclosed forms, list markers, arrows)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7C)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
27A1
92-1

2B05
92-2

2B06
92-3

2B07
92-4

2B2F
92-5

2B2E
92-6

5E74
92-7

6708
92-8

65E5
92-9

5186
92-10

33A1
92-11

33A5
92-12

339D
92-13

33A0
92-14

33A4
92-15
3_ 🄀
1F100
92-16

2488
92-17

2489
92-18

248A
92-19

248B
92-20

248C
92-21

248D
92-22

248E
92-23

248F
92-24

2490
92-25
[lower-alpha 2]
 
92-26
[lower-alpha 2]
 
92-27
[lower-alpha 2]
 
92-28
[lower-alpha 2]
 
92-29
[lower-alpha 2]
 
92-30
[lower-alpha 2]
 
92-31
4_ 🄁
1F101
92-32
🄂
1F102
92-33
🄃
1F103
92-34
🄄
1F104
92-35
🄅
1F105
92-36
🄆
1F106
92-37
🄇
1F107
92-38
🄈
1F108
92-39
🄉
1F109
92-40
🄊
1F10A
92-41

3233
92-42

3236
92-43

3232
92-44

3231
92-45

3239
92-46

3244
92-47
5_
25B6
92-48

25C0
92-49

3016
92-50

3017
92-51

27D0
92-52
²
00B2
92-53
³
00B3
92-54
🄭
1F12D
92-55
(vn)[lower-alpha 3]
 
92-56
(ob)[lower-alpha 3]
 
92-57
(cb)[lower-alpha 3]
 
92-58
(ce[lower-alpha 3]
 
92-59
mb)[lower-alpha 3]
 
92-60
(hp)[lower-alpha 3]
 
92-61
(br)[lower-alpha 3]
 
92-62
(p)[lower-alpha 3]
 
92-63
6_ (s)[lower-alpha 3]
 
92-64
(ms)[lower-alpha 3]
 
92-65
(t)[lower-alpha 3]
 
92-66
(bs)[lower-alpha 3]
 
92-67
(b)[lower-alpha 3]
 
92-68
(tb)[lower-alpha 3]
 
92-69
(tp)[lower-alpha 3]
 
92-70
(ds)[lower-alpha 3]
 
92-71
(ag)[lower-alpha 3]
 
92-72
(eg)[lower-alpha 3]
 
92-73
(vo)[lower-alpha 3]
 
92-74
(fl)[lower-alpha 3]
 
92-75
(ke[lower-alpha 3]
 
92-76
y)[lower-alpha 3]
 
92-77
(sa[lower-alpha 3]
 
92-78
x)[lower-alpha 3]
 
92-79
7_ (sy[lower-alpha 3]
 
92-80
n)[lower-alpha 3]
 
92-81
(or[lower-alpha 3]
 
92-82
g)[lower-alpha 3]
 
92-83
(pe[lower-alpha 3]
 
92-84
r)[lower-alpha 3]
 
92-85
🄬
1F12C
92-86
🄫
1F12B
92-87

3247
92-88
🆐
1F190
92-89
🈦
1F226
92-90

213B
92-91

 
92-92

 
92-93

 
92-94

Character set 0x7D (row number 93, game and weather symbols, fractions, units, enclosed forms)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7D)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
322A
93-1

322B
93-2

322C
93-3

322D
93-4

322E
93-5

322F
93-6

3230
93-7

3237
93-8

337E
93-9

337D
93-10

337C
93-11

337B
93-12

2116
93-13

2121
93-14

3036
93-15
3_
26BE
93-16
🉀
1F240
93-17
🉁
1F241
93-18
🉂
1F242
93-19
🉃
1F243
93-20
🉄
1F244
93-21
🉅
1F245
93-22
🉆
1F246
93-23
🉇
1F247
93-24
🉈
1F248
93-25
🄪
1F12A
93-26
🈧
1F227
93-27
🈨
1F228
93-28
🈩
1F229
93-29
🈔
1F214
93-30
🈪
1F22A
93-31
4_ 🈫
1F22B
93-32
🈬
1F22C
93-33
🈭
1F22D
93-34
🈮
1F22E
93-35
🈯
1F22F
93-36
🈰
1F230
93-37
🈱
1F231
93-38

2113
93-39

338F
93-40

3390
93-41

33CA
93-42

339E
93-43

33A2
93-44

3371
93-45

 
93-46

 
93-47
5_ ½
00BD
93-48

2189
93-49

2153
93-50

2154
93-51
¼
00BC
93-52
¾
00BE
93-53

2155
93-54

2156
93-55

2157
93-56

2158
93-57

2159
93-58

215A
93-59

2150
93-60

215B
93-61

2151
93-62

2152
93-63
6_
2600
93-64

2601
93-65

2602
93-66

26C4
93-67

2616
93-68

2617
93-69

26C9
93-70

26CA
93-71

2666
93-72

2665
93-73

2663
93-74

2660
93-75

26CB
93-76

2A00
93-77

203C
93-78

2049
93-79
7_
26C5
93-80

2614
93-81

26C6
93-82

2603
93-83

26C7
93-84

26A1
93-85

26C8
93-86

 
93-87

269E
93-88

269F
93-89

266C
93-90

260E
93-91

 
93-92

 
93-93

 
93-94

Character set 0x7E (row number 94, list markers)

Characters from ARIB STD-B24 which were not retained in ARIB STD-B62 are shown shaded.

ARIB STD-B24 Kanji (double-byte) set (prefixed with 0x7E)[5][11][12]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
2160
94-1

2161
94-2

2162
94-3

2163
94-4

2164
94-5

2165
94-6

2166
94-7

2167
94-8

2168
94-9

2169
94-10

216A
94-11

216B
94-12

2470
94-13

2471
94-14

2472
94-15
3_
2473
94-16

2474
94-17

2475
94-18

2476
94-19

2477
94-20

2478
94-21

2479
94-22

247A
94-23

247B
94-24

247C
94-25

247D
94-26

247E
94-27

247F
94-28

3251
94-29

3252
94-30

3253
94-31
4_
3254
94-32
🄐
1F110
94-33
🄑
1F111
94-34
🄒
1F112
94-35
🄓
1F113
94-36
🄔
1F114
94-37
🄕
1F115
94-38
🄖
1F116
94-39
🄗
1F117
94-40
🄘
1F118
94-41
🄙
1F119
94-42
🄚
1F11A
94-43
🄛
1F11B
94-44
🄜
1F11C
94-45
🄝
1F11D
94-46
🄞
1F11E
94-47
5_ 🄟
1F11F
94-48
🄠
1F120
94-49
🄡
1F121
94-50
🄢
1F122
94-51
🄣
1F123
94-52
🄤
1F124
94-53
🄥
1F125
94-54
🄦
1F126
94-55
🄧
1F127
94-56
🄨
1F128
94-57
🄩
1F129
94-58

3255
94-59

3256
94-60

3257
94-61

3258
94-62

3259
94-63
6_
325A
94-64

2460
94-65

2461
94-66

2462
94-67

2463
94-68

2464
94-69

2465
94-70

2466
94-71

2467
94-72

2468
94-73

2469
94-74

246A
94-75

246B
94-76

246C
94-77

246D
94-78

246E
94-79
7_
246F
94-80

2776
94-81

2777
94-82

2778
94-83

2779
94-84

277A
94-85

277B
94-86

277C
94-87

277D
94-88

277E
94-89

277F
94-90

24EB
94-91

24EC
94-92

325B
94-93

 
94-94

Single-byte sets

Alphanumeric set

Differences from US-ASCII are shown with a heavy border.

ARIB STD-B24 Alphanumeric set[13]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
32

 
!
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3_
48
0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4_
64
@
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5_
80
P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
¥
00A5
]
005D
^
005E
_
005F
6_
96
`
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7_
112
p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D

203E

 

Hiragana set

Character allocations not following row 4 of JIS X 0208 are shown with a heavy border.

ARIB STD-B24 Hiragana set[14]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
32

 

3041

3042

3043

3044

3045

3046

3047

3048

3049

304A

304B

304C

304D

304E

304F
3_
48

3050

3051

3052

3053

3054

3055

3056

3057

3058

3059

305A

305B

305C

305D

305E

305F
4_
64

3060

3061

3062

3063

3064

3065

3066

3067

3068

3069

306A

306B

306C

306D

306E

306F
5_
80

3070

3071

3072

3073

3074

3075

3076

3077

3078

3079

307A

307B

307C

307D

307E

307F
6_
96

3080

3081

3082

3083

3084

3085

3086

3087

3088

3089

308A

308B

308C

308D

308E

308F
7_
112

3090

3091

3092

3093

 

 

 

309D

309E

30FC

3002

300C

300D

3001

30FB

 

Katakana set

Character allocations not following row 5 of JIS X 0208 are shown with a heavy border.

ARIB STD-B24 Katakana set[15]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
32

 

30A1

30A2

30A3

30A4

30A5

30A6

30A7

30A8

30A9

30AA

30AB

30AC

30AD

30AE

30AF
3_
48

30B0

30B1

30B2

30B3

30B4

30B5

30B6

30B7

30B8

30B9

30BA

30BB

30BC

30BD

30BE

30BF
4_
64

30C0

30C1

30C2

30C3

30C4

30C5

30C6

30C7

30C8

30C9

30CA

30CB

30CC

30CD

30CE

30CF
5_
80

30D0

30D1

30D2

30D3

30D4

30D5

30D6

30D7

30D8

30D9

30DA

30DB

30DC

30DD

30DE

30DF
6_
96

30E0

30E1

30E2

30E3

30E4

30E5

30E6

30E7

30E8

30E9

30EA

30EB

30EC

30ED

30EE

30EF
7_
112

30F0

30F1

30F2

30F3

30F4

30F5

30F6

30FD

30FE

30FC

3002

300C

300D

3001

30FB

 

JIS X 0201 Katakana set

ARIB STD-B24 JIS X 0201 Katakana set[16]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_

 

FF61

FF62

FF63

FF64

FF65

FF66

FF67

FF68

FF69

FF6A

FF6B

FF6C

FF6D

FF6E

FF6F
3_
48

FF70

FF71

FF72

FF73

FF74

FF75

FF76

FF77

FF78

FF79

FF7A

FF7B

FF7C

FF7D

FF7E
ソ
FF7F
4_
64

FF80

FF81

FF82

FF83

FF84

FF85

FF86

FF87

FF88

FF89

FF8A

FF8B

FF8C

FF8D

FF8E

FF8F
5_
80

FF90

FF91

FF92

FF93

FF94

FF95

FF96

FF97

FF98

FF99

FF9A

FF9B

FF9C

FF9D

FF9E

FF9F
6_
96
7_
112

Mosaic sets

Shift_JIS variant

In addition to the modified ISO 2022 encoding, the B24 standard also specifies a Shift JIS encoding following JIS X 0208:1997, but with the addition of the extended characters in the kanji set.[1]

First byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2 ! " # $ % & ' ( ) * + , - . /
3 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4 @ A B C D E F G H I J K L M N O
5 P Q R S T U V W X Y Z [ ¥ ] ^ _
6 ` a b c d e f g h i j k l m n o
7 p q r s t u v w x y z { | }
8
9
A
B ソ
C
D
E
F
Second byte
0 1 2 3 4 5 6 7 8 9 A B C D E F
0
1
2
3
4
5
6
7
8
9
A
B
C
D
E
F
 
Non printable ASCII character
Unaltered ASCII character
Modified ASCII character
Single-byte half-width katakana
First byte of a double-byte character, used by JIS X 0208
First byte of an ARIB extended character
Not used as first byte, unallocated space in JIS X 0208
Not used as first byte
Second byte of a double-byte character whose first half of the JIS sequence was odd
Second byte of a double-byte character whose first half of the JIS sequence was even
Unused as second byte of a double-byte character
gollark: Well, pjals is 5, you see.
gollark: what happened?!?!?!
gollark: discord store more like **potatOS for desktop**
gollark: *is
gollark: <@151391317740486657> dumb.

See also

Footnotes

  1. Glossed as "temple" (i.e. Buddhist temple) in B24 table 7-10 (the list of extension characters).
  2. Small form (70% size per code chart / table 7-10) of a kanji character. Shown here simulated.
  3. Musical abbreviation (or half thereof) not present in Unicode, simulated here with multiple characters.

References

  1. ARIB (2008), p. 105, part 2, section 7.3
  2. ARIB (2008)
  3. Suignard, Michel (2008-03-11). "ISO/IEC JTC1/SC2/WG2 N 3397: Japanese TV Symbols" (PDF).
  4. "Unicode 5.2 Emoji List". Emojipedia.
  5. ARIB (2014), pp. 3350, part 2, Table 5-2
  6. ARIB (2008), pp. 48-52
  7. ARIB (2008), p. 39, part 2, Table 7-3
  8. "ISO-IR-087" (PDF). Information Technology Standards Commission of Japan (IPSJ/ITSCJ).
  9. RFC 1468 (IETF)
  10. ARIB (2008), p. 72
  11. ARIB (2008), pp. 54-72, part 2, Table 7-10
  12. ARIB (2008), pp. 46-47, part 2, Table 7-4
  13. ARIB (2008), p. 48, part 2, Table 7-5
  14. ARIB (2008), p. 50, part 2, Table 7-7
  15. ARIB (2008), p. 49, part 2, Table 7-6
  16. ARIB (2008), p. 52, part 2, Table 7-9

Further reading

This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.