Code page 895

Code page 895 (CCSID 895)[1] is a 7-bit character set and is Japan's national ISO 646 variant.[2] It is the Roman set (first or left half) of the JIS X 0201 (formerly JIS C 6220) Japanese Standard and is variously called Japan 7-Bit Latin,[3] JISCII,[4] JIS Roman,[5] JIS C6220-1969-ro, ISO646-JP[6] or Japanese-Roman.[7] Its ISO-IR registration number is 14.[8]

Amongst IBM's code pages, it accompanies code page 896 (half-width katakana), which encodes the Kana set of JIS X 0201 with extensions, and code page 897 which encodes the 8-bit form of JIS X 0201. It is used in Unix-like systems and, when combined with code page 896 and the 2-byte IBM code page 952 and code page 953, makes up the four code-sets of code page 954, one of IBM's versions of EUC-JP.

Codepage layout

Code page 895 / ISO-IR-014[8][9][10]
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
0_
0

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 
1_
16

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 
2_
32
SP
0020
!
0021
"
0022
#
0023
$
0024
%
0025
&
0026
'
0027
(
0028
)
0029
*
002A
+
002B
,
002C
-
002D
.
002E
/
002F
3_
48
0
0030
1
0031
2
0032
3
0033
4
0034
5
0035
6
0036
7
0037
8
0038
9
0039
:
003A
;
003B
<
003C
=
003D
>
003E
?
003F
4_
64
@
0040
A
0041
B
0042
C
0043
D
0044
E
0045
F
0046
G
0047
H
0048
I
0049
J
004A
K
004B
L
004C
M
004D
N
004E
O
004F
5_
80
P
0050
Q
0051
R
0052
S
0053
T
0054
U
0055
V
0056
W
0057
X
0058
Y
0059
Z
005A
[
005B
¥
00A5
]
005D
^
005E
_
005F
6_
96
`
0060
a
0061
b
0062
c
0063
d
0064
e
0065
f
0066
g
0067
h
0068
i
0069
j
006A
k
006B
l
006C
m
006D
n
006E
o
006F
7_
112
p
0070
q
0071
r
0072
s
0073
t
0074
u
0075
v
0076
w
0077
x
0078
y
0079
z
007A
{
007B
|
007C
}
007D

203E
DEL
007F

  Letter  Number  Punctuation  Symbol  Other  Undefined  Differences from US-ASCII

gollark: As planned.
gollark: Although I actually wrote the regex as```pythonWHITESPACE = r"[\t\n ]*"NUMBER = r"\-?(?:0|[1-9][0-9]*)(?:\.[0-9]+)?(?:[eE][+-]?[0-9]+)?"ARRAY = f"(?:\[{WHITESPACE}(?:|(?R)|(?R)(?:,{WHITESPACE}(?R){WHITESPACE})*){WHITESPACE}])"STRING = r'"(?:[^"\\\n]|\\["\\/bfnrt]|\\u[0-9a-fA-F]{4})*"'TERMINAL = f"(?:true|false|null|{NUMBER}|{STRING})"PAIR = f"(?:{WHITESPACE}{STRING}{WHITESPACE}:{WHITESPACE}(?R){WHITESPACE})"OBJECT = f"(?:{{(?:{WHITESPACE}|{PAIR}|(?:{PAIR}(?:,{PAIR})*))}})"VALUE = f"{WHITESPACE}(?:{ARRAY}|{OBJECT}|{TERMINAL}){WHITESPACE}"```which is much easier.
gollark: Regex is kind of like the APL of string pattern matching, in that it's very terse and expressive but incomprehensible.
gollark: Well, the regex engine is fine with it.
gollark: It's actually a recursive regex, so it can generate infinitely deep problems with a finite... regex.

See also

Shift-JIS

References

  1. "CCSID 895 information document". Archived from the original on 2016-03-26.
  2. RFC 1468
  3. "Code page 895 information document". Archived from the original on 2016-08-10.
  4. "IBM-943 and IBM-932", IBM Knowledge Center, IBM
  5. "kUnicodeForceASCIIRangeMask", Apple Developer Documentation, Apple Inc
  6. RFC 1345
  7. da Cruz, Frank (2010-04-02), "Kermit and MIME Character-Set Names", Kermit Project, Columbia University
  8. ISO-IR 014: The Japanese Roman graphic set of characters (PDF), Information Technology Standards Commission of Japan (ITSCJ/IPSJ)
  9. Code Page CPGID 00895 (pdf) (PDF), IBM
  10. Code Page CPGID 00895 (txt), IBM
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.