KOI8-F

KOI8-F or KOI8 Unified is an 8-bit character set.[1] It was designed by Peter Cassetta[2] of the now-defunct Fingertip Software as an attempt to support all the encoded letters from both KOI8-E (ISO-IR-111) and KOI8-RU (and hence also KOI8-U and KOI8-R), along with some of the pseudographics from KOI8-R.[3][2] The remaining space holds additional punctuation, sourced partly from Windows-1251.[2] The encoding was used only in that company's software.

KOI8 Unified
Alias(es): KOI8-F
Language(s): Belarusian, Ukrainian, Russian, Bulgarian, Serbian Cyrillic, Macedonian
Created by: Peter Cassetta (Fingertip Software)
Classification: 8-bit KOI, extended ASCII
Extends: KOI8-B
Based on: KOI8-RU, KOI8-E
Other related encoding(s): KOI8-R, KOI8-U

Character set

The following table shows the KOI8-F encoding. Each character is shown with its equivalent Unicode code point in hexadecimal. Relative to ISO-IR-111, KOI8-F differs in rows 8_ and 9_ (0x80–0x9F) and at 0xAD and 0xBD; cells that match other relevant encodings are identified in the notes that accompany the table.

KOI8-F[4]

Each cell gives the character followed by its Unicode code point. Row labels give the high nibble, with the decimal value of the row's first byte in parentheses.

          _0         _1      _2      _3      _4      _5             _6      _7      _8      _9      _A         _B      _C      _D      _E      _F
0_ (0)
1_ (16)
2_ (32)   SP 0020    ! 0021  " 0022  # 0023  $ 0024  % 0025         & 0026  ' 0027  ( 0028  ) 0029  * 002A     + 002B  , 002C  - 002D  . 002E  / 002F
3_ (48)   0 0030     1 0031  2 0032  3 0033  4 0034  5 0035         6 0036  7 0037  8 0038  9 0039  : 003A     ; 003B  < 003C  = 003D  > 003E  ? 003F
4_ (64)   @ 0040     A 0041  B 0042  C 0043  D 0044  E 0045         F 0046  G 0047  H 0048  I 0049  J 004A     K 004B  L 004C  M 004D  N 004E  O 004F
5_ (80)   P 0050     Q 0051  R 0052  S 0053  T 0054  U 0055         V 0056  W 0057  X 0058  Y 0059  Z 005A     [ 005B  \ 005C  ] 005D  ^ 005E  _ 005F
6_ (96)   ` 0060     a 0061  b 0062  c 0063  d 0064  e 0065         f 0066  g 0067  h 0068  i 0069  j 006A     k 006B  l 006C  m 006D  n 006E  o 006F
7_ (112)  p 0070     q 0071  r 0072  s 0073  t 0074  u 0075         v 0076  w 0077  x 0078  y 0079  z 007A     { 007B  | 007C  } 007D  ~ 007E
8_ (128)  ─ 2500     │ 2502  ┌ 250C  ┐ 2510  └ 2514  ┘ 2518         ├ 251C  ┤ 2524  ┬ 252C  ┴ 2534  ┼ 253C     ▀ 2580  ▄ 2584  █ 2588  ▌ 258C  ▐ 2590
9_ (144)  ░ 2591     ‘ 2018  ’ 2019  “ 201C  ” 201D  ∙/• 2219/2022  – 2013  — 2014  © 00A9  ™ 2122  NBSP 00A0  » 00BB  ® 00AE  « 00AB  · 00B7  ¤ 00A4
A_ (160)  NBSP 00A0  ђ 0452  ѓ 0453  ё 0451  є 0454  ѕ 0455         і 0456  ї 0457  ј 0458  љ 0459  њ 045A     ћ 045B  ќ 045C  ґ 0491  ў 045E  џ 045F
B_ (176)  № 2116     Ђ 0402  Ѓ 0403  Ё 0401  Є 0404  Ѕ 0405         І 0406  Ї 0407  Ј 0408  Љ 0409  Њ 040A     Ћ 040B  Ќ 040C  Ґ 0490  Ў 040E  Џ 040F
C_ (192)  ю 044E     а 0430  б 0431  ц 0446  д 0434  е 0435         ф 0444  г 0433  х 0445  и 0438  й 0439     к 043A  л 043B  м 043C  н 043D  о 043E
D_ (208)  п 043F     я 044F  р 0440  с 0441  т 0442  у 0443         ж 0436  в 0432  ь 044C  ы 044B  з 0437     ш 0448  э 044D  щ 0449  ч 0447  ъ 044A
E_ (224)  Ю 042E     А 0410  Б 0411  Ц 0426  Д 0414  Е 0415         Ф 0424  Г 0413  Х 0425  И 0418  Й 0419     К 041A  Л 041B  М 041C  Н 041D  О 041E
F_ (240)  П 041F     Я 042F  Р 0420  С 0421  Т 0422  У 0423         Ж 0416  В 0412  Ь 042C  Ы 042B  З 0417     Ш 0428  Э 042D  Щ 0429  Ч 0427  Ъ 042A

Notes (by byte position):

  1. 0x80–0x90, 0x9A, 0x9E: Matching KOI8-R, KOI8-U, KOI8-RU.
  2. 0x91–0x94, 0x96, 0x97, 0x99: Matching Windows-1251 and Windows-1252.
  3. 0x95: May be U+2219, which matches RFC 1489 (KOI8-R),[4] or U+2022, which matches Windows-1251 and Windows-1252.
  4. 0x9A, 0xA0: The non-breaking space is encoded twice: first at 0x9A matching KOI8-R, and then at 0xA0 matching KOI8-E (the latter of which also happens to be its location in Windows-1251 and Windows-1252).
  5. 0xAD, 0xBD: Matching KOI8-U and KOI8-RU.
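
The mapping in the table can be turned into a small lookup-based codec. The following Python sketch is illustrative only, not Fingertip Software's implementation. The names `ROWS_8_TO_B`, `DECODE`, `ENCODE`, `decode_koi8f` and `encode_koi8f` are hypothetical. It also makes three assumptions that the table does not settle: bytes 0x00–0x7F are passed through as ASCII (including the control positions the table leaves blank), byte 0x95 decodes to U+2219, and the non-breaking space is always encoded as 0xA0. Rows C_ to F_ are taken from Python's built-in `koi8_r` codec, since the table lists the same letters there as KOI8-R.

```python
# Sketch of a KOI8-F codec built from the code chart (illustrative, not official).
# All names in this block are hypothetical.

# Unicode code points for bytes 0x80-0xBF, copied row by row from the table.
ROWS_8_TO_B = [
    # 8_: box-drawing and block elements
    0x2500, 0x2502, 0x250C, 0x2510, 0x2514, 0x2518, 0x251C, 0x2524,
    0x252C, 0x2534, 0x253C, 0x2580, 0x2584, 0x2588, 0x258C, 0x2590,
    # 9_: punctuation and symbols; 0x95 is read as U+2219 here (assumption),
    # U+2022 being the other documented reading
    0x2591, 0x2018, 0x2019, 0x201C, 0x201D, 0x2219, 0x2013, 0x2014,
    0x00A9, 0x2122, 0x00A0, 0x00BB, 0x00AE, 0x00AB, 0x00B7, 0x00A4,
    # A_: NBSP and lowercase non-Russian Cyrillic letters
    0x00A0, 0x0452, 0x0453, 0x0451, 0x0454, 0x0455, 0x0456, 0x0457,
    0x0458, 0x0459, 0x045A, 0x045B, 0x045C, 0x0491, 0x045E, 0x045F,
    # B_: numero sign and uppercase non-Russian Cyrillic letters
    0x2116, 0x0402, 0x0403, 0x0401, 0x0404, 0x0405, 0x0406, 0x0407,
    0x0408, 0x0409, 0x040A, 0x040B, 0x040C, 0x0490, 0x040E, 0x040F,
]

# Assumption: 0x00-0x7F are plain ASCII, including control positions.
DECODE = [chr(b) for b in range(0x80)]
DECODE += [chr(cp) for cp in ROWS_8_TO_B]
# Rows C_-F_ list the same letters as KOI8-R, so reuse Python's codec.
DECODE += list(bytes(range(0xC0, 0x100)).decode("koi8_r"))

# Reverse table. NBSP appears at both 0x9A and 0xA0; this sketch picks 0xA0.
ENCODE = {ch: b for b, ch in enumerate(DECODE)}
ENCODE["\u00a0"] = 0xA0
# Accept either reading of byte 0x95 when encoding.
ENCODE["\u2022"] = 0x95


def decode_koi8f(data: bytes) -> str:
    """Decode KOI8-F bytes to a Unicode string via table lookup."""
    return "".join(DECODE[b] for b in data)


def encode_koi8f(text: str) -> bytes:
    """Encode a string as KOI8-F; raises KeyError for unmapped characters."""
    return bytes(ENCODE[ch] for ch in text)
```

For example, `decode_koi8f(bytes([0xF0, 0xD2, 0xC9, 0xD7, 0xC5, 0xD4]))` yields "Привет". Because the non-breaking space has two byte positions, decoding and re-encoding is lossless for every byte except 0x9A, which comes back as 0xA0 under the choice made here.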

References

  1. Nechayev, Valentin (2013) [2001]. "Review of 8-bit Cyrillic encodings universe". Archived from the original on 2016-12-05. Retrieved 2016-12-05.
  2. Czyborra, Roman (1998-11-30) [1998-05-25]. "The Cyrillic Charset Soup". Archived from the original on 2016-12-03. Retrieved 2016-12-03.
  3. "KOI8 Unified". Fingertip Software. Archived from the original on 1998-01-09. Retrieved 2020-02-11.
  4. Leisher, Mark (2008) [1998-03-05]. "KOI8 Unified Cyrillic to Unicode 2.1 mapping table". Department of Mathematical Sciences, New Mexico State University. Retrieved 2020-05-02.
This article is issued from Wikipedia. The text is licensed under Creative Commons - Attribution - Sharealike. Additional terms may apply for the media files.