Chinese Character Code for Information Interchange

Jump to navigation Jump to search

CCCII / EACC
CCCII-illust.svg
The character (circle, Japanese yen, Chinese yuan) in four layers of CCCII.
Language(s)Chinese, Japanese, Korean
StandardMARC-8, ANSI/NISO Z39.64 (both EACC version)
Current statusUsed mainly by library systems
ClassificationTBCS for CJK based on the ISO 2022 structure, JACKPHY component of MARC

The Chinese Character Code for Information Interchange (Chinese: 中文資訊交換碼) or CCCII is a character set developed by the Chinese Character Analysis Group in Taiwan. It was first published in 1980, and significantly expanded in 1982 and 1987.[1]

It is used mostly by library systems.[2][3] It is one of the earliest established and most sophisticated encodings for Traditional Chinese (predating the establishment of Big5 in 1984 and CNS 11643 in 1986).[2] It is distinguished by its unique system for encoding simplified versions and other variants of its main set of hanzi characters.[1]

A variant of an earlier version of CCCII is used by the Library of Congress as part of MARC-8, under the name East Asian Character Code (EACC, ANSI/NISO Z39.64),[4] where it comprises part of MARC 21's JACKPHY support. However, EACC contains fewer characters than the most recent versions of CCCII.[5][1]

Design[edit]

Nine characters from CCCII or EACC implementations, which are encoded as variants of (double-edged sword). The ninth of these, 釖, is usually a variant of .[6]

Byte ranges[edit]

CCCII is designed as an 94n set, as defined by ISO/IEC 2022.[1] Each Chinese character is represented by a 3-byte code in which each byte is 7-bit, between 0x21 and 0x7E inclusive. Thus, the maximum number of Chinese characters representable in CCCII is 94×94×94 = 830584. In practice the number of characters encodable by CCCII would be less than this number, because variant characters are encoded in related ISO 2022 planes under CCCII, so most of the code points would have to be reserved for variants.

In practice, however, bytes outside of these ranges are sometimes used. The code 0x212320 is used by some implementations as an ideographic space.[7] A CCCII specification used by libraries in Hong Kong uses codes starting with 0x2120 for punctuation and symbols.[8] The first byte 0x7F is used by some variants to encode codes for some otherwise unavailable Unified Repertoire and Ordering or CJK Unified Ideographs Extension A hanzi (e.g. 0x7F3449 for U+3449 or 0x7F796E for U+796E;[8] notice how the continuation bytes match the UCS-2BE code), and this may include bytes outside of the 0x21–0x7E or even 0x20–0x7F range, e.g. 0x7F551C for U+551C,[9] 0x7F5AA4 for U+5AA4[9] or 0x7F8EDA for U+8EDA.[8]

Interaction with ISO 2022[edit]

CCCII/EACC is not registered in the International Registry of Coded Character Sets to be Used with Escape Sequences,[10] and as such, does not have a standard designation escape for use with ISO 2022. MARC-8 assigns EACC the private-use F-byte 0x31 (1) in its implementation of ANSI X3.41 (ISO 2022).[11]

Layers and variant characters[edit]

The 94 ISO 2022 planes are grouped into 16 layers of 6 planes each (except for layer 16, which contains the four planes 91–94).[1] Layer 1 contains both non-hanzi and hanzi characters, with the non-hanzi and most frequently used hanzi being placed in plane 1, and with the remaining five planes consisting of less common hanzi.[1] Layer 2 contains Simplified Chinese characters, with their row and cell numbers being the same as their Traditional Chinese equivalents in layer 1. Layers 3 through 12 contain further variant forms, at row and cell numbers homologous to the first two layers.[12]

The last four layers are used for other purposes. Specifically, layer 13 contains additional characters for Japanese language support (kana and Japanese kokuji), and layer 14 contains additional characters for Korean language support (hangul).[12] Layer 15 is unused (reserved), while layer 16 is used for other characters.[1]

This distinctive design has been criticised by Christian Wittern of the International Research Institute for Zen Buddhism at Hanazono University, who asserts that the relationship of character variants "is very complex and can not be expressed in a fixed, one-dimensional, hard-wired codetable".[3] Ken Lunde describes it as "one of the most well thought-out character set standards from Taiwan", describing its structure as "to be truly admired", but concluding that OpenType variant form substitution can provide the same level of functionality.[1]

CCCII defines roughly 53940 code points as of its 1987 edition, although a more recent draft from 1989 extends this to 75684 code points (comprising 44167 unique characters and 31517 variants). EACC, the variant used by the Library of Congress, includes only a smaller set of 15686 characters.[1]

Adoption[edit]

As of 1995, CCCII or EACC was used mostly in libraries in the United States, Hong Kong and Taiwan. Although CCCII promised pan-CJK coverage, its support was limited to specialised hardware; difficulty ascertaining when the root versus variant character should be used, exacerbated by a lack of firmly established reference glyphs, further limited its adoption, resulting in Big5 being more commonly used for Chinese in those territories outside of library use (since Unicode had yet to become widely adopted at the time).[3]

As of 2009, EACC is still in extensive use for specialized bibliographic purposes.[1] It was also an important precursor to Unicode.[1] Unicode hanzi characters are referenced to their corresponding CCCII and EACC codes in the Unihan database, in the keys kCCCII and kEACC.[4] Mapping tables for hanzi, hangul, kana and punctuation between EACC and Unicode are available from the Library of Congress.[13]

Punctuation, symbol, kana and jamo charts[edit]

Following are charts for punctuation, symbols, kana and Hangul jamo, showing the characters and giving possible Unicode mappings. Where possible, these are referenced against published mapping data.

Unicode mappings for Hangul syllables are omitted for brevity, but may also be obtained from the Library of Congress.[14] CCCII hanzi number in the tens of thousands[1][3] and are not shown below (except where they are also included in the non-hanzi range, as radicals or numerals), but mappings to Unicode are available from the Unihan database[4] and from elsewhere.[9][8]

Character set 0x2120 (plane 1, row 0: Hong Kong punctuation)[edit]

Although CCCII is usually a 94n set,[1] and therefore does not usually use codes starting with 0x2120,[9] the following layout is used by a variant used by libraries in Hong Kong:[8]

CCCII (Hong Kong)[8] (prefixed with 0x2120)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
FF64
1-0-1

FF61
1-0-2

 
1-0-3

FF65
1-0-4

FF9E
1-0-5

FF9F
1-0-6
´
00B4
1-0-7

FF40
1-0-8
¨
00A8
1-0-9

 
1-0-10

FFE3
1-0-11

30FD
1-0-12

30FE
1-0-13

309D
1-0-14

309E
1-0-15
3_
3003
1-0-16

 
1-0-17

 
1-0-18

3006
1-0-19

 
1-0-20

 
1-0-21

 
1-0-22

 
1-0-23

 
1-0-24

2016
1-0-25

 
1-0-26

 
1-0-27

 
1-0-28

2026
1-0-29

2025
1-0-30

 
1-0-31
4_
201C
1-0-32

3014
1-0-33

3015
1-0-34

 
1-0-35

 
1-0-36

 
1-0-37

 
1-0-38

FF62
1-0-39

FF63
1-0-40

300E
1-0-41

300F
1-0-42

3010
1-0-43

3011
1-0-44
±
00B1
1-0-45
×
00D7
1-0-46
÷
00F7
1-0-47
5_
2260
1-0-48

2266
1-0-49

2267
1-0-50

221E
1-0-51

2234
1-0-52

2642
1-0-53

2640
1-0-54
°
00B0
1-0-55

2103
1-0-56
¢
00A2
1-0-57
£
00A3
1-0-58
§
00A7
1-0-59

2606
1-0-60

2605
1-0-61

25CB
1-0-62

25CF
1-0-63
6_
2032
1-0-64

2033
1-0-65

 
1-0-66

 
1-0-67

 
1-0-68

 
1-0-69

 
1-0-70

 
1-0-71

 
1-0-72

 
1-0-73

 
1-0-74

 
1-0-75

 
1-0-76

 
1-0-77

 
1-0-78

25CE
1-0-79
7_
25C7
1-0-80

25C6
1-0-81

25A1
1-0-82

25A0
1-0-83

25B3
1-0-84

25B2
1-0-85

25BD
1-0-86

25BC
1-0-87

203B
1-0-88

3012
1-0-89

2192
1-0-90

2190
1-0-91

2191
1-0-92

2193
1-0-93

 
1-0-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x2121 (plane 1, row 1: reserved for controls)[edit]

No characters are assigned in plane 1 row 1, which is reserved for control codes.[1]

Character set 0x2122 (plane 1, row 2: mathematical operators)[edit]

This row contains mathematical operators. EACC leaves this row empty.[13] The following table is referenced against sources from Taiwan.[2][9]

CCCII (Taiwan)[9][15] (prefixed with 0x2122)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
221E
1-2-1

FF0B
1-2-2

2212
1-2-3
±
00B1
1-2-4
×
00D7
1-2-5

22C5
1-2-6
÷
00F7
1-2-7
 ∕ 
2215
1-2-8

FF1D
1-2-9

2260
1-2-10

2261
1-2-11

2248
1-2-12

223C
1-2-13

221D
1-2-14

FF1C
1-2-15
3_
FF1E
1-2-16

226E
1-2-17

226F
1-2-18

2264
1-2-19

2265
1-2-20

226A
1-2-21

226B
1-2-22

2202
1-2-23

222B
1-2-24
Δ
0394
1-2-25

2206
1-2-26

2207
1-2-27

25AB
1-2-28

2220
1-2-29

22A4
1-2-30

2225
1-2-31
4_
2245
1-2-32

225E
1-2-33

2234
1-2-34

2203
1-2-35

2200
1-2-36

222A
1-2-37

2229
1-2-38

2282
1-2-39

2283
1-2-40

21D2
1-2-41

21D4
1-2-42

220B
1-2-43

2208
1-2-44

2209
1-2-45

2211
1-2-46

33D2
1-2-47
5_
33D1
1-2-48

212F
1-2-49
π
03C0
1-2-50

221A
1-2-51

FE15
1-2-52

23A2
1-2-53

23A5
1-2-54

3008
1-2-55

3009
1-2-56

 
1-2-57

 
1-2-58

 
1-2-59

 
1-2-60

 
1-2-61

 
1-2-62

 
1-2-63
6_
 
1-2-64

 
1-2-65

 
1-2-66

 
1-2-67

 
1-2-68

 
1-2-69

 
1-2-70

 
1-2-71

 
1-2-72

 
1-2-73

 
1-2-74

 
1-2-75

 
1-2-76

 
1-2-77

 
1-2-78

 
1-2-79
7_
 
1-2-80

 
1-2-81

 
1-2-82

 
1-2-83

 
1-2-84

 
1-2-85

 
1-2-86

 
1-2-87

 
1-2-88

 
1-2-89

 
1-2-90

 
1-2-91

 
1-2-92

 
1-2-93

 
1-2-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

The following table is referenced against CCCII data provided by the Hong Kong Innovative Users Group, a group of libraries in Hong Kong, and hosted by the University of Hong Kong.[16][8] It uses an entirely different layout in this row:

CCCII (Hong Kong)[8] (prefixed with 0x2122)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
2208
 

220B
1-2-1

2286
1-2-2

2287
1-2-3

2282
1-2-4

2283
1-2-5

222A
1-2-6

2229
1-2-7

2227
1-2-8

2228
1-2-9
¬
00AC
1-2-10

21D2
1-2-11

21D4
1-2-12

2200
1-2-13

2203
1-2-14

 
1-2-15
3_
2220
1-2-16

22A5
1-2-17

2312
1-2-18

2202
1-2-19

2207
1-2-20

2261
1-2-21

2252
1-2-22

226A
1-2-23

226B
1-2-24

221A
1-2-25

223D
1-2-26

221D
1-2-27

2235
1-2-28

222B
1-2-29

222C
1-2-30

 
1-2-31
4_
212B
1-2-32

2030
1-2-33

266F
1-2-34

266D
1-2-35

266A
1-2-36

2020
1-2-37

2021
1-2-38

00B6
1-2-39

25EF
1-2-40

 
1-2-41

 
1-2-42

 
1-2-43

 
1-2-44

 
1-2-45

 
1-2-46

 
1-2-47
5_
2500
1-2-48

2502
1-2-49

250C
1-2-50

2510
1-2-51

2518
1-2-52

2514
1-2-53

251C
1-2-54

252C
1-2-55

2524
1-2-56

2534
1-2-57

253C
1-2-58

2501
1-2-59

2503
1-2-60

250F
1-2-61

2513
1-2-62

251B
1-2-63
6_
2517
1-2-64

2523
1-2-65

2533
1-2-66

252B
1-2-67

253B
1-2-68

254B
1-2-69

2520
1-2-70

252F
1-2-71

2528
1-2-72

2537
1-2-73

253F
1-2-74

251D
1-2-75

2530
1-2-76

2525
1-2-77

2538
1-2-78

2542
1-2-79
7_
 
1-2-80

 
1-2-81

 
1-2-82

 
1-2-83

 
1-2-84

 
1-2-85

 
1-2-86

 
1-2-87

 
1-2-88

 
1-2-89

 
1-2-90

 
1-2-91

 
1-2-92

 
1-2-93

 
1-2-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x2123 (plane 1, row 3: Roman and punctuation)[edit]

This row includes punctuation, western Arabic numerals and Roman letters.[9] Compare row 3 of Wansung code and of GB 2312.

Different variants variously encode the ideographic space (U+3000) at 0x212320 (which the MARC specification acknowledges),[7][8] 0x212321 (which is listed in the ANSI standard, and is also acknowledged by MARC),[7][8] or 0x21635F.[9] EACC includes only the hyphen-minus, parentheses and ideographic space in this set.[7]

CCCII/EACC[13][9][15] (prefixed with 0x2123)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_ IDSP
3000
 
[a]
!/IDSP
0021/3000
1-3-1
[b]
"
0022
1-3-2
#
0023
1-3-3
$
0024
1-3-4
%
0025
1-3-5
&
0026
1-3-6
'
0027
1-3-7
(/
0028/FF08
1-3-8
)/
0029/FF09
1-3-9
*
002A
1-3-10
+
002B
1-3-11
,
002C
1-3-12
-/
002D/FF0D
1-3-13
.
002E
1-3-14
/
002F
1-3-15
3_/B_ 0
0030
1-3-16
1
0031
1-3-17
2
0032
1-3-18
3
0033
1-3-19
4
0034
1-3-20
5
0035
1-3-21
6
0036
1-3-22
7
0037
1-3-23
8
0038
1-3-24
9
0039
1-3-25
:
003A
1-3-26
;
003B
1-3-27
<
003C
1-3-28
=
003D
1-3-29
>
003E
1-3-30
?
003F
1-3-31
4_/C_ @
0040
1-3-32
A
0041
1-3-33
B
0042
1-3-34
C
0043
1-3-35
D
0044
1-3-36
E
0045
1-3-37
F
0046
1-3-38
G
0047
1-3-39
H
0048
1-3-40
I
0049
1-3-41
J
004A
1-3-42
K
004B
1-3-43
L
004C
1-3-44
M
004D
1-3-45
N
004E
1-3-46
O
004F
1-3-47
5_/D_ P
0050
1-3-48
Q
0051
1-3-49
R
0052
1-3-50
S
0053
1-3-51
T
0054
1-3-52
U
0055
1-3-53
V
0056
1-3-54
W
0057
1-3-55
X
0058
1-3-56
Y
0059
1-3-57
Z
005A
1-3-58
[
005B
1-3-59
\
005C
1-3-60
]
005D
1-3-61

2191
1-3-62
_
005F
1-3-63
6_/E_ `/'
0060/0027
1-3-64
a
0061
1-3-65
b
0062
1-3-66
c
0063
1-3-67
d
0064
1-3-68
e
0065
1-3-69
f
0066
1-3-70
g
0067
1-3-71
h
0068
1-3-72
i
0069
1-3-73
j
006A
1-3-74
k
006B
1-3-75
l
006C
1-3-76
m
006D
1-3-77
n
006E
1-3-78
o
006F
1-3-79
7_/F_ p
0070
1-3-80
q
0071
1-3-81
r
0072
1-3-82
s
0073
1-3-83
t
0074
1-3-84
u
0075
1-3-85
v
0076
1-3-86
w
0077
1-3-87
x
0078
1-3-88
y
0079
1-3-89
z
007A
1-3-90
{
007B
1-3-91
|
007C
1-3-92
}
007D
1-3-93
~
007E
1-3-94
 
 
 

  Letter  Number  Punctuation  Symbol  Other  Undefined  Included in EACC.

Character set 0x212A (plane 1, row 10: internal IME characters and geta mark)[edit]

In EACC, this row includes several Private Use Area mapped characters used internally to represent character components by the RLIN input method,[17] which is used by the Library of Congress for non-Roman cataloging.[18] These component characters should only be used internally by an IME and, if encountered elsewhere, may be replaced with the geta mark (U+3013),[17] which this row also includes at 0x212A46. This row is unassigned in CCCII,[1] but the geta mark is also listed at that location in some mappings for CCCII.[9]

EACC[13] (prefixed with 0x212A)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
E8D0
1-10-1

E8D1
1-10-2

E8D2
1-10-3

E8D3
1-10-4

E8D4
1-10-5

E8D5
1-10-6

 
1-10-7

E8D6
1-10-8

E8D7
1-10-9

E8D8
1-10-10

E8D9
1-10-11

E8DA
1-10-12

E8DB
1-10-13

E8DC
1-10-14

E8DD
1-10-15
3_
E8DE
1-10-16

 
1-10-17

E8DF
1-10-18

E8E0
1-10-19

E8E1
1-10-20

E8E2
1-10-21

E8E3
1-10-22

E8E4
1-10-23

E8E5
1-10-24

E8E6
1-10-25

E8E7
1-10-26

E8E8
1-10-27

E8E9
1-10-28

E8EA
1-10-29

E8EB
1-10-30

E8EC
1-10-31
4_
E8ED
1-10-32

E8EE
1-10-33

E8EF
1-10-34

E8F0
1-10-35

E8F1
1-10-36

E8F2
1-10-37

3013
1-10-38

 
1-10-39

 
1-10-40

 
1-10-41

 
1-10-42

 
1-10-43

 
1-10-44

 
1-10-45

 
1-10-46

 
1-10-47
5_
 
1-10-48

 
1-10-49

 
1-10-50

 
1-10-51

 
1-10-52

 
1-10-53

 
1-10-54

 
1-10-55

 
1-10-56

 
1-10-57

 
1-10-58

 
1-10-59

 
1-10-60

 
1-10-61

 
1-10-62

 
1-10-63
6_
 
1-10-64

 
1-10-65

 
1-10-66

 
1-10-67

 
1-10-68

 
1-10-69

 
1-10-70

 
1-10-71

 
1-10-72

 
1-10-73

 
1-10-74

 
1-10-75

 
1-10-76

 
1-10-77

 
1-10-78

 
1-10-79
7_
 
1-10-80

 
1-10-81

 
1-10-82

 
1-10-83

 
1-10-84

 
1-10-85

 
1-10-86

 
1-10-87

 
1-10-88

 
1-10-89

 
1-10-90

 
1-10-91

 
1-10-92

 
1-10-93

 
1-10-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x212B (plane 1, row 11: punctuation)[edit]

This row contains various punctuation marks used in Chinese,[1][7] in addition to other symbols. CCCII includes a set of 35 punctuation marks in this row.[1] EACC includes only 13 characters in this row (shown boxed below).[7]

CCCII/EACC[13][9][15] (prefixed with 0x212B)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
FE35
1-11-1

FE36
1-11-2

FE41
1-11-3

FE42
1-11-4

300C
1-11-5

300D
1-11-6

FE33
1-11-7
_
FF3F
1-11-8

FE43
1-11-9

FE44
1-11-10

300E
1-11-11

300F
1-11-12

FE34
1-11-13

FE4F
1-11-14

FE39
1-11-15
3_
FE3A
1-11-16
/
3014/FF3B
1-11-17
/
3015/FF3D
1-11-18

3002
1-11-19
/
30FB/FF0E
1-11-20

3001
1-11-21

22EE
1-11-22

22EF
1-11-23

FF0C
1-11-24

FF1B
1-11-25

FF1A
1-11-26

FF1F
1-11-27

FE31
1-11-28

FF01
1-11-29

FE32
1-11-30

FE31
1-11-31
4_
2018
1-11-32

2019
1-11-33

201C
1-11-34

201D
1-11-35

 
1-11-36

 
1-11-37

 
1-11-38

 
1-11-39

 
1-11-40

 
1-11-41

300A
1-11-42

300B
1-11-43

3010
1-11-44

3011
1-11-45

3016
1-11-46

3017
1-11-47
5_ Cifrão symbol.svg
 
1-11-48

FF04
1-11-49
¢
00A2
1-11-50

20A1
1-11-51
£
00A3
1-11-52
¥
00A5
1-11-53

20A8
1-11-54
d.
 
1-11-55
s.
 
1-11-56

FF0F
1-11-57

FF03
1-11-58

FF05
1-11-59

214C
1-11-60

FF20
1-11-61

00B6
1-11-62
®
00AE
1-11-63
6_ ©
00A9
1-11-64

2105
1-11-65

FF06
1-11-66
§
00A7
1-11-67

2020
1-11-68

2021
1-11-69

FF0A
1-11-70

 
1-11-71

 
1-11-72

 
1-11-73

 
1-11-74

 
1-11-75

 
1-11-76

 
1-11-77

 
1-11-78

 
1-11-79
7_
30FD
1-11-80

30FE
1-11-81

309D
1-11-82

309E
1-11-83
α
03B1
1-11-84

3012
1-11-85

 
1-11-86

 
1-11-87

 
1-11-88

 
1-11-89

 
1-11-90

 
1-11-91

 
1-11-92

 
1-11-93

 
1-11-94

  Letter  Number  Punctuation  Symbol  Other  Undefined  Included in EACC.

Character sets 0x212C–0x212E (plane 1, rows 12–14: radicals and ordinals)[edit]

These rows contain Chinese radicals,[1] Roman numerals,[9] celestial stems and terrestrial branches.[15]

CCCII[15] (prefixed with 0x212C)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
 
 

 
1-12-1

2F00
1-12-2

2F01
1-12-3

2F02
1-12-4

2F03
1-12-5

2F04
1-12-6

2F05
1-12-7

 
1-12-8

2F06
1-12-9

2F07
1-12-10

2F08
1-12-11

2F09
1-12-12

2F0A
1-12-13

2F0B
1-12-14

2F0C
1-12-15
3_
2F0D
1-12-16

2F0E
1-12-17

2F0F
1-12-18

2F10
1-12-19

2F11
1-12-20

2F12
1-12-21

2F13
1-12-22

2F14
1-12-23

2F15
1-12-24

2F16
1-12-25

2F17
1-12-26

2F18
1-12-27

2F19
1-12-28

2F1A
1-12-29

2F1B
1-12-30

2F1C
1-12-31
4_
 
1-12-32

2F1D
1-12-33

2F1E
1-12-34

2F1F
1-12-35

2F20
1-12-36

2F21
1-12-37

2F22
1-12-38

2F23
1-12-39

2F24
1-12-40

2F25
1-12-41

2F26
1-12-42

2F27
1-12-43

2F28
1-12-44

2F29
1-12-45

2F2A
1-12-46

2F2B
1-12-47
5_
2F2C
1-12-48

2F2D
1-12-49

2F2E
1-12-50

2F2F
1-12-51

2F30
1-12-52

2F31
1-12-53

2F32
1-12-54

2F33
1-12-55

2F34
1-12-56

2F35
1-12-57

2F36
1-12-58

2F37
1-12-59

2F38
1-12-60

2F39
1-12-61

2F3A
1-12-62

2F3B
1-12-63
6_
 
1-12-64

2F3C
1-12-65

2F3D
1-12-66

2F3E
1-12-67
⼿
2F3F
1-12-68

2F40
1-12-69

2F41
1-12-70

2F42
1-12-71

2F43
1-12-72

2F44
1-12-73

2F45
1-12-74

2F46
1-12-75

2F47
1-12-76

2F48
1-12-77

2F49
1-12-78

2F4A
1-12-79
7_
2F4B
1-12-80

2F4C
1-12-81

2F4D
1-12-82

2F4E
1-12-83

2F4F
1-12-84

2F50
1-12-85

2F51
1-12-86

2F52
1-12-87

2F53
1-12-88

2F54
1-12-89

2F55
1-12-90

2F56
1-12-91

2F57
1-12-92

2F58
1-12-93

2F59
1-12-94

 
 
CCCII[8][15] (prefixed with 0x212D)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
 
 

2F5A
1-13-1

2F5B
1-13-2

2F5C
1-13-3

2F5D
1-13-4

 
1-13-5

2F5E
1-13-6

2F5F
1-13-7

2F60
1-13-8

2F61
1-13-9

2F62
1-13-10

2F63
1-13-11

2F64
1-13-12

2F65
1-13-13

2F66
1-13-14

2F67
1-13-15
3_
2F68
1-13-16

2F69
1-13-17

2F6A
1-13-18

2F6B
1-13-19

2F6C
1-13-20

2F6D
1-13-21

2F6E
1-13-22

2F6F
1-13-23

2F70
1-13-24

2F71
1-13-25

2F72
1-13-26

2F73
1-13-27

2F74
1-13-28

 
1-13-29

2F75
1-13-30

2F76
1-13-31
4_
2F77
1-13-32

2F78
1-13-33

2F79
1-13-34

2F7A
1-13-35

2F7B
1-13-36

2F7C
1-13-37

2F7D
1-13-38

2F7E
1-13-39
⽿
2F7F
1-13-40

2F80
1-13-41

2F81
1-13-42

2F82
1-13-43

2F83
1-13-44

2F84
1-13-45

2F85
1-13-46

2F86
1-13-47
5_
2F87
1-13-48

2F88
1-13-49

2F89
1-13-50

2F8A
1-13-51

2F8B
1-13-52

2F8C
1-13-53

2F8D
1-13-54

2F8E
1-13-55

2F8F
1-13-56

2F90
1-13-57

2F91
1-13-58

 
1-13-59

2F92
1-13-60

2F93
1-13-61
/
2F94/8A01
1-13-62

2F95
1-13-63
6_
2F96
1-13-64

2F97
1-13-65

2F98
1-13-66

2F99
1-13-67

2F9A
1-13-68

2F9B
1-13-69

2F9C
1-13-70

2F9D
1-13-71

2F9E
1-13-72

2F9F
1-13-73

2FA0
1-13-74

2FA1
1-13-75

2FA2
1-13-76

2FA3
1-13-77

2FA4
1-13-78

2FA5
1-13-79
7_
 
1-13-80
/
2FA6/91D2
1-13-81

2FA7
1-13-82

2FA8
1-13-83

2FA9
1-13-84

2FAA
1-13-85

2FAB
1-13-86

2FAC
1-13-87

2FAD
1-13-88

2FAE
1-13-89

 
1-13-90

2FAF
1-13-91

2FB0
1-13-92

2FB1
1-13-93

2FB2
1-13-94

 
 
CCCII[9][8][15] (prefixed with 0x212E)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
 
 

2FB3
1-14-1

2FB4
1-14-2

2FB5
1-14-3

2FB6
1-14-4
/
2FB7/98E0
1-14-5

2FB8
1-14-6

2FB9
1-14-7

 
1-14-8

2FBA
1-14-9

2FBB
1-14-10

2FBC
1-14-11

2FBD
1-14-12

2FBE
1-14-13
⾿
2FBF
1-14-14

2FC0
1-14-15
3_
2FC1
1-14-16

 
1-14-17

2FC2
1-14-18

2FC3
1-14-19

2FC4
1-14-20

2FC5
1-14-21

2FC6
1-14-22

2FC7
1-14-23

 
1-14-24

2FC8
1-14-25

2FC9
1-14-26

2FCA
1-14-27

2FCB
1-14-28

 
1-14-29

2FCC
1-14-30

2FCD
1-14-31
4_
2FCE
1-14-32

2FCF
1-14-33

 
1-14-34

2FD0
1-14-35

2FD1
1-14-36

 
1-14-37

2FD2
1-14-38

 
1-14-39

2FD3
1-14-40

2FD4
1-14-41

 
1-14-42

2FD5
1-14-43

 
1-14-44

 
1-14-45

 
1-14-46

 
1-14-47
5_
7532
1-14-48

4E59
1-14-49

4E19
1-14-50

4E01
1-14-51

620A
1-14-52

5DF1
1-14-53

5E9A
1-14-54

8F9B
1-14-55

58EC
1-14-56

7678
1-14-57

 
1-14-58

 
1-14-59

 
1-14-60

 
1-14-61

 
1-14-62

 
1-14-63
6_
5B50
1-14-64

4E11
1-14-65

5BC5
1-14-66

536F
1-14-67

8FB0
1-14-68

5DF3
1-14-69

5348
1-14-70

672A
1-14-71

7533
1-14-72

9149
1-14-73

620C
1-14-74

4EA5
1-14-75

 
1-14-76

 
1-14-77

 
1-14-78

 
1-14-79
7_
 
1-14-80

2160
1-14-81

2161
1-14-82

2162
1-14-83

2163
1-14-84

2164
1-14-85

2165
1-14-86

2166
1-14-87

2167
1-14-88

2168
1-14-89

2169
1-14-90

216A
1-14-91

216B
1-14-92

 
1-14-93

 
1-14-94

 
 

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x212F (plane 1, row 15: Chinese numerals and bopomofo)[edit]

This row includes Chinese numerals and bopomofo characters.[1] EACC includes only the ideographic zero (〇).[7]

CCCII/EACC[13][15][9] (prefixed with 0x212F)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
3021
1-15-1

3022
1-15-2

3023
1-15-3

3024
1-15-4

3025
1-15-5

3026
1-15-6

3027
1-15-7

3028
1-15-8

3029
1-15-9

3038
1-15-10

3039
1-15-11

303A
1-15-12

 
1-15-13

 
1-15-14

 
1-15-15
3_
3007
1-15-16

4E00
1-15-17

4E8C
1-15-18

4E09
1-15-19

56DB
1-15-20

4E94
1-15-21

516D
1-15-22

4E03
1-15-23

516B
1-15-24

4E5D
1-15-25

5341
1-15-26

767E
1-15-27

5343
1-15-28

4E07
1-15-29

 
1-15-30

 
1-15-31
4_
96F6
1-15-32

58F9
1-15-33

8CB3
1-15-34

53C3
1-15-35

8086
1-15-36

4F0D
1-15-37

9678
1-15-38

67D2
1-15-39

634C
1-15-40

7396
1-15-41

62FE
1-15-42

4F70
1-15-43

4EDF
1-15-44

842C
1-15-45

5104
1-15-46

 
1-15-47
5_ ˊ
02CA
1-15-48
ˇ
02C7
1-15-49
ˋ
02CB
1-15-50
˙/[c]
02D9/FE52
1-15-51

3105
1-15-52

3106
1-15-53

3107
1-15-54

3108
1-15-55

3109
1-15-56

310A
1-15-57

310B
1-15-58

310C
1-15-59

310D
1-15-60

310E
1-15-61

310F
1-15-62

3110
1-15-63
6_
3111
1-15-64

3112
1-15-65

3113
1-15-66

3114
1-15-67

3115
1-15-68

3116
1-15-69

3117
1-15-70

3118
1-15-71

3119
1-15-72

311A
1-15-73

311B
1-15-74

311C
1-15-75

311D
1-15-76

311E
1-15-77

311F
1-15-78

3120
1-15-79
7_
3121
1-15-80

3122
1-15-81

3123
1-15-82

3124
1-15-83

3125
1-15-84

3126
1-15-85

3127
1-15-86

3128
1-15-87

3129
1-15-88

 
1-15-89
ü
00FC
1-15-90

 
1-15-91

 
1-15-92

 
1-15-93

 
1-15-94

  Letter  Number  Punctuation  Symbol  Other  Undefined  Included in EACC.

Character set 0x272B (plane 7, row 11: reference mark)[edit]

This row contains the reference mark (kome jirushi).[9]

CCCII[9] (prefixed with 0x272B)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
6_
 
7-11-64

 
7-11-65

 
7-11-66

 
7-11-67

 
7-11-68

 
7-11-69

 
7-11-70

 
7-11-71

 
7-11-72

 
7-11-73

 
7-11-74

 
7-11-75

 
7-11-76

 
7-11-77

 
7-11-78

203B
7-11-79

Character set 0x272E–0x272F (plane 7, rows 14–15: alternative bopomofo)[edit]

A variant used by libraries in Hong Kong does not include bopomofo characters in plane 1 row 15, but includes them in a different layout in plane 7.[8]

CCCII (Hong Kong)[8] (prefixed with 0x272E)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
7_
 
7-14-80

 
7-14-81

 
7-14-82

 
7-14-83

 
7-14-84

3105
7-14-85

3106
7-14-86

3107
7-14-87

3108
7-14-88

3109
7-14-89

310A
7-14-90

310B
7-14-91

310C
7-14-92

310D
7-14-93

310E
7-14-94
CCCII (Hong Kong) (prefixed with 0x272F)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
310F
7-15-1

3110
7-15-2

3111
7-15-3

3112
7-15-4

3113
7-15-5

3114
7-15-6

3115
7-15-7

3116
7-15-8

3117
7-15-9

3118
7-15-10

3119
7-15-11

311A
7-15-12

311B
7-15-13

311C
7-15-14

311D
7-15-15
3_
311E
7-15-16

311F
7-15-17

3120
7-15-18

3121
7-15-19

3122
7-15-20

3123
7-15-21

3124
7-15-22

3125
7-15-23

3126
7-15-24

3127
7-15-25

3128
7-15-26

3129
7-15-27

 
7-15-28

 
7-15-29

 
7-15-30

 
7-15-31

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x6921 (plane 73, row 1: Japanese punctuation)[edit]

This row is in plane 73, the first plane of layer 13, which contains characters included for Japanese language support.[12] It contains punctuation.[7] Compare row 1 of JIS X 0208, which this row tends to follow the layout of for the characters it includes.

CCCII/EACC[13][9][8] (prefixed with 0x6921)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
 
73-1-1

 
73-1-2

 
73-1-3

 
73-1-4

 
73-1-5

30FB
73-1-6

 
73-1-7

 
73-1-8

 
73-1-9

 
73-1-10

 
73-1-11

 
73-1-12

 
73-1-13

 
73-1-14

 
73-1-15
3_
 
73-1-16

 
73-1-17

 
73-1-18

 
73-1-19

 
73-1-20

 
73-1-21

 
73-1-22

 
73-1-23

 
73-1-24

3005
73-1-25

3006
73-1-26

 
73-1-27

30FC
73-1-28

 
73-1-29

 
73-1-30

 
73-1-31
4_
 
73-1-32

 
73-1-33

 
73-1-34

 
73-1-35

 
73-1-36

 
73-1-37

 
73-1-38

 
73-1-39

 
73-1-40

 
73-1-41

 
73-1-42

 
73-1-43

 
73-1-44

 
73-1-45

 
73-1-46

 
73-1-47
5_
 
73-1-48

 
73-1-49

3008
73-1-50

3009
73-1-51

300A
73-1-52

300B
73-1-53

 
73-1-54

 
73-1-55

 
73-1-56

 
73-1-57

 
73-1-58

 
73-1-59

 
73-1-60

 
73-1-61

 
73-1-62

 
73-1-63
6_
 
73-1-64

 
73-1-65

 
73-1-66

 
73-1-67

 
73-1-68

 
73-1-69

 
73-1-70

 
73-1-71

 
73-1-72

 
73-1-73

 
73-1-74

 
73-1-75

 
73-1-76

 
73-1-77

 
73-1-78

 
73-1-79
7_
 
73-1-80

 
73-1-81

 
73-1-82

 
73-1-83

 
73-1-84

 
73-1-85

 
73-1-86

 
73-1-87

 
73-1-88

 
73-1-89

 
73-1-90

 
73-1-91

 
73-1-92

 
73-1-93

 
73-1-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x6924 (plane 73, row 4: hiragana)[edit]

This row contains hiragana. Compare row 4 of JIS X 0208.

CCCII/EACC[13][9][8] (prefixed with 0x6924)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
3041
73-4-1

3042
73-4-2

3043
73-4-3

3044
73-4-4

3045
73-4-5

3046
73-4-6

3047
73-4-7

3048
73-4-8

3049
73-4-9

304A
73-4-10

304B
73-4-11

304C
73-4-12

304D
73-4-13

304E
73-4-14

304F
73-4-15
3_
3050
73-4-16

3051
73-4-17

3052
73-4-18

3053
73-4-19

3054
73-4-20

3055
73-4-21

3056
73-4-22

3057
73-4-23

3058
73-4-24

3059
73-4-25

305A
73-4-26

305B
73-4-27

305C
73-4-28

305D
73-4-29

305E
73-4-30

305F
73-4-31
4_
3060
73-4-32

3061
73-4-33

3062
73-4-34

3063
73-4-35

3064
73-4-36

3065
73-4-37

3066
73-4-38

3067
73-4-39

3068
73-4-40

3069
73-4-41

306A
73-4-42

306B
73-4-43

306C
73-4-44

306D
73-4-45

306E
73-4-46

306F
73-4-47
5_
3070
73-4-48

3071
73-4-49

3072
73-4-50

3073
73-4-51

3074
73-4-52

3075
73-4-53

3076
73-4-54

3077
73-4-55

3078
73-4-56

3079
73-4-57

307A
73-4-58

307B
73-4-59

307C
73-4-60

307D
73-4-61

307E
73-4-62

307F
73-4-63
6_
3080
73-4-64

3081
73-4-65

3082
73-4-66

3083
73-4-67

3084
73-4-68

3085
73-4-69

3086
73-4-70

3087
73-4-71

3088
73-4-72

3089
73-4-73

308A
73-4-74

308B
73-4-75

308C
73-4-76

308D
73-4-77

308E
73-4-78

308F
73-4-79
7_
3090
73-4-80

3091
73-4-81

3092
73-4-82

3093
73-4-83

 
73-4-84

 
73-4-85

 
73-4-86

 
73-4-87

 
73-4-88

 
73-4-89

 
73-4-90

 
73-4-91

 
73-4-92

 
73-4-93

 
73-4-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x6925 (plane 73, row 5: katakana)[edit]

This row contains katakana. Compare row 5 of JIS X 0208, which this row corresponds to, besides the addition of the separate dakuten and handakuten.

CCCII/EACC[13][9] (prefixed with 0x6925)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
30A1
73-5-1

30A2
73-5-2

30A3
73-5-3

30A4
73-5-4

30A5
73-5-5

30A6
73-5-6

30A7
73-5-7

30A8
73-5-8

30A9
73-5-9

30AA
73-5-10

30AB
73-5-11

30AC
73-5-12

30AD
73-5-13

30AE
73-5-14

30AF
73-5-15
3_
30B0
73-5-16

30B1
73-5-17

30B2
73-5-18

30B3
73-5-19

30B4
73-5-20

30B5
73-5-21

30B6
73-5-22

30B7
73-5-23

30B8
73-5-24

30B9
73-5-25

30BA
73-5-26

30BB
73-5-27

30BC
73-5-28

30BD
73-5-29

30BE
73-5-30

30BF
73-5-31
4_
30C0
73-5-32

30C1
73-5-33

30C2
73-5-34

30C3
73-5-35

30C4
73-5-36

30C5
73-5-37

30C6
73-5-38

30C7
73-5-39

30C8
73-5-40

30C9
73-5-41

30CA
73-5-42

30CB
73-5-43

30CC
73-5-44

30CD
73-5-45

30CE
73-5-46

30CF
73-5-47
5_
30D0
73-5-48

30D1
73-5-49

30D2
73-5-50

30D3
73-5-51

30D4
73-5-52

30D5
73-5-53

30D6
73-5-54

30D7
73-5-55

30D8
73-5-56

30D9
73-5-57

30DA
73-5-58

30DB
73-5-59

30DC
73-5-60

30DD
73-5-61

30DE
73-5-62

30DF
73-5-63
6_
30E0
73-5-64

30E1
73-5-65

30E2
73-5-66

30E3
73-5-67

30E4
73-5-68

30E5
73-5-69

30E6
73-5-70

30E7
73-5-71

30E8
73-5-72

30E9
73-5-73

30EA
73-5-74

30EB
73-5-75

30EC
73-5-76

30ED
73-5-77

30EE
73-5-78

30EF
73-5-79
7_
30F0
73-5-80

30F1
73-5-81

30F2
73-5-82

30F3
73-5-83

30F4
73-5-84

30F5
73-5-85

30F6
73-5-86
◌゙/
3099/309B
73-5-87
◌゚/
309A/309C
73-5-88

 
73-5-89

 
73-5-90

 
73-5-91

 
73-5-92

 
73-5-93

 
73-5-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x6F24–0x6F25 (plane 79, rows 4–5: jamo)[edit]

These rows contains Korean jamo.

EACC[13] (prefixed with 0x6F24)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
5_
 
79-4-48

 
79-4-49

 
79-4-50

 
79-4-51

 
79-4-52

 
79-4-53

 
79-4-54

3131
79-4-55

3134
79-4-56

3137
79-4-57

3139
79-4-58

3141
79-4-59

3142
79-4-60

3145
79-4-61

3147
79-4-62

3148
79-4-63
6_
314A
79-4-64

314B
79-4-65

314C
79-4-66

314D
79-4-67

314E
79-4-68

3132
79-4-69

 
79-4-70

 
79-4-71

 
79-4-72

3138
79-4-73

 
79-4-74

 
79-4-75

 
79-4-76

 
79-4-77

3143
79-4-78

 
79-4-79
7_
3146
79-4-80

3149
79-4-81

314F
79-4-82

3150
79-4-83

3151
79-4-84

 
79-4-85

3153
79-4-86

3154
79-4-87

3155
79-4-88

 
79-4-89

3157
79-4-90

3158
79-4-91

 
79-4-92

 
79-4-93

315B
79-4-94
EACC[13] (prefixed with 0x6F25)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
315C
79-5-1

 
79-5-2

 
79-5-3

 
79-5-4

3160
79-5-5

3161
79-5-6

3162
79-5-7

3163
79-5-8

 
79-5-9

 
79-5-10

 
79-5-11

 
79-5-12

 
79-5-13

 
79-5-14

 
79-5-15

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x6F76 (plane 79, row 86: archaic Hangul)[edit]

This row contains several historic Hangul characters no longer in regular use. Several of these are mapped to the Private Use Area.[17]

EACC[13] (prefixed with 0x6F76)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
2_
3181
79-86-1

3186
79-86-2

317F
79-86-3

E8B0
79-86-4

318D
79-86-5

 
79-86-6

 
79-86-7

 
79-86-8

 
79-86-9

 
79-86-10

 
79-86-11

 
79-86-12

 
79-86-13

 
79-86-14

 
79-86-15
3_
 
79-86-16

 
79-86-17

 
79-86-18

 
79-86-19

 
79-86-20

 
79-86-21

 
79-86-22

 
79-86-23

 
79-86-24

 
79-86-25

 
79-86-26

 
79-86-27

 
79-86-28

 
79-86-29

 
79-86-30

 
79-86-31
4_
E8B2
79-86-32

E8B3
79-86-33

E8B4
79-86-34

E8B5
79-86-35

E8B6
79-86-36

E8B7
79-86-37

E8B8
79-86-38

E8B9
79-86-39

E8BA
79-86-40

E8BB
79-86-41

E8BC
79-86-42

E8BD
79-86-43

E8BE
79-86-44

E8BF
79-86-45

E8C0
79-86-46

E8C1
79-86-47
5_
E8C2
79-86-48

E8C3
79-86-49

E8C4
79-86-50

E8C5
79-86-51

E8C6
79-86-52

E8C7
79-86-53

E8C8
79-86-54

E8C9
79-86-55

 
79-86-56

 
79-86-57

 
79-86-58

 
79-86-59

 
79-86-60

 
79-86-61

 
79-86-62

 
79-86-63
6_
 
79-86-64

 
79-86-65

 
79-86-66

 
79-86-67

 
79-86-68

 
79-86-69

 
79-86-70

 
79-86-71

 
79-86-72

 
79-86-73

 
79-86-74

 
79-86-75

 
79-86-76

 
79-86-77

 
79-86-78

 
79-86-79
7_
 
79-86-80

 
79-86-81

 
79-86-82

 
79-86-83

 
79-86-84

 
79-86-85

 
79-86-86

 
79-86-87

 
79-86-88

 
79-86-89

 
79-86-90

 
79-86-91

 
79-86-92

 
79-86-93

 
79-86-94

  Letter  Number  Punctuation  Symbol  Other  Undefined

Character set 0x7B25 (plane 91, row 5: supplementary Katakana)[edit]

This row contains additional katakana used to write foreign phonemes.[9]

CCCII[9] (prefixed with 0x7B25)
_0 _1 _2 _3 _4 _5 _6 _7 _8 _9 _A _B _C _D _E _F
7_
 
91-5-80

 
91-5-81

 
91-5-82

 
91-5-83

 
91-5-84

 
91-5-85

 
91-5-86

 
91-5-87

 
91-5-88

30F7
91-5-89

30F8
91-5-90

30F9
91-5-91

 
91-5-92

 
91-5-93

 
91-5-94

Footnotes[edit]

  1. ^ Outside of the trail byte range of an ISO 2022 94n set, but noted as being in use by some implementations.[7]
  2. ^ Coding of the ideographic space specified in the ANSI standard for EACC.[7] This is used as an exclamation mark in CCCII,[9] in addition to the exclamation mark at 0x212B3D.[15] The Hong Kong HKIUG variant of CCCII follows EACC here.[8]
  3. ^ The Encode::HanExtra mappings use U+FE52 for this character.[9] However, it appears here following ˊ, ˇ and ˋ,[15] which the other three tone marks for bopomofo. The mapping U+02D9 is more commonly used for this tone mark in bopomofo ranges of encodings, for example Big5.[19]

References[edit]

  1. ^ a b c d e f g h i j k l m n o p q r s Lunde, Ken (2009). CJKV Information Processing: Chinese, Japanese, Korean & Vietnamese Computing (2nd ed.). Sebastopol, CA: O'Reilly. pp. 122–124. ISBN 978-0-596-51447-1.
  2. ^ a b c Tang, Audrey (2007-11-10). "Encode::HanExtra - Extra sets of Chinese encodings". CCCII: The earliest (and most sophisticated) Traditional Chinese encoding, […] used mostly in library systems. […] Map for "CCCII" is supplied by the Koha Taiwan project.
  3. ^ a b c d Wittern, Christian (1995-05-01). "Chinese character codes: an update". International Research Institute for Zen Buddhism / Hanazono University. Archived from the original on 2004-10-12.
  4. ^ a b c Jenkins, John H.; Cook, Richard; Lunde, Ken (2020-03-05). "Unicode Han Database (Unihan)". Unicode Standard Annex #38.
  5. ^ "Archived copy". Archived from the original on 2016-06-15. Retrieved 2016-06-15.CS1 maint: archived copy as title (link)
  6. ^ Kangxi Dictionary, p. 1296, char. 1
  7. ^ a b c d e f g h i j Library of Congress (2007-12-05). "Code Table East Asian Punctuation Marks". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
  8. ^ a b c d e f g h i j k l m n o p q Hong Kong Innovative Users Group Unicode Task Force. "HKIUG Code Table for CJK Characters: Mapping to Unicode". University of Hong Kong Libraries.
  9. ^ a b c d e f g h i j k l m n o p q r s t u v w Tang, Audrey; Koha Taiwan. "Map for CCCII". Encode::HanExtra. CPAN.
  10. ^ "2.4: Multiple byte graphic character sets". International Register of Coded Character Sets to be Used With Escape Sequences (ISO-IR) (PDF). ITSCJ/IPSJ. p. 14.
  11. ^ Library of Congress (2007-12-05). "Technique 2: Using standard alternate graphic character sets". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
  12. ^ a b c Lunde, Ken (1995-12-18). "2.5.2: CCCII". CJK.INF Version 1.9.
  13. ^ a b c d e f g h i j k l Library of Congress (2007-12-05). "East Asian Code Tables". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
  14. ^ Library of Congress (2007-12-05). "Code Table Korean Hangul". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
  15. ^ a b c d e f g h i j Characters shown are, in part, cross-referenced against a representative BDF font for CCCII, distributed by Koichi Yasuoka of Kyoto University.
  16. ^ Hong Kong Innovative Users Group (2013-01-07). "Introduction to Hong Kong Innovative Users Group". University of Hong Kong Libraries.
  17. ^ a b c Library of Congress (2004-09-02). "Summary List of MARC 21 Characters Assigned to the Private Use Area (PUA)". MARC 21 Specifications for Record Structure, Character Sets, and Exchange Media.
  18. ^ Morris, Susan (2007). "Finding JACKPHY: Online Cataloging to Include Arabic, Hebrew, Other Scripts". Library of Congress Information Bulletin. 66 (12).
  19. ^ van Kesteren, Anne. "big5". Encoding Standard. WHATWG.

External links[edit]