Skip to content
This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

Commit

Permalink
docs: 完善文档
Browse files Browse the repository at this point in the history
  • Loading branch information
kitty-panics committed Jun 16, 2022
1 parent f5f5ae2 commit 6de86cc
Showing 1 changed file with 89 additions and 50 deletions.
139 changes: 89 additions & 50 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,17 +1,19 @@
# Unicode CJK
# Unicode-CJK

整理所有 Unicode CJK 字符。
整理所有 [Unicode] CJK 字符。

[Unicode]: https://www.unicode.org/Public/UNIDATA/Blocks.txt

## 协作整理

由于疏忽、参考资料有误等,码表可能会存在一些错误,
如果你发现了错误请通过 [PR] 或者 [issues] 反馈给我。
由于疏忽、参考资料有误等,码表可能会存在一些错误,如果你发现了错误请通过
[Issues] 或者 [PR] 反馈给我。

+ 如果不会 Git 操作可在 [Issues] 中发起反馈。
+ 如果熟悉 Git 操作可在更正错误后发起 [PR]
+ 如果不会 Git 操作可在 [issues] 中发起反馈。

[Issues]: https://github.com/kitty-panics/unicode-cjk/issues
[PR]: https://github.com/kitty-panics/unicode-cjk/pulls
[issues]: https://github.com/kitty-panics/unicode-cjk/issues

## 数据格式

Expand All @@ -21,78 +23,115 @@
U+3007 〇
U+4E00 一
U+4E01 丁
U+4E02 丂
```

## 文件列表

+ [All.txt](All.txt)
+ 整合下面 CJK/A/B/C/D/E/F/G/Compatibility/Compatibility-Supplement。
+ [CJK-Unified-Ideographs.txt](CJK-Unified-Ideographs.txt) (中日韩统一表意文字)
+ 代码点:U+3007, U+4E00-U+9FFF (1 + 20992 个)
+ 已使用:U+3007, U+4E00-U+9FFF (1 + 20992 字)
+ [CJK-Unified-Ideographs-Extension-A.txt](CJK-Unified-Ideographs-Extension-A.txt) (中日韩统一表意文字扩展区 A)
+ 代码点:U+3400-U+4DBF (6592 个)
+ 已使用:U+3400-U+4DBF (6592 字)
+ [CJK-Unified-Ideographs-Extension-B.txt](CJK-Unified-Ideographs-Extension-B.txt) (中日韩统一表意文字扩展区 B)
+ 代码点:U+20000-U+2A6DF (42720 个)
+ 已使用:U+20000-U+2A6DF (42720 字)
+ [CJK-Unified-Ideographs-Extension-C.txt](CJK-Unified-Ideographs-Extension-C.txt) (中日韩统一表意文字扩展区 C)
+ 代码点:U+2A700-U+2B73F (4160 个)
+ 已使用:U+2A700-U+2B738 (4153 字)
+ [CJK-Unified-Ideographs-Extension-D.txt](CJK-Unified-Ideographs-Extension-D.txt) (中日韩统一表意文字扩展区 D)
+ 代码点:U+2B740-U+2B81F (224 个)
+ 已使用:U+2B740-U+2B81D (222 字)
+ [CJK-Unified-Ideographs-Extension-E.txt](CJK-Unified-Ideographs-Extension-E.txt) (中日韩统一表意文字扩展区 E)
+ 代码点:U+2B820-U+2CEAF (5776 个)
+ 已使用:U+2B820-U+2CEA1 (5762 字)
+ [CJK-Unified-Ideographs-Extension-F.txt](CJK-Unified-Ideographs-Extension-F.txt) (中日韩统一表意文字扩展区 F)
+ 代码点:U+2CEB0-U+2EBEF (7488 个)
+ 已使用:U+2CEB0-U+2EBE0 (7473 字)
+ [CJK-Unified-Ideographs-Extension-G.txt](CJK-Unified-Ideographs-Extension-G.txt) (中日韩统一表意文字扩展区 G)
+ 代码点:U+30000-U+3134F (4944 个)
+ 已使用:U+30000-U+3134A (4939 字)
+ [CJK-Compatibility-Ideographs.txt](CJK-Compatibility-Ideographs.txt) (中日韩兼容表意文字)
+ 代码点:U+F900-U+FAFF (512 个)
+ 已使用:U+F900-U+FA6D, U+FA70-U+FAD9 (472 字)
+ [CJK-Compatibility-Ideographs-Supplement.txt](CJK-Compatibility-Ideographs-Supplement.txt) (中日韩兼容表意文字增补)
+ 代码点:U+2F800-U+2FA1F (544 个)
+ 已使用:U+2F800-U+2FA1D (542 字)
+ [All.txt] (整合下面 CJK/A/B/C/D/E/F/G/Compatibility/Compatibility-Supplement)
+ [CJK-Unified-Ideographs.txt] (中日韩统一表意文字)
+ 代码点:U+3007, U+4E00-U+9FFF (1 + 20992 个)。
+ 已使用:U+3007, U+4E00-U+9FFF (1 + 20992 字)。
+ [CJK-Unified-Ideographs-Extension-A.txt] (中日韩统一表意文字扩展区 A)
+ 代码点:U+3400-U+4DBF (6592 个)。
+ 已使用:U+3400-U+4DBF (6592 字)。
+ [CJK-Unified-Ideographs-Extension-B.txt] (中日韩统一表意文字扩展区 B)
+ 代码点:U+20000-U+2A6DF (42720 个)。
+ 已使用:U+20000-U+2A6DF (42720 字)。
+ [CJK-Unified-Ideographs-Extension-C.txt] (中日韩统一表意文字扩展区 C)
+ 代码点:U+2A700-U+2B73F (4160 个)。
+ 已使用:U+2A700-U+2B738 (4153 字)。
+ [CJK-Unified-Ideographs-Extension-D.txt] (中日韩统一表意文字扩展区 D)
+ 代码点:U+2B740-U+2B81F (224 个)。
+ 已使用:U+2B740-U+2B81D (222 字)。
+ [CJK-Unified-Ideographs-Extension-E.txt] (中日韩统一表意文字扩展区 E)
+ 代码点:U+2B820-U+2CEAF (5776 个)。
+ 已使用:U+2B820-U+2CEA1 (5762 字)。
+ [CJK-Unified-Ideographs-Extension-F.txt] (中日韩统一表意文字扩展区 F)
+ 代码点:U+2CEB0-U+2EBEF (7488 个)。
+ 已使用:U+2CEB0-U+2EBE0 (7473 字)。
+ [CJK-Unified-Ideographs-Extension-G.txt] (中日韩统一表意文字扩展区 G)
+ 代码点:U+30000-U+3134F (4944 个)。
+ 已使用:U+30000-U+3134A (4939 字)。
+ [CJK-Compatibility-Ideographs.txt] (中日韩兼容表意文字)
+ 代码点:U+F900-U+FAFF (512 个)。
+ 已使用:U+F900-U+FA6D, U+FA70-U+FAD9 (472 字)。
+ [CJK-Compatibility-Ideographs-Supplement.txt] (中日韩兼容表意文字增补)
+ 代码点:U+2F800-U+2FA1F (544 个)。
+ 已使用:U+2F800-U+2FA1D (542 字)。

**注:**

为方便管理,将争议字符 "〇 U+3007" 由 "CJK Symbols and Punctuation (中日韩符号和标点)"
移动到 "CJK Unified Ideographs (中日韩统一表意文字)",并置于文件开头。

[All.txt]: All.txt
[CJK-Unified-Ideographs.txt]: CJK-Unified-Ideographs.txt
[CJK-Unified-Ideographs-Extension-A.txt]: CJK-Unified-Ideographs-Extension-A.txt
[CJK-Unified-Ideographs-Extension-B.txt]: CJK-Unified-Ideographs-Extension-B.txt
[CJK-Unified-Ideographs-Extension-C.txt]: CJK-Unified-Ideographs-Extension-C.txt
[CJK-Unified-Ideographs-Extension-D.txt]: CJK-Unified-Ideographs-Extension-D.txt
[CJK-Unified-Ideographs-Extension-E.txt]: CJK-Unified-Ideographs-Extension-E.txt
[CJK-Unified-Ideographs-Extension-F.txt]: CJK-Unified-Ideographs-Extension-F.txt
[CJK-Unified-Ideographs-Extension-G.txt]: CJK-Unified-Ideographs-Extension-G.txt
[CJK-Compatibility-Ideographs.txt]: CJK-Compatibility-Ideographs.txt
[CJK-Compatibility-Ideographs-Supplement.txt]: CJK-Compatibility-Ideographs-Supplement.txt

## 参考资料

参考资料可在 [参考资料] 目录下找到。其中非文件类的在线资料将转换成 PDF 快照存放。

+ [Blocks.txt - Unicode]
+ [Blocks.txt]

[参考资料]: 参考资料
[Blocks.txt - Unicode]: https://www.unicode.org/Public/UCD/latest/ucd/Blocks.txt
[Blocks.txt]: https://www.unicode.org/Public/UNIDATA/Blocks.txt

## 相关项目

### [cn-tables]

整理中国大陆简中、中国台湾繁中的国标汉字表。

[cn-tables]: https://github.com/kitty-panics/cn-tables

### [CNS11643-Unicode-Cangjie]

[CNS11643]、Unicode、Cangjie 对照表。

[CNS11643-Unicode-Cangjie]: https://github.com/kitty-panics/CNS11643-Unicode-Cangjie
[CNS11643]: https://data.gov.tw/dataset/5961

### [unicode-cjk]

收集整理所有 Unicode CJK 字符。
整理所有 [Unicode] CJK 字符。

[unicode-cjk]: https://github.com/kitty-panics/unicode-cjk
[Unicode]: https://www.unicode.org/Public/UNIDATA/Blocks.txt

### [cn-tables]
### [unicode-cjk-98wubi]

收集整理中国的国标汉字表,即 [通用规范汉字表][常用标准字体表 (甲表)]
[次常用标准字体表 (乙表)]
整理 Unicode CJK 字符的 [五笔98] 编码。

[cn-tables]: https://github.com/kitty-panics/cn-tables
[通用规范汉字表]: https://zh.wikipedia.org/wiki/%E9%80%9A%E7%94%A8%E8%A7%84%E8%8C%83%E6%B1%89%E5%AD%97%E8%A1%A8
[常用标准字体表 (甲表)]: https://zh.wikipedia.org/wiki/%E5%B8%B8%E7%94%A8%E5%9C%8B%E5%AD%97%E6%A8%99%E6%BA%96%E5%AD%97%E9%AB%94%E8%A1%A8
[次常用标准字体表 (乙表)]: https://baike.baidu.com/item/%E6%AC%A1%E5%B8%B8%E7%94%A8%E5%9B%BD%E5%AD%97%E6%A0%87%E5%87%86%E5%AD%97%E4%BD%93%E8%A1%A8
[unicode-cjk-98wubi]: https://github.com/kitty-panics/unicode-cjk-98wubi
[五笔98]: http://98wb.ysepan.com

### [unicode-cjk-ids]

备份 "[chise/ids.git]" 仓库
备份、修补 [chise/ids]

[unicode-cjk-ids]: https://github.com/kitty-panics/unicode-cjk-ids
[chise/ids.git]: http://git.chise.org/git/chise/ids.git
[chise/ids]: http://git.chise.org/git/chise/ids.git

### [unicode-cjk-zhlf]

整理 Unicode CJK 字符的 [字海两分] 编码。

[unicode-cjk-zhlf]: https://github.com/kitty-panics/unicode-cjk-zhlf
[字海两分]: http://cheonhyeong.com/Simplified/download.html

### [unicode-cjk-zhlf-sc]

整理 Unicode CJK 字符的 [字海两分速成] 编码。

[unicode-cjk-zhlf-sc]: https://github.com/kitty-panics/unicode-cjk-zhlf-sc
[字海两分速成]: http://cheonhyeong.com/Simplified/download.html

0 comments on commit 6de86cc

Please sign in to comment.