Han Unification - Unicode Ranges


Ideographic characters assigned by Unicode appear in the following blocks:

  • CJK Unified Ideographs (4E00–9FFF)
  • CJK Unified Ideographs Extension A (3400–4DBF)
  • CJK Unified Ideographs Extension B (20000–2A6DF)
  • CJK Unified Ideographs Extension C (2A700–2B73F)
  • CJK Unified Ideographs Extension D (2B740–2B81F)
  • CJK Compatibility Ideographs (F900–FAFF) (the twelve characters at FA0E, FA0F, FA11, FA13, FA14, FA1F, FA21, FA23, FA24, FA27, FA28 and FA29 are actually "unified ideographs", not "compatibility ideographs")
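
As an illustration, the ranges listed above can be turned into a simple lookup. The following is a minimal Python sketch, not an official API: the names UNIFIED_RANGES, UNIFIED_IN_COMPAT and unified_block are hypothetical, and the table covers only the blocks named in this list. Note that a block range also contains unassigned code points, so a match means "falls inside the block", not "is an assigned character".

```python
# Block ranges for unified ideographs, as listed above.
# Extension D is taken to start at U+2B740 (it directly follows Extension C).
UNIFIED_RANGES = [
    (0x4E00, 0x9FFF, "CJK Unified Ideographs"),
    (0x3400, 0x4DBF, "CJK Unified Ideographs Extension A"),
    (0x20000, 0x2A6DF, "CJK Unified Ideographs Extension B"),
    (0x2A700, 0x2B73F, "CJK Unified Ideographs Extension C"),
    (0x2B740, 0x2B81F, "CJK Unified Ideographs Extension D"),
]

# The twelve unified ideographs that live in the
# CJK Compatibility Ideographs block (F900-FAFF).
UNIFIED_IN_COMPAT = {
    0xFA0E, 0xFA0F, 0xFA11, 0xFA13, 0xFA14, 0xFA1F,
    0xFA21, 0xFA23, 0xFA24, 0xFA27, 0xFA28, 0xFA29,
}


def unified_block(ch):
    """Return the name of the unified-ideograph block containing ch, or None."""
    cp = ord(ch)
    for lo, hi, name in UNIFIED_RANGES:
        if lo <= cp <= hi:
            return name
    if cp in UNIFIED_IN_COMPAT:
        return "CJK Compatibility Ideographs (unified)"
    return None


if __name__ == "__main__":
    for ch in ["中", "\u3400", "\U00020000", "\uFA0E", "\uF900", "A"]:
        print(f"U+{ord(ch):04X}", unified_block(ch))
```

A linear scan is enough for five ranges; a larger table would more likely use a sorted list with bisection.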

Unicode supports CJKV radicals, strokes, punctuation, marks and symbols in the following blocks:

  • CJK Radicals Supplement (2E80–2EFF)
  • CJK Symbols and Punctuation (3000–303F)
  • CJK Strokes (31C0–31EF)
  • Ideographic Description Characters (2FF0–2FFF)

Additional compatibility characters, whose use is discouraged, appear in these blocks:

  • Kangxi Radicals (2F00–2FDF)
  • Enclosed CJK Letters and Months (3200–32FF)
  • CJK Compatibility (3300–33FF)
  • CJK Compatibility Ideographs (F900–FAFF)
  • CJK Compatibility Ideographs Supplement (2F800–2FA1F)
  • CJK Compatibility Forms (FE30–FE4F)

These compatibility characters (excluding the twelve unified ideographs in the CJK Compatibility Ideographs block) are included for compatibility with legacy text handling systems and other legacy character sets. They include forms of characters for vertical text layout and rich text characters that Unicode recommends handling through other means.
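
One way to observe the difference between compatibility characters and unified ideographs is to apply Unicode normalization. The sketch below uses Python's standard unicodedata module; the helper name folds_under_nfkc and the choice of sample characters are assumptions made for illustration only. It checks whether NFKC normalization replaces a character with something else, which happens for characters that carry a decomposition mapping but not for unified ideographs.

```python
import unicodedata


def folds_under_nfkc(ch):
    """Return True if NFKC normalization maps ch to a different string."""
    return unicodedata.normalize("NFKC", ch) != ch


# Illustrative samples, one per kind of character discussed above.
samples = {
    "\u2F00": "Kangxi radical (compatibility)",
    "\uF900": "CJK compatibility ideograph",
    "\uFA0E": "unified ideograph inside the compatibility block",
    "\u4E00": "ordinary unified ideograph",
}

if __name__ == "__main__":
    for ch, label in samples.items():
        normalized = unicodedata.normalize("NFKC", ch)
        codepoints = " ".join(f"U+{ord(c):04X}" for c in normalized)
        print(f"U+{ord(ch):04X} ({label}): NFKC -> {codepoints}, "
              f"changed={folds_under_nfkc(ch)}")
```

Normalization is lossy for these characters, so a system that must round-trip text with a legacy character set may prefer to leave them untouched.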
