
|
|
|
- byte order mark - the bom
- character properties
- chinese characters - cjk/han ideographs
- codes for common punctuation and symbols
- data as text and text as data
- encoding architecture
- key gotchas
- text comparison
- text encoding conversions
- text rendering
|
- text segmentation
- text transformations
- unicode and text
- unicode code chart sample
- unicode codespace
- unicode in practice
- utf-16
- utf-32
- utf-8
|