# Confusables # DRAFT 1 # This is a draft list of visually confusable characters, for use in conjunction with the recommendations in http://www.unicode.org/reports/tr36/ # At this point the focus is on characters that characters that remain after NFKD normalization and case folding. # # The format is any number of strings (usually single characters), separated by semicolons. # The final data may use hex values, but while gathering the data the actual characters are easier to work with. # To fold using this list, first perform NFKD and case folding (if not already performed), then map to the first character(s) in the row. # Accents should be added to this list wherever letters with them will collide at small sizes / low resolutions, e.g. ş ; ș ; ṣ # Note: it is unclear whether it is worth trying to make the distinctions marked with @ below; feedback is welcome # =================== @identical in most fonts (where both are representable) ə ; ә а ; a e ; е o ; ο ; о s ; ѕ x ; х æ ; ӕ i ; і j ; ј p ; р c ; с y ; у v ; ѵ ; ν b ; ƅ #... @very similar or identical in some fonts a ; ɑ g ; ɡ ı ; ɩ ł ; ɫ l ; 1 # m ; rn #... @similar or identical at small sizes in some fonts o ; ɒ # dz ; ʣ # no upper p ; ρ ; р r ; г п ; π є ; ε n ; ո h ; հ u ; ս g ; ց f ; ք y ; ყ 0 ; o ; ୦ ; ০ ; ੦ ; ం ; ౦ ; ಂ ; ೦ ; ๐ ; ໐ ; ᠐ ; 〇 ; օ 6 ; б 8 ; ৪ ; ଃ 9 ; ৭ ; ੧ ; ୨ : ; ಃ ; ః - ; ‐ ; ‒ ; – ; — ; ― ̂ ; ̑ ; # circumflex & inverted breve @ syntax spoofs / ; ⁄ ; ∕ . ; ܂ ; ܂ : ; ︰; ։