Haskell Platform Proposal: add the 'text' library
Johan Tibell
johan.tibell at gmail.com
Wed Oct 20 11:28:15 EDT 2010
On Wed, Oct 20, 2010 at 5:11 PM, Tyson Whitehead <twhitehead at gmail.com> wrote:
> I read the wikipedia article on code points, but still do not feel I have a
> firm grasp as to what exactly you are referring to.
>
> If you have a few minutes, would you mind providing a short example to clarify
> this with a specific example (e.g., a specific code point that gives issues with
> a 1:1 model and what those issues are).
Have a look at combining characters:
http://en.wikipedia.org/wiki/Combining_character
For example, a Danish user would consider the single Unicode code
point A-RING (U+00E5) to be the same character as the two code points
A + COMBINING RING (U+0061 U+030A).
If you process a string code point by code point, you might mistake
the A that starts the two-code-point form for a plain "a", when it is
really part of an "å" (which can be written as A-RING *or* as
A + COMBINING RING).
Johan