Haskell Platform Proposal: add the 'text' library

Johan Tibell johan.tibell at gmail.com
Wed Oct 20 11:28:15 EDT 2010


On Wed, Oct 20, 2010 at 5:11 PM, Tyson Whitehead <twhitehead at gmail.com> wrote:
> I read the wikipedia article on code points, but still do not feel I have a
> firm grasp as to what exactly you are referring to.
>
> If you have a few minutes, would you mind providing a short example to clarify
> this with a specific example (e.g., a specific code point that gives issues with
> a 1:1 model and what those issues are).

Have a look at combining characters:

    http://en.wikipedia.org/wiki/Combining_character

For example, a Danish user would consider the single Unicode code
point A-RING the same as the two code points A + COMBINING RING.

If you process a string code point by code point you might mistakenly
confuse a plain "a" (A) with a "å" (A-RING *or* A + COMBINING RING).

Johan


More information about the Libraries mailing list