UTF8 libraries

Bayley, Alistair Alistair_Bayley at invescoperpetual.co.uk
Fri Feb 2 10:50:37 EST 2007


> From: libraries-bounces at haskell.org 
> 
> If someone is interested in suppourting UTF8, I recommend to
> use oniguruma.
> 
> http://www.geocities.jp/kosako3/oniguruma/
> 
> Oniguruma also supports UTF-16BE, UTF-16LE, UTF-32BE, UTF-32LE,
> etc .... And it is portable, it's available both on Unix and
> Windows.
> 
> So I think it is best regex C library to choose backend.

Sorry, I didn't explain this so well. I mean an decoder to marshal a
C-string that I know is UTF8 into a Haskell String (i.e. [Char]). An FFI
call out to C might be convenient, but will have overhead. It's not that
hard to write a UTF8 decoder (and encoder) in Haskell; I just wanted to
avoid wasted work.

Alistair
*****************************************************************
Confidentiality Note: The information contained in this message,
and any attachments, may contain confidential and/or privileged
material. It is intended solely for the person(s) or entity to
which it is addressed. Any review, retransmission, dissemination,
or taking of any action in reliance upon this information by
persons or entities other than the intended recipient(s) is
prohibited. If you received this in error, please contact the
sender and delete the material from any computer.
*****************************************************************


More information about the Libraries mailing list