[Haskell-cafe] invalid character encoding

John Meacham john at repetae.net
Sat Mar 19 21:25:41 EST 2005


On Sat, Mar 19, 2005 at 03:04:04PM +0000, Glynn Clements wrote:
> I'm not suggesting inventing conventions. I'm suggesting leaving such
> issues to the application programmer who, unlike the library
> programmer, probably has enough context to be able to reliably
> determine the correct encoding in any specific instance.

But the whole point of Foreign.C.String is to interface to existing C
code. And one of the most common conventions of said interfaces is to
represent strings in the current locale, Which is why locale honoring
conversion routines are useful. 

I don't think anyone is arguing that this is the end-all of charset
conversion, far from it. A general conversion library and parameterized
conversion routines are also needed for many of the reasons you said,
and will probably appear at some point. I have my own iconv interface
which I used for my initial implementation of with/peekCString etc. and
I am sure other people have written their own, eventually one will be
standardized. A general conversion facility has been on the wishlist for
a long time.

However, at the moment, the FFI is tackling a much simpler goal of
interfacing with existing C code, and non-parameterized locale-honoring
conversion routines are extremely useful for that. Even if we had a nice
generalized conversion routine, a simple locale-honoring front end would
be a very useful interface because it is so commonly needed when
interfacing to C code.

However, I am sure everyone would be happy if a nice cabalized general
charset conversion library appeared... I have the start of one here, which
should work on any POSIXy system, even if wchar_t is not unicode (no
windows support though)
 http://repetae.net/john/recent/out/HsLocale.html

        John

-- 
John Meacham - ⑆repetae.net⑆john⑈ 


More information about the Haskell-Cafe mailing list