CWString
John Meacham
john at repetae.net
Wed Aug 27 05:12:50 EDT 2003
On Wed, Aug 27, 2003 at 09:58:08AM +0100, Simon Marlow wrote:
> > > > In our new implementation of Data.Char.isUpper and
> > friends, I made the
> > > > simplifying assumption that Char==wchar_t==Unicode. With
> > glibc, this
> > > > appears to be valid as long as (a) you set LANG to
> > something other than
> > > > "C" or "POSIX", and (b) you call setlocale() first.
> > >
> > > The glibc Info file says:
> > > The wide character character set always is UCS4, at least on
> > > GNU systems.
> > yes. with glibc, wchar_t is always unicode no matter what the locale.
> > better yet, all ISO C implementations define a handy C symbol to test
> > for this. if __STDC_ISO_10646__ is defined then wchar_t is always
> > unicode no matter what.
>
> Sure, but as I've been saying, the implementation of glibc doesn't do
> this. In the C or POSIX locale, the ctype macros only recognise ASCII.
> Should this be considered a bug in glibc?
hmm.. how odd. I would consider it a bug, I think. I don't have a copy
of the ISO spec handy but will be sure to look up whether that is
conforming... It is certainly a malfeature if it is not a bug...
John
--
---------------------------------------------------------------------------
John Meacham - California Institute of Technology, Alum. - john at foo.net
---------------------------------------------------------------------------
More information about the FFI
mailing list