UTF-8 library
Ashley Yakeley
ashley@semantic.org
Wed, 7 Aug 2002 03:14:08 -0700
At 2002-08-07 03:03, Axel Simon wrote:
>> Text encoded with ISO 8859-1 or UTF-8 is octets. If you want to use
>> CChars, you should then subsequently convert the Word8s into CChars.
>Then I hope there is no C implementation where char is less than 8 bits
>long.
There may or may not be. Squeezing in octets from ISO 8859-1 encoding is
a problem you'll have to deal with either way, and is entirely separate
from the text encoding process.
--
Ashley Yakeley, Seattle WA