[Haskell-cafe] invalid character encoding

Dimitry Golubovsky dimitry at golubovsky.org
Sat Mar 19 23:13:29 EST 2005


Glynn Clements wrote:



> To get a better idea, you would need to consult users whose language
> doesn't use the roman alphabet, e.g. CJK or cyrillic. Unfortunately,
> you don't usually find too many of them on lists such as this.


In Russia, we still have multiple one byte encodings for Cyrillic: KOI-8 
(Unix), CP1251 (Windows), and getting more and more obsolete CP866 
(MSDOS, OS/2). Regarding filenames, I am sure Windows stores them in 
Unicode regarding of locale (I tried various chcp numbers in a console 
window, printing directory containing filenames in Russian and in German 
altogether, and it showed "non-characters" as question marks when 
locale-based codepage was set, and showed everything with chcp 65001 
which is Unicode). AFAIK Unix users do not create files named in Russian 
very often, and Windows users do this frequently.

Dimitry  Golubovsky
Middletown, CT




More information about the Haskell-Cafe mailing list