[Haskell-cafe] Re: String vs ByteString

wren ng thornton wren at freegeek.org
Sat Aug 14 21:53:29 EDT 2010


Yitzchak Gale wrote:
> Bryan O'Sullivan wrote:
>> In general, Unicode uptake is increasing rapidly:
>>  http://googleblog.blogspot.com/2010/01/unicode-nearing-50-of-web.html
> 
> These Google graphs are the oft-quoted source of
> Unicode's growing dominance. But the data for those graphs
> is taken from Google's own web indexing.

Note also that the encodings near the bottom of the graph are holding 
relatively constant. UTF-8 is taking its market share from ASCII and 
Western European encodings, not so much from other encodings (as yet).

As Bryan mentioned, Unicode doesn't have wide acceptance in CJK 
countries. These days, Japanese websites seem to have finally started to 
standardize, in the sense that they use HTTP/HTML headers to say which 
encoding the pages are in (and generally use JIS or Shift-JIS). This is 
a big step up from a decade ago, when non-commercial sites almost 
invariably required fiddling with the browser to get rid of mojibake. 
Japan hasn't been bitten by the i18n/l10n bug, and it doesn't have a 
strong F/OSS community to drive adoption either.

-- 
Live well,
~wren
