[Haskell-cafe] ANNOUNCE: utf8-string-0.1
Eric Mertens
emertens at gmail.com
Mon Jul 9 12:37:18 EDT 2007
On 7/9/07, Henning Thielemann <lemming at henning-thielemann.de> wrote:
> HXT returns a list of warnings for invalid UTF-8 byte sequences:
> http://www.fh-wedel.de/~si/HXmlToolbox/hdoc_arrow/Text-XML-HXT-DOM-Unicode.html#v%3Autf8ToUnicode
>
> Is your decoder lazy?
>
Yes, the decoder is lazy.
Regarding error handling, I noticed that Python has three modes for
decoding UTF-8: strict, replace, and ignore.
strict: error "bad encoding"
replace: ('\xfffd' :)
ignore: id
which I could add if there was interest.
--
Eric Mertens
More information about the Haskell-Cafe
mailing list