[Haskell-cafe] ANNOUNCE: utf8-string-0.1

Eric Mertens emertens at gmail.com
Mon Jul 9 12:37:18 EDT 2007


On 7/9/07, Henning Thielemann <lemming at henning-thielemann.de> wrote:
> HXT returns a list of warnings for invalid UTF-8 byte sequences:
>  http://www.fh-wedel.de/~si/HXmlToolbox/hdoc_arrow/Text-XML-HXT-DOM-Unicode.html#v%3Autf8ToUnicode
>
> Is your decoder lazy?
>

Yes, the decoder is lazy.

Regarding error handling, I noticed that Python has three modes for
decoding UTF-8: strict, replace, and ignore. Expressed as what the
decoder does to the rest of its output on hitting an invalid byte
sequence:

strict: error "bad encoding"
replace: ('\xfffd' :)
ignore: id

I could add these if there is interest.
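
As a rough sketch of how those three modes might plug into a lazy
decoder: this is not the utf8-string implementation. The names OnError,
onBad, and decode are made up for illustration, and the decoder is
assumed to work on a plain [Word8] list.

```haskell
import Data.Bits (shiftL, (.&.), (.|.))
import Data.Char (chr)
import Data.Word (Word8)

-- Hypothetical error modes, named after Python's.
data OnError = Strict | Replace | Ignore

-- What to do with the remaining output when a bad sequence is found.
onBad :: OnError -> String -> String
onBad Strict  = error "bad encoding"
onBad Replace = ('\xfffd' :)
onBad Ignore  = id

-- Lazy UTF-8 decoder: each character is produced before the rest of
-- the input is examined.
decode :: OnError -> [Word8] -> String
decode mode = go
  where
    go [] = []
    go (b:bs)
      | b < 0x80              = chr (fromIntegral b) : go bs
      | b >= 0xC2 && b < 0xE0 = multi 1 (fromIntegral b .&. 0x1F) 0x80 bs
      | b >= 0xE0 && b < 0xF0 = multi 2 (fromIntegral b .&. 0x0F) 0x800 bs
      | b >= 0xF0 && b < 0xF5 = multi 3 (fromIntegral b .&. 0x07) 0x10000 bs
      | otherwise             = onBad mode (go bs)

    -- Read n continuation bytes; reject overlong forms, surrogates,
    -- and code points above U+10FFFF.
    multi :: Int -> Int -> Int -> [Word8] -> String
    multi n acc lo bs =
      case continuation n acc bs of
        Just (c, rest) | c >= lo, valid c -> chr c : go rest
        _ -> onBad mode (go bs)  -- drop only the lead byte, then resync

    continuation :: Int -> Int -> [Word8] -> Maybe (Int, [Word8])
    continuation 0 acc rest = Just (acc, rest)
    continuation n acc (c:cs)
      | c .&. 0xC0 == 0x80 =
          continuation (n - 1) ((acc `shiftL` 6) .|. fromIntegral (c .&. 0x3F)) cs
    continuation _ _ _ = Nothing

    valid :: Int -> Bool
    valid c = c <= 0x10FFFF && (c < 0xD800 || c > 0xDFFF)
```

With this shape, strict mode only raises its error if the consumer
forces the output as far as the bad byte, so taking a valid prefix of
the result still works. Replace may emit more than one U+FFFD for a
single malformed sequence, because the sketch resynchronizes one byte
at a time.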

--
Eric Mertens

