behaviour change in getDirectoryContents in GHC 7.2?
Simon Marlow
marlowsd at gmail.com
Tue Nov 8 12:43:31 CET 2011
On 02/11/2011 21:40, Max Bolingbroke wrote:
> On 2 November 2011 20:16, Ian Lynagh<igloo at earth.li> wrote:
>> Are you saying there's a bug that should be fixed?
>
> You can choose between two options:
>
> 1. Failing to roundtrip some strings (in our case, those containing
> the 0xEFNN byte sequences)
> 2. Having GHC's decoding functions return strings including
> codepoints that should not be allowed (i.e. lone surrogates)
>
> At the time I implemented this there was significant support for 2, so
> that is what we have.
Don't you mean 1 is what we have?
> At the time I was convinced that 2 was the right
> thing to do, but now I'm more agnostic. But anyway the current
> behaviour is not really a bug -- it is by design :-)
Failing to roundtrip in some cases, and doing so silently, seems highly
suboptimal to me. I'm sorry I didn't pick up on this at the time
(Unicode is a swamp :).
Cheers,
Simon
More information about the Glasgow-haskell-users
mailing list