[GHC] #9114: Invalid UTF8 not round-tripped correctly

GHC ghc-devs at haskell.org
Thu May 15 09:11:22 UTC 2014


#9114: Invalid UTF8 not round-tripped correctly
------------------------------------+-------------------------------------
       Reporter:  nomeata           |             Owner:
           Type:  bug               |            Status:  new
       Priority:  normal            |         Milestone:
      Component:  libraries/base    |           Version:  7.6.3
       Keywords:                    |  Operating System:  Unknown/Multiple
   Architecture:  Unknown/Multiple  |   Type of failure:  None/Unknown
     Difficulty:  Unknown           |         Test Case:
     Blocked By:                    |          Blocking:
Related Tickets:                    |
------------------------------------+-------------------------------------
 As reported by Robert Bihlmeyer at http://bugs.debian.org/748125, the
 promised round-tripping of invalid UTF8 sequences in filenames through
 String does not work:

 ```
 $ mkdir foo
 $ touch foo/$(echo -e '\xC0\xB7.txt')
 $ ghc -e 'System.Directory.getDirectoryContents "foo" >>= print . last'
 "7.txt"
 ```

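 The byte sequence 0xC0 0xB7 is an overlong (and therefore invalid) encoding
 of 0x37, i.e. `'7'`. If the decoder maps it to `'7'`, the original bytes
 cannot be recovered when the String is encoded again, so no round-tripping
 is possible. (Other invalid byte sequences are round-tripped.)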
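
 To illustrate the arithmetic, here is a minimal sketch of a two-byte UTF-8
 decoder, written for this report and not taken from base. The names
 `naiveDecode2` and `strictDecode2` are hypothetical. The naive variant
 combines the payload bits without checking for overlong forms, which
 matches the behaviour observed above. The strict variant rejects lead bytes
 0xC0 and 0xC1, so such input could be handled like other invalid bytes.

```haskell
import Data.Bits (shiftL, (.&.), (.|.))
import Data.Char (chr)
import Data.Word (Word8)

-- Hypothetical helper: combine the payload bits of a two-byte sequence
-- (110xxxxx 10yyyyyy) without any validity checks.
naiveDecode2 :: Word8 -> Word8 -> Char
naiveDecode2 b1 b2 =
  chr (((fromIntegral b1 .&. 0x1F) `shiftL` 6) .|. (fromIntegral b2 .&. 0x3F))

-- Hypothetical helper: accept only well-formed two-byte sequences.
-- Lead bytes 0xC0 and 0xC1 can only produce code points below 0x80,
-- which must be encoded in one byte, so they are rejected as overlong.
strictDecode2 :: Word8 -> Word8 -> Maybe Char
strictDecode2 b1 b2
  | b1 < 0xC2 || b1 > 0xDF = Nothing   -- not a valid two-byte lead byte
  | b2 < 0x80 || b2 > 0xBF = Nothing   -- not a continuation byte
  | otherwise              = Just (naiveDecode2 b1 b2)
```

 With these definitions loaded in GHCi, `naiveDecode2 0xC0 0xB7` evaluates to
 `'7'`, while `strictDecode2 0xC0 0xB7` evaluates to `Nothing`.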

--
Ticket URL: <http://ghc.haskell.org/trac/ghc/ticket/9114>
GHC <http://www.haskell.org/ghc/>
The Glasgow Haskell Compiler