[GHC] #14589: The isUpper function should return true for the '\9438' character

GHC ghc-devs at haskell.org
Sat Dec 16 14:58:49 UTC 2017


#14589: The isUpper function should return true for the '\9438' character
-------------------------------------+-------------------------------------
        Reporter:  mrkkrp            |                Owner:  (none)
            Type:  bug               |               Status:  new
        Priority:  normal            |            Milestone:
       Component:  libraries/base    |              Version:  8.2.1
      Resolution:                    |             Keywords:
Operating System:  Unknown/Multiple  |         Architecture:
                                     |  Unknown/Multiple
 Type of failure:  None/Unknown      |            Test Case:
      Blocked By:                    |             Blocking:
 Related Tickets:                    |  Differential Rev(s):
       Wiki Page:                    |
-------------------------------------+-------------------------------------

Comment (by nomeata):

 This is documented behavior. `isUpper` says:

 > Selects upper-case or title-case alphabetic Unicode characters
 (letters). Title case is used by a small number of letter ligatures like
 the single-character form of Lj.

 Note that is says “letter”, and the code in libraries/base/cbits/WCsubst.c
 explicitly selects only uppper-case and title-case letters.

 I see how this is a bit unfortunate, but I am sure we should not change
 the semantics of `isUpper` .

 Maybe we are missing a function? `isUpperAnything` or something?

-- 
Ticket URL: <http://ghc.haskell.org/trac/ghc/ticket/14589#comment:2>
GHC <http://www.haskell.org/ghc/>
The Glasgow Haskell Compiler


More information about the ghc-tickets mailing list