[GHC] #10196: Regression regarding Unicode subscript characters in identifiers

GHC ghc-devs at haskell.org
Tue Jun 23 19:12:48 UTC 2015


#10196: Regression regarding Unicode subscript characters in identifiers
-------------------------------------+-------------------------------------
        Reporter:  thomie            |                   Owner:
            Type:  bug               |  thoughtpolice
        Priority:  normal            |                  Status:  patch
       Component:  Compiler          |               Milestone:  7.10.3
  (Parser)                           |                 Version:  7.10.1
      Resolution:                    |                Keywords:
Operating System:  Unknown/Multiple  |            Architecture:
 Type of failure:  GHC rejects       |  Unknown/Multiple
  valid program                      |               Test Case:
      Blocked By:                    |                Blocking:
 Related Tickets:  #5108             |  Differential Revisions:  Phab:D969
-------------------------------------+-------------------------------------

Comment (by thomie):

 Replying to [comment:5 hvr]:
 > We're planning to allow `Lm` from the 2nd character on in an identifier
 for 7.10.2

 The current patch does exactly this. It still needs a changelog entry.

 I would have preferred to only allow `Lm` in the suffix of an identifier.
 But we can leave that for 7.12 or later, as there is a slight chance it
 breaks someone's code. We could mention it in the docs.

 There's also the issue that ModifierLetter perhaps brings in too many
 weird characters:

 "15-06-18T11:46:27"<             hvr@> thomie: can we easily list all
 modifier letters in Haskell?
 "15-06-18T11:47:55"<             hvr@>  [ c | c <- ['\0'..],
 generalCategory c == ModifierLetter ]
 "15-06-18T11:47:56"<             hvr@> got it
 "15-06-18T11:48:56"<             hvr@> ok, there's a lot in there one
 doesn't want to allow in identifiers :-/
 "15-06-18T11:49:31"<          thomie > booh
 "15-06-18T11:49:50"<             hvr@> these look nasty:
 "15-06-18T11:50:46"<             hvr@> so many column variants, theres
 also "ː"

 hvr: do you think this a big enough issue to not proceed with the current
 patch?

--
Ticket URL: <http://ghc.haskell.org/trac/ghc/ticket/10196#comment:9>
GHC <http://www.haskell.org/ghc/>
The Glasgow Haskell Compiler


More information about the ghc-tickets mailing list