[GHC] #11609: Document unicode report deviations

GHC ghc-devs at haskell.org
Tue Feb 23 11:38:15 UTC 2016


#11609: Document unicode report deviations
-------------------------------------+-------------------------------------
        Reporter:  thomie            |                Owner:
            Type:  task              |               Status:  new
        Priority:  normal            |            Milestone:
       Component:  Documentation     |              Version:  7.10.3
      Resolution:                    |             Keywords:  unicode,
                                     |  report-impact
Operating System:  Unknown/Multiple  |         Architecture:
                                     |  Unknown/Multiple
 Type of failure:  None/Unknown      |            Test Case:
      Blocked By:                    |             Blocking:
 Related Tickets:  #1103, #4373,     |  Differential Rev(s):
  #7650, #10196, #11012              |
       Wiki Page:                    |
-------------------------------------+-------------------------------------
Changes (by thomie):

 * related:  #1103, #4373, #10196, #11012 => #1103, #4373, #7650, #10196,
     #11012


@@ -16,2 +16,3 @@
- * `ModifierLetter` (#10196) and `OtherNumber` (#4373) are allowed in
- identifiers, but only starting from the second character.
+ * `ModifierLetter` (#10196), `OtherNumber` (#4373) and `NonSpacingMark`
+ (#7650) are allowed in identifiers, but only starting from the second
+ character.

New description:

 @nomeata mentions in #10196:

 The report specifies “Haskell compilers are expected to make use of new
 versions of Unicode as they are made available.” So if we deviate from
 that, we should make sure that

 * the user’s guide explicitly lists all deviations from the report
 [https://downloads.haskell.org/~ghc/latest/docs/html/users_guide/bugs-and-
 infelicities.html#infelicities-lexical in this section], and
 * that the Haskell prime committee is going to be aware of these
 (sensible) deviations, so that they can become official.

 Certain deviations are (there might be more):
 * `OtherLetter` are treated as lowercase (#1103), and thus allowed in
 identifiers.
 * `ModifierLetter` (#10196), `OtherNumber` (#4373) and `NonSpacingMark`
 (#7650) are allowed in identifiers, but only starting from the second
 character.
 * `$decdigit = $ascdigit -- for now, should really be $digit (ToDo)` (see
 compiler/parser/Lexer.x)

--

--
Ticket URL: <http://ghc.haskell.org/trac/ghc/ticket/11609#comment:1>
GHC <http://www.haskell.org/ghc/>
The Glasgow Haskell Compiler


More information about the ghc-tickets mailing list