[GHC] #11609: Document unicode report deviations

#11609: Document unicode report deviations -------------------------------------+------------------------------------- Reporter: thomie | Owner: Type: task | Status: new Priority: normal | Milestone: Component: Documentation | Version: 7.10.3 Keywords: unicode, | Operating System: Unknown/Multiple report-impact | Architecture: | Type of failure: None/Unknown Unknown/Multiple | Test Case: | Blocked By: Blocking: | Related Tickets: #1103, #4373, | #10196, #11012 Differential Rev(s): | Wiki Page: -------------------------------------+------------------------------------- @nomeata mentions in #10196: The report specifies “Haskell compilers are expected to make use of new versions of Unicode as they are made available.” So if we deviate from that, we should make sure that * the user’s guide explicitly lists all deviations from the report [https://downloads.haskell.org/~ghc/latest/docs/html/users_guide/bugs-and- infelicities.html#infelicities-lexical in this section], and * that the Haskell prime committee is going to be aware of these (sensible) deviations, so that they can become official. Certain deviations are (there might be more): * `OtherLetter` are treated as lowercase (#1103), and thus allowed in identifiers. * `ModifierLetter` (#10196) and `OtherNumber` (#4373) are allowed in identifiers, but only starting from the second character. * `$decdigit = $ascdigit -- for now, should really be $digit (ToDo)` (see compiler/parser/Lexer.x) -- Ticket URL: http://ghc.haskell.org/trac/ghc/ticket/11609 GHC http://www.haskell.org/ghc/ The Glasgow Haskell Compiler

#11609: Document unicode report deviations -------------------------------------+------------------------------------- Reporter: thomie | Owner: Type: task | Status: new Priority: normal | Milestone: Component: Documentation | Version: 7.10.3 Resolution: | Keywords: unicode, | report-impact Operating System: Unknown/Multiple | Architecture: | Unknown/Multiple Type of failure: None/Unknown | Test Case: Blocked By: | Blocking: Related Tickets: #1103, #4373, | Differential Rev(s): #7650, #10196, #11012 | Wiki Page: | -------------------------------------+------------------------------------- Changes (by thomie): * related: #1103, #4373, #10196, #11012 => #1103, #4373, #7650, #10196, #11012 @@ -16,2 +16,3 @@ - * `ModifierLetter` (#10196) and `OtherNumber` (#4373) are allowed in - identifiers, but only starting from the second character. + * `ModifierLetter` (#10196), `OtherNumber` (#4373) and `NonSpacingMark` + (#7650) are allowed in identifiers, but only starting from the second + character. New description: @nomeata mentions in #10196: The report specifies “Haskell compilers are expected to make use of new versions of Unicode as they are made available.” So if we deviate from that, we should make sure that * the user’s guide explicitly lists all deviations from the report [https://downloads.haskell.org/~ghc/latest/docs/html/users_guide/bugs-and- infelicities.html#infelicities-lexical in this section], and * that the Haskell prime committee is going to be aware of these (sensible) deviations, so that they can become official. Certain deviations are (there might be more): * `OtherLetter` are treated as lowercase (#1103), and thus allowed in identifiers. * `ModifierLetter` (#10196), `OtherNumber` (#4373) and `NonSpacingMark` (#7650) are allowed in identifiers, but only starting from the second character. * `$decdigit = $ascdigit -- for now, should really be $digit (ToDo)` (see compiler/parser/Lexer.x) -- -- Ticket URL: http://ghc.haskell.org/trac/ghc/ticket/11609#comment:1 GHC http://www.haskell.org/ghc/ The Glasgow Haskell Compiler

#11609: Document unicode report deviations -------------------------------------+------------------------------------- Reporter: thomie | Owner: Type: task | Status: new Priority: normal | Milestone: Component: Documentation | Version: 7.10.3 Resolution: | Keywords: unicode, | report-impact Operating System: Unknown/Multiple | Architecture: | Unknown/Multiple Type of failure: None/Unknown | Test Case: Blocked By: | Blocking: Related Tickets: #1103, #4373, | Differential Rev(s): #7650, #10196, #11012 | Wiki Page: | -------------------------------------+------------------------------------- Comment (by rwbarton): Oh, I was going to comment on the subject of documentation, so I'll do it here. We should have a changelog entry about allowing combining characters in identifiers; and we should be clear about what kind of normalization we do to decide when a sequence involving combining characters is considered the same as a precomposed sequence. (I assume the answer is currently "none", but it would probably be nice to change that for 8.2. NFC normalization seems to be a popular choice for programming languages.) -- Ticket URL: http://ghc.haskell.org/trac/ghc/ticket/11609#comment:2 GHC http://www.haskell.org/ghc/ The Glasgow Haskell Compiler
participants (1)
-
GHC