#403: Cleanup lexical structure, recommendation: accept

Dear all, As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers https://github.com/ghc-proposals/ghc-proposals/pull/403, let me recommend acceptance. This proposal does two things related to unicode characters - Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) - Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals) It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals. As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial.

Hi, Am Donnerstag, den 04.03.2021, 15:53 +0100 schrieb Spiwack, Arnaud:
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers, let me recommend acceptance.
This proposal does two things related to unicode characters Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals) It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial.
yes, this looks very reasonable. Cheers, Joachim -- Joachim Breitner mail@joachim-breitner.de http://www.joachim-breitner.de/

I don't have a well-informed opinion, but I certainly have no objection.
Simon
From: ghc-steering-committee

This proposal seems fine to me.
Alejandro
El El vie, 5 mar 2021 a las 10:22, Simon Peyton Jones via
ghc-steering-committee
I don’t have a well-informed opinion, but I certainly have no objection.
Simon
*From:* ghc-steering-committee
*On Behalf Of *Spiwack, Arnaud *Sent:* 04 March 2021 14:54 *To:* GHC Steering committee *Subject:* [ghc-steering-committee] #403: Cleanup lexical structure, recommendation: accept Dear all,
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fghc-proposals%2Fghc-proposals%2Fpull%2F403&data=04%7C01%7Csimonpj%40microsoft.com%7Cb28a9399e2fd44379fd308d8df1d66b0%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637504665725379654%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=Rfp%2F%2BPsrs%2FQWa9k67NTAOLAI7PXccca%2BVLBZvcNVYME%3D&reserved=0, let me recommend acceptance.
This proposal does two things related to unicode characters
- Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) - Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals)
It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial. _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee

I think this looks reasonable too (with the qualification that I didn't
look at it in much detail).
On Fri, Mar 5, 2021 at 9:46 AM Alejandro Serrano Mena
This proposal seems fine to me.
Alejandro
El El vie, 5 mar 2021 a las 10:22, Simon Peyton Jones via ghc-steering-committee
escribió: I don’t have a well-informed opinion, but I certainly have no objection.
Simon
*From:* ghc-steering-committee < ghc-steering-committee-bounces@haskell.org> *On Behalf Of *Spiwack, Arnaud *Sent:* 04 March 2021 14:54 *To:* GHC Steering committee
*Subject:* [ghc-steering-committee] #403: Cleanup lexical structure, recommendation: accept Dear all,
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fghc-proposals%2Fghc-proposals%2Fpull%2F403&data=04%7C01%7Csimonpj%40microsoft.com%7Cb28a9399e2fd44379fd308d8df1d66b0%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637504665725379654%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=Rfp%2F%2BPsrs%2FQWa9k67NTAOLAI7PXccca%2BVLBZvcNVYME%3D&reserved=0, let me recommend acceptance.
This proposal does two things related to unicode characters
- Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) - Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals)
It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial. _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee

I'm in support. Thanks, Richard
On Mar 5, 2021, at 1:03 PM, Iavor Diatchki
wrote: I think this looks reasonable too (with the qualification that I didn't look at it in much detail).
On Fri, Mar 5, 2021 at 9:46 AM Alejandro Serrano Mena
mailto:trupill@gmail.com> wrote: This proposal seems fine to me. Alejandro
El El vie, 5 mar 2021 a las 10:22, Simon Peyton Jones via ghc-steering-committee
mailto:ghc-steering-committee@haskell.org> escribió: I don’t have a well-informed opinion, but I certainly have no objection. Simon
From: ghc-steering-committee
mailto:ghc-steering-committee-bounces@haskell.org> On Behalf Of Spiwack, Arnaud Sent: 04 March 2021 14:54 To: GHC Steering committee mailto:ghc-steering-committee@haskell.org> Subject: [ghc-steering-committee] #403: Cleanup lexical structure, recommendation: accept Dear all,
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fghc-proposals%2Fghc-proposals%2Fpull%2F403&data=04%7C01%7Csimonpj%40microsoft.com%7Cb28a9399e2fd44379fd308d8df1d66b0%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637504665725379654%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=Rfp%2F%2BPsrs%2FQWa9k67NTAOLAI7PXccca%2BVLBZvcNVYME%3D&reserved=0, let me recommend acceptance.
This proposal does two things related to unicode characters
Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals) It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial.
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org mailto:ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org mailto:ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee

Dear all,
About half of us have opined, so far. If nobody complains, I'll treat
silence as assent and mark this proposal as accepted on Tuesday (16 March).
On Tue, Mar 9, 2021 at 4:41 AM Richard Eisenberg
I'm in support.
Thanks, Richard
On Mar 5, 2021, at 1:03 PM, Iavor Diatchki
wrote: I think this looks reasonable too (with the qualification that I didn't look at it in much detail).
On Fri, Mar 5, 2021 at 9:46 AM Alejandro Serrano Mena
wrote: This proposal seems fine to me.
Alejandro
El El vie, 5 mar 2021 a las 10:22, Simon Peyton Jones via ghc-steering-committee
escribió: I don’t have a well-informed opinion, but I certainly have no objection.
Simon
*From:* ghc-steering-committee < ghc-steering-committee-bounces@haskell.org> *On Behalf Of *Spiwack, Arnaud *Sent:* 04 March 2021 14:54 *To:* GHC Steering committee
*Subject:* [ghc-steering-committee] #403: Cleanup lexical structure, recommendation: accept Dear all,
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fgithub.com%2Fghc-proposals%2Fghc-proposals%2Fpull%2F403&data=04%7C01%7Csimonpj%40microsoft.com%7Cb28a9399e2fd44379fd308d8df1d66b0%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637504665725379654%7CUnknown%7CTWFpbGZsb3d8eyJWIjoiMC4wLjAwMDAiLCJQIjoiV2luMzIiLCJBTiI6Ik1haWwiLCJXVCI6Mn0%3D%7C2000&sdata=Rfp%2F%2BPsrs%2FQWa9k67NTAOLAI7PXccca%2BVLBZvcNVYME%3D&reserved=0, let me recommend acceptance.
This proposal does two things related to unicode characters
- Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) - Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals)
It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial. _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee

I’m in favor of the proposal, but there’s something I don’t entirely understand. We are about to double down on a deviation from the Haskell 2010 report, with no way to turn it off. Such deviations are typically documented in the “Known bugs and infelicities” section of the User’s Guide. And yet this is not a bug, it’s a feature, driven by user requests (the proposal even goes as far as to cite the tickets). So, where will the new canonical lexical grammar live? Where do we point our users to when they try to figure out what’s allowed in identifiers/literals and what isn't? - Vlad
On 11 Mar 2021, at 12:46, Spiwack, Arnaud
wrote: Dear all,
About half of us have opined, so far. If nobody complains, I'll treat silence as assent and mark this proposal as accepted on Tuesday (16 March).
On Tue, Mar 9, 2021 at 4:41 AM Richard Eisenberg
wrote: I'm in support. Thanks, Richard
On Mar 5, 2021, at 1:03 PM, Iavor Diatchki
wrote: I think this looks reasonable too (with the qualification that I didn't look at it in much detail).
On Fri, Mar 5, 2021 at 9:46 AM Alejandro Serrano Mena
wrote: This proposal seems fine to me. Alejandro
El El vie, 5 mar 2021 a las 10:22, Simon Peyton Jones via ghc-steering-committee
escribió: I don’t have a well-informed opinion, but I certainly have no objection. Simon
From: ghc-steering-committee
On Behalf Of Spiwack, Arnaud Sent: 04 March 2021 14:54 To: GHC Steering committee Subject: [ghc-steering-committee] #403: Cleanup lexical structure, recommendation: accept Dear all,
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers, let me recommend acceptance.
This proposal does two things related to unicode characters
• Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) • Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals) It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial.
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee

I think we can figure out the right place for the documentation when the MR comes around. Nobody having opposed the proposal, I'll now mark it as accepted. On Thu, Mar 11, 2021 at 3:26 PM Vladislav Zavialov (int-index) < vlad.z.4096@gmail.com> wrote:
I’m in favor of the proposal, but there’s something I don’t entirely understand. We are about to double down on a deviation from the Haskell 2010 report, with no way to turn it off. Such deviations are typically documented in the “Known bugs and infelicities” section of the User’s Guide.
And yet this is not a bug, it’s a feature, driven by user requests (the proposal even goes as far as to cite the tickets).
So, where will the new canonical lexical grammar live? Where do we point our users to when they try to figure out what’s allowed in identifiers/literals and what isn't?
- Vlad
On 11 Mar 2021, at 12:46, Spiwack, Arnaud
wrote: Dear all,
About half of us have opined, so far. If nobody complains, I'll treat silence as assent and mark this proposal as accepted on Tuesday (16 March).
On Tue, Mar 9, 2021 at 4:41 AM Richard Eisenberg
wrote: I'm in support. Thanks, Richard
On Mar 5, 2021, at 1:03 PM, Iavor Diatchki
wrote: I think this looks reasonable too (with the qualification that I didn't look at it in much detail).
On Fri, Mar 5, 2021 at 9:46 AM Alejandro Serrano Mena < trupill@gmail.com> wrote: This proposal seems fine to me.
Alejandro
El El vie, 5 mar 2021 a las 10:22, Simon Peyton Jones via ghc-steering-committee
escribió: I don’t have a well-informed opinion, but I certainly have no objection. Simon
From: ghc-steering-committee < ghc-steering-committee-bounces@haskell.org> On Behalf Of Spiwack, Arnaud Sent: 04 March 2021 14:54 To: GHC Steering committee
Subject: [ghc-steering-committee] #403: Cleanup lexical structure, recommendation: accept Dear all,
As the shepherd for proposal #403 Cleanup lexical structure of numbers and identifiers, let me recommend acceptance.
This proposal does two things related to unicode characters
• Codify the actual current lexing of unicode symbols, which is actually different from the Haskell 2010 report (in particular, only ascii digits are allowed in number literals) • Adds the possibility to use unicode letters which are neither upper case nor lower case (such as many asian scripts) as trailing symbols for identifier (as well as in String and Char literals) It doesn't make it possible (nor close the door) to use letter-like numbers (such as latin numerals) as identifiers, nor use non-ascii digits in number literals.
As the title entails, it's essentially a clean up of the documentation, to match reality. It seems quite uncontroversial.
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org
https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org
https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org
https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
_______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee _______________________________________________ ghc-steering-committee mailing list ghc-steering-committee@haskell.org https://mail.haskell.org/cgi-bin/mailman/listinfo/ghc-steering-committee
participants (7)
-
Alejandro Serrano Mena
-
Iavor Diatchki
-
Joachim Breitner
-
Richard Eisenberg
-
Simon Peyton Jones
-
Spiwack, Arnaud
-
Vladislav Zavialov (int-index)