Re: [Haskell-i18n] unicode notation \uhhhh implementation

16 Aug 2002


      On Fri, 2002-08-16 at 10:26, Ketil Z. Malde wrote:
...
Ashley Yakeley  writes:
...
Sure, but bear in mind Unicode names for characters are quite long, for 
instance
...
GREEK SMALL LETTER THETA
Hmm...yes.  My personal preference would be something close to
(La)TeX.  Although it is perhaps a bit niche, it *is* a standard lhs
style (and one which I quite like, too).
I think there would be no harm in having the TeX names for convenience.
...
Would we need to maintain the list manually, then?  Perhaps we could
standardise Unicode names, but additionally maintain short synonyms it
for greek letters and similar mathematical symbols, which I suspect
are rather commonly used?
Would allowing the full Unicode names give an advantage? Something like
GREEK_SMALL_LETTER_THETA is almost half a line and might do more harm to
the code readability than uhhhh.
...
...
Right, but whatever it is it really should be an ASCII character: the 
point is to allow representation of all identifiers from 7-bit ASCII.
What's available, really?  "~!?$%.,^:;" are taken, along with quotes,
numerical symbols and parens.  Are '#' and '&' still free?
Candidates I can think of might be:
1        &alpha  -> similar to HTML entities
2        #alpha  -> possible problems with C preprocessor?
How about #uhhhh? There is no C preprocessor directive like that, so it
should be safe to run the unicode-preproc before cpp. The only thing is
that GHC uses # in identifiers and pragmas, as far as I can see. Can
someone comment?


Sven Moritz

Re: [Haskell-i18n] unicode notation \uhhhh implementation

Sven Moritz Hallberg