Valid unicode characters cause compiler lexing error
Summary
Here's a perfectly valid Haskell program:
module Main (main) where
main = putStrLn "🫂"
This does not compile. Instead, GHC reports lexical error in string/character literal at character '\129730'
.
My guess is that this is a similar problem to #5518 (closed). It looks like this emoji ("people hugging") was added in Unicode 13, released on March 10th, 2020.
I know nothing about how GHC works, but maybe the Unicode tables need to be renewed (taking the terminology from #5518 (comment 158430))?
Environment
- GHC version used: 8.10.4
Optional:
- Operating System: Arch Linux
- System Architecture: x86_64