Disallow additional invisible characters in Julia source

There's many invisible unicode characters.

https://invisible-characters.com/

We currently disallow several of them

https://github.com/JuliaLang/julia/blob/c24517918dd7ea33df4eb0c965dd7d45d530d7b0/src/julia-parser.scm#L596

Namely

```julia
'\u00ad' # soft hyphen
'\u200b' # zero width space
'\u200c' # zero width non-joiner
'\u200d' # zero width joiner
'\u200e' # left-to-right mark
'\u200f' # right-to-left mark
'\u2060' # word joiner
'\u2061' # function application
```

It appears the reference parser also attempts to disallow `'\u115f' # Hangul Choseong filler` , but this doesn't work due to `Base.is_id_char` returning true for that character. I'm not sure this is a problem, I didn't find obvious information about what this filler character is for and whether it might be required for writing identifiers in Korean.

Anyway, should we disallow more of the list from https://invisible-characters.com/ ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Disallow additional invisible characters in Julia source #49850

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Disallow additional invisible characters in Julia source #49850

Description

Activity

inkydragon commented on May 17, 2023

c42f commented on May 18, 2023

Seelengrab commented on May 18, 2023

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Participants

Issue actions