`next_token_inner` in the lexer has a comment saying to do NKFC normalization. I have no idea what that is, but I guess we should do it. reference: [NFKC](http://www.unicode.org/reports/tr15/#Norm_Forms) is one of four unicode Normalization Forms. reference: [UAX-31](http://www.unicode.org/reports/tr31/) supplies guidelines for use of normalization with identifiers.