Incorrect handling of multibyte UTF-16 encodings

**Describe the bug**
It seems the handling of multibyte UTF-16 encodings is incorrect in `lsp-mode`.

**To Reproduce**
Create a file with just these contents: "🍋" - a single lemon emoji. Place the cursor after the lemon and type a character ("l", say, for lemon). Most emojis including this one are represented by two UTF-16 bytes, so since LSP [specifies](https://microsoft.github.io/language-server-protocol/specifications/specification-current/#textDocuments) offsets as in a UTF-16 string representation, this is at column *2*.

But `lsp-mode` sends column 1:

```json
{"jsonrpc":"2.0","method":"textDocument/didChange","params":{"textDocument":{"uri":"file:///home/w/utf16.lean","version":1},"contentChanges":[{"range":{"start":{"line":0,"character":1},"end":{"line":0,"character":1}},"rangeLength":0,"text":"l"}]}}
```

**Expected behavior**
Compare with e.g. the VSCode [sample](https://github.com/microsoft/vscode-extension-samples/tree/83261e3f32421da84513c5a76e8b6a08fff3c332/lsp-sample) client, which sends 2 as it should:

```json
{"jsonrpc":"2.0","method":"textDocument/didChange","params":{"textDocument":{"uri":"file:///home/w/utf16.lean","version":56},"contentChanges":[{"range":{"start":{"line":0,"character":2},"end":{"line":0,"character":2}},"rangeLength":0,"text":"l"}]}}
```

**Which Language Server did you use**
Custom one added via the [tutorial](https://emacs-lsp.github.io/lsp-mode/page/adding-new-language/). `lsp-mode` version 7.0.1.

**OS**
Linux

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Incorrect handling of multibyte UTF-16 encodings #2080

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

Incorrect handling of multibyte UTF-16 encodings #2080

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions