RFE: Warn about implicit Unicode conversions which could raise an exception #2182

Herst · 2016-09-25T15:35:10Z

What I'm looking look for is basically a solution to the following SO question concerning problematic implicit conversions into unicode (in Python 2, I guess in Python 3 the equivalent problem with bytes to string conversions exists).

Summary of the SO question: I have a large existing projects where people weren't thinking much about Unicode and happily mixing encoded strs and unicode together without thinking twice about telling the decoder what the encoding used is and what to do with byte sequences which can't be decoded. Apparently I'm not the only one with the problem if you do a SO search or search for the horrible hack I mentioned in the SO question.

Now it would be great if e.g. there was a mypy parameter with which all implicit conversions into unicode (which also means that the programmer didn't explicitly state any encoding) would raise a warning, including the examples from the SO post:

# potentially problematic cases if someStr can also have non-ASCII characters in it
u"foo" + someStr
u"foo{}".format(someStr)

Note that I don't think that mypy should be making guesses about the content of the strings. Also, this RFE is probably related to #1141.

/edit: I just noticed that in python/typing#208 such solutions were already being discussed. Sorry for not noticing it earlier, think of this post here as an argument in favor of the strict approach then.

The text was updated successfully, but these errors were encountered:

gvanrossum · 2016-09-25T21:36:50Z

Thanks! I think we have enough issues open for this topic already (you found both of them) so I'm going to close this one. We're going slowly here but I have a good hope that we'll end up in a good spot with support for your use case.

gvanrossum closed this as completed Sep 25, 2016

Herst mentioned this issue Nov 10, 2016

Decide how to handle str/unicode python/typing#208

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

RFE: Warn about implicit Unicode conversions which could raise an exception #2182

RFE: Warn about implicit Unicode conversions which could raise an exception #2182

Herst commented Sep 25, 2016 •

edited

Loading

gvanrossum commented Sep 25, 2016

Uh oh!

Uh oh!

RFE: Warn about implicit Unicode conversions which could raise an exception #2182

RFE: Warn about implicit Unicode conversions which could raise an exception #2182

Comments

Herst commented Sep 25, 2016 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

gvanrossum commented Sep 25, 2016

Uh oh!

Herst commented Sep 25, 2016 •

edited

Loading