Can we weaken the requirements for `offset`? (Was: Should we / can we make all "getelementptr inbounds" into "getelementptr nowrap"?)

The "inbounds" semantics of `offset` are [notoriously tricky and confusing](https://rust-lang.zulipchat.com/#narrow/stream/136281-t-lang.2Fwg-unsafe-code-guidelines/topic/inbounds.20offsets.20can.20leave.20the.20provenance.20region). From what I hear from @nikic, the "inbounds" part is also [not nearly as useful as one might think](https://discourse.llvm.org/t/question-about-getelementptr-inbounds-with-offset-0/62533/9), and the main payoff is knowing that the pointer does not wrap around either end of the address space.

So... is there a chance that we could significantly simplify the language, at acceptable cost for analyses, by changing the rules of `offset` (and of all the other "inbounds" offsets that the language performs implicitly, such as when applying place projections) so that the only case of UB is the computation wrapping around the address space (going below `0` or above `usize::MAX`)? I think that would be great, but of course we have to be careful not to give up too much information here. (That said, we *do* have a ton of information of the form "this pointer is dereferenceable for size N", which conveys bounds information much more directly than `getelementptr inbounds`.)
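To make the proposed rule concrete, here is a minimal sketch that models it on plain integer addresses. The function name `nowrap_offset` and its signature are hypothetical, made up for illustration; this is neither the actual `offset` implementation nor an existing API. It assumes that an overflow while computing the byte offset itself also counts as wrapping.

```rust
/// Hypothetical model of the proposed rule (not a real API): compute the
/// address that `offset` would produce, or return `None` in the only case
/// that would still be UB under the proposal, namely wrapping around the
/// address space.
fn nowrap_offset(addr: usize, count: isize, elem_size: usize) -> Option<usize> {
    // Assumption: an element size that does not fit in `isize` is rejected.
    if elem_size > isize::MAX as usize {
        return None;
    }
    // Byte offset = count * elem_size; overflow here is treated as wrapping.
    let bytes = count.checked_mul(elem_size as isize)?;
    // `checked_add_signed` fails both below 0 and above `usize::MAX`.
    addr.checked_add_signed(bytes)
}

fn main() {
    // Ordinary forward and backward offsets are fine, with no allocation
    // bounds involved in the check at all.
    assert_eq!(nowrap_offset(0x1000, 4, 4), Some(0x1010));
    assert_eq!(nowrap_offset(0x1000, -1, 16), Some(0xff0));
    // Wrapping below 0.
    assert_eq!(nowrap_offset(8, -3, 4), None);
    // Wrapping above `usize::MAX`.
    assert_eq!(nowrap_offset(usize::MAX - 3, 1, 4), None);
}
```

Note that the model never consults the bounds of any allocation, which is exactly the simplification being asked about.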

However, we'd probably need LLVM support for this, adding some sort of `getelementptr nowrap`. (There *is* the possible alternative of using plain `getelementptr`, and upgrading that to `inbounds` whenever we can derive from other information that the pointer is indeed dereferenceable for a sufficiently large memory range. I am not sure how tricky that would be to implement though.)
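The alternative in the parenthetical could look roughly like the following decision, sketched under the assumption that the compiler tracks a "dereferenceable for N bytes" fact per pointer. The `Gep` enum and `choose_gep` function are hypothetical names for illustration only and do not correspond to anything in rustc or LLVM.

```rust
/// Which flavor of `getelementptr` to emit (hypothetical model).
#[derive(Debug, PartialEq)]
enum Gep {
    Plain,
    Inbounds,
}

/// Hypothetical helper: upgrade to `inbounds` only when known
/// dereferenceability proves the result stays within the same allocation.
/// `dereferenceable_bytes` is `None` when nothing is known about the pointer.
fn choose_gep(dereferenceable_bytes: Option<usize>, byte_offset: isize) -> Gep {
    match dereferenceable_bytes {
        // A non-negative offset of at most N bytes from a pointer that is
        // dereferenceable for N bytes stays in bounds (one-past-the-end is
        // allowed for `inbounds`).
        Some(n) if byte_offset >= 0 && (byte_offset as usize) <= n => Gep::Inbounds,
        // Otherwise fall back to the plain form, which makes no promise.
        _ => Gep::Plain,
    }
}

fn main() {
    assert_eq!(choose_gep(Some(16), 8), Gep::Inbounds);
    assert_eq!(choose_gep(Some(16), 16), Gep::Inbounds);
    assert_eq!(choose_gep(Some(16), 17), Gep::Plain);
    assert_eq!(choose_gep(Some(16), -1), Gep::Plain);
    assert_eq!(choose_gep(None, 0), Gep::Plain);
}
```

How often such facts would actually be available at each offset site is the open question raised above.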

So I wonder, @nikic, do you think that would be a reasonable and realistic option? And everyone, do you think that would be a reasonable semantics to shoot for?

In particular, this would resolve https://github.com/rust-lang/unsafe-code-guidelines/issues/299.

Can we weaken the requirements for `offset`? (Was: Should we / can we make all "getelementptr inbounds" into "getelementptr nowrap"?) #350

Activity

Lokathor commented on Jul 11, 2022

scottmcm commented on Jul 11, 2022

Lokathor commented on Jul 11, 2022

eddyb commented on Jul 17, 2022

RalfJung commented on Feb 10, 2023

RalfJung commented on Jun 14, 2023

RalfJung commented on Jun 28, 2023
