[StackColoring] Incorrect slot merging due to stackcoloring-lifetime-start-on-first-use #132085

tmiasko · 2025-03-19T19:12:59Z

In the program below, the allocation a and allocation b have overlapping
live ranges, so it should be impossible to observe them having the same
address. Nevertheless StackColoring merges stack slots for a and b. This
is incorrect since addresses of those allocations might be captured by g.

define void @f() {
start:
  %a = alloca [1000 x i8], align 1
  %b = alloca [1000 x i8], align 1
  call void @llvm.lifetime.start.p0(i64 1000, ptr %a)
  call void @llvm.lifetime.start.p0(i64 1000, ptr %b)
  call void @g(ptr %a)
  call void @llvm.lifetime.end.p0(i64 1000, ptr %a)
  call void @g(ptr %b)
  call void @llvm.lifetime.end.p0(i64 1000, ptr %b)
  ret void
}
declare void @g(ptr %n)

$ llc-21 a.ll -print-before=stack-coloring -print-after=stack-coloring
# Machine code for function f: IsSSA, TracksLiveness
Frame Objects:
  fi#0: size=1000, align=1, at location [SP+8]
  fi#1: size=1000, align=1, at location [SP+8]
<snip>
# *** IR Dump After Merge disjoint stack slots (stack-coloring) ***:
# Machine code for function f: IsSSA, TracksLiveness
Frame Objects:
  fi#0: size=1000, align=1, at location [SP+8]
  fi#1: dead
<snip>

This is caused by stackcoloring-lifetime-start-on-first-use (the default) where
stack coloring shrinks live range towards first use.

The text was updated successfully, but these errors were encountered:

nikic · 2025-03-19T22:00:30Z

In the program below, the allocation a and allocation b have overlapping live ranges, so it should be impossible to observe them having the same address.

Where is this requirement specified? I don't believe that overlapping lifetimes are intended to guarantee that the addresses differ. They should be consistently observed as equal or not equal at runtime, but I believe both outcomes are legal. (Similar to how e.g. two different functions may or may not compare equal.)

tmiasko · 2025-03-20T08:49:08Z

LangRef guarantees that pointers returned by alloca are unique, so if any, the
issue is explaining why it is possible to observe distinct allocas having the
same address.

Since the entire purpose of lifetime intrinsics is for them to be sufficient
for merging stack slots, it must be possible to observe distinct allocas having
the same address, and LangRef needs clarification.

Step back for a moment and consider the same function without any lifetime
intrinsics at all. In all possible executions addresses of a and b must
differ. Now suppose the function is inlined. This introduces overlapping
lifetimes around the code, but the behavior must remain the same. Therefore in
executions where lifetimes are overlapping their addresses must differ.

Alive2 agrees that with overlapping lifetimes addresses must differ:
https://alive2.llvm.org/ce/z/W_FDbF

nikic · 2025-03-20T09:17:26Z

LangRef guarantees that pointers returned by alloca are unique, so if any, the issue is explaining why it is possible to observe distinct allocas having the same address.

Since the entire purpose of lifetime intrinsics is for them to be sufficient for merging stack slots, it must be possible to observe distinct allocas having the same address, and LangRef needs clarification.

Could you please point me to where LangRef guarantees that allocas can never have the same address? I couldn't find wording to that effect, and as you say, the presence of lifetime intrinsics would certainly make such a requirement incorrect.

Step back for a moment and consider the same function without any lifetime intrinsics at all. In all possible executions addresses of a and b must differ. Now suppose the function is inlined. This introduces overlapping lifetimes around the code, but the behavior must remain the same. Therefore in executions where lifetimes are overlapping their addresses must differ.

I don't follow the example. If you inline the function in the issue description the lifetimes will be placed differently (the %a lifetime will not end early).

But in any case, that still assumes that "equal" and "not equal" cannot be both valid outcomes.

Alive2 agrees that with overlapping lifetimes addresses must differ: https://alive2.llvm.org/ce/z/W_FDbF

When it comes to address identity, I would not blindly trust alive2 output. I've encountered may bugs it has in this particular area.

tmiasko · 2025-03-20T11:11:50Z

Do I understand correctly, that you are claiming that in the case without
lifetime intrinsics pointers to distinct non-zero sized allocations might
compare equal?

define i1 @g() {
  %a = alloca ptr
  %b = alloca ptr
  %c = icmp eq ptr %a, %b
  ret i1 %c
}

I take it as a fundamental that allocated objects that are live at the same
time have disjoint storage (unless specified otherwise), so in g those
pointers always compare not equal. Furthermore programming languages like C
(6.5.9p7), C++, and Rust prescribe the outcome of such comparisons as not equal
as well.

I don't understand why equal would be a valid outcome, or how would C, C++, or
Rust be lowered to LLVM IR in that case.

Semantics section for alloca claims that "Allocating zero bytes is legal, but
the returned pointer may not be unique." What is point of describing the
exception for zero sized allocations unless there is a general rule that those
pointers are indeed unique (notwithstanding interaction with lifetime
intrinsics)?

nikic · 2025-03-20T13:59:23Z

Do I understand correctly, that you are claiming that in the case without
lifetime intrinsics pointers to distinct non-zero sized allocations might
compare equal?

My general assumption here was that lifetime intrinsics are an optimization hint rather than a specification of when exactly allocation/deallocation must occur, so that it's possible to shrink the lifetime based on usage (e.g. by "sinking" the allocation point, as happens in the example from the issue description).

For your simplified example from the last comment, I don't think that can return true as both allocas are used in the icmp.

Anyway, my assumption about how this is supposed to work is probably wrong. In any case, we should explicitly document the semantics in LangRef.

cc @efriedma-quic

efriedma-quic · 2025-03-20T16:45:53Z

The semantics of lifetime intrinsics is a longstanding issue; see #45725. At first glance, this is basically a duplicate.

tmiasko · 2025-03-20T18:51:31Z

A simpler demonstration of the issue, this time with proper nesting of lifetimes:

define i32 @main() {
entry:
  %a = alloca [1000 x i8]
  %b = alloca [1000 x i8]
  call void @llvm.lifetime.start.p0(i64 1000, ptr %a)
  call void @llvm.lifetime.start.p0(i64 1000, ptr %b)
  call void @llvm.lifetime.end.p0(i64 1000, ptr %b)
  %0 = icmp eq ptr %a, %b
  call void @llvm.lifetime.end.p0(i64 1000, ptr %a)
  %1 = zext i1 %0 to i32
  ret i32 %1
}

$ lli-21 -stackcoloring-lifetime-start-on-first-use=0 a.ll; echo $?
0
$ lli-21 -stackcoloring-lifetime-start-on-first-use=1 a.ll; echo $?
1

RalfJung · 2025-03-21T21:45:06Z

Whether or not two variables can be equal has to be tied to some clear property of the program. Maybe it's not lifetime intrinsics, but then what is it? The answer cannot be "look at how the variables are used" because there are optimizations that change how the variables are used, and if the syntactic position of the first and last use was somehow semantically relevant, then those optimizations would be wrong.

tmiasko added llvm:codegen miscompilation labels Mar 19, 2025

tmiasko mentioned this issue Mar 21, 2025

Miscompilation: Equal pointers comparing as unequal rust-lang/rust#107975

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[StackColoring] Incorrect slot merging due to stackcoloring-lifetime-start-on-first-use #132085

[StackColoring] Incorrect slot merging due to stackcoloring-lifetime-start-on-first-use #132085

tmiasko commented Mar 19, 2025

nikic commented Mar 19, 2025

tmiasko commented Mar 20, 2025

nikic commented Mar 20, 2025 •

edited

Loading

tmiasko commented Mar 20, 2025 •

edited

Loading

nikic commented Mar 20, 2025

efriedma-quic commented Mar 20, 2025

tmiasko commented Mar 20, 2025

RalfJung commented Mar 21, 2025

[StackColoring] Incorrect slot merging due to stackcoloring-lifetime-start-on-first-use #132085

[StackColoring] Incorrect slot merging due to stackcoloring-lifetime-start-on-first-use #132085

Comments

tmiasko commented Mar 19, 2025

nikic commented Mar 19, 2025

tmiasko commented Mar 20, 2025

nikic commented Mar 20, 2025 • edited Loading

tmiasko commented Mar 20, 2025 • edited Loading

nikic commented Mar 20, 2025

efriedma-quic commented Mar 20, 2025

tmiasko commented Mar 20, 2025

RalfJung commented Mar 21, 2025

nikic commented Mar 20, 2025 •

edited

Loading

tmiasko commented Mar 20, 2025 •

edited

Loading