Skip to content

Commit 7c4342e

Browse files
committed
ARM: fix folding of stack-adjustment (yet again).
When trying to eliminate an "sub sp, sp, #N" instruction by folding it into an existing push/pop using dummy registers, we need to account for the fact that this might affect precisely how "fp" gets set in the prologue. We were attempting this, but assuming that *whenever* we performed a fold it would make a difference. This is false, for example, in: push {r4, r7, lr} add fp, sp, brson#4 vpush {d8} sub sp, sp, brson#8 we can fold the "sub" into the "vpush", forming "vpush {d7, d8}". However, in that case the "add fp" instruction mustn't change, which we were getting wrong before. Should fix PR18160. git-svn-id: https://llvm.org/svn/llvm-project/llvm/trunk@196725 91177308-0d34-0410-b5e6-96231b3b80d8
1 parent d34102e commit 7c4342e

File tree

2 files changed

+13
-3
lines changed

2 files changed

+13
-3
lines changed

lib/Target/ARM/ARMFrameLowering.cpp

+4-3
Original file line numberDiff line numberDiff line change
@@ -256,9 +256,10 @@ void ARMFrameLowering::emitPrologue(MachineFunction &MF) const {
256256

257257
if (NumBytes) {
258258
// Adjust SP after all the callee-save spills.
259-
if (tryFoldSPUpdateIntoPushPop(STI, MF, LastPush, NumBytes))
260-
FramePtrOffsetInPush += NumBytes;
261-
else
259+
if (tryFoldSPUpdateIntoPushPop(STI, MF, LastPush, NumBytes)) {
260+
if (LastPush == FramePtrPush)
261+
FramePtrOffsetInPush += NumBytes;
262+
} else
262263
emitSPUpdate(isARM, MBB, MBBI, dl, TII, -NumBytes,
263264
MachineInstr::FrameSetup);
264265

test/CodeGen/ARM/fold-stack-adjust.ll

+9
Original file line numberDiff line numberDiff line change
@@ -137,6 +137,15 @@ define void @test_fold_point(i1 %tst) minsize {
137137
; CHECK: {{LBB[0-9]+_2}}:
138138
; CHECK-NEXT: vpop {d7, d8}
139139
; CHECK-NEXT: pop {r4, pc}
140+
141+
; With a guaranteed frame-pointer, we want to make sure that its offset in the
142+
; push block is correct, even if a few registers have been tacked onto a later
143+
; vpush (PR18160).
144+
; CHECK-IOS-LABEL: test_fold_point:
145+
; CHECK-IOS: push {r4, r7, lr}
146+
; CHECK-IOS-NEXT: add r7, sp, #4
147+
; CHECK-IOS-NEXT: vpush {d7, d8}
148+
140149
; We want some memory so there's a stack adjustment to fold...
141150
%var = alloca i8, i32 8
142151

0 commit comments

Comments
 (0)