Skip to content

Commit 3f4bad5

Browse files
committed
[AArch64] Fix for the pre-indexed paired load/store optimization.
This patch fixes an issue where a pre-indexed store e.g., STR x1, [x0, #24]! with a store like STR x0, [x0, #8] are merged into a single store: STP x1, x0, [x0, #24]! . They shouldn’t be merged because the second store uses x0 as both the stored value and the address and so it needs to be using the updated x0. Therefore, it should not be folded into a STP <>pre. Additionally a new test case is added to verify this fix. Differential Revision: https://reviews.llvm.org/D101888 Change-Id: I26f1985ac84e970961e2cdca23c590fa6773851a
1 parent e994e74 commit 3f4bad5

File tree

2 files changed

+34
-1
lines changed

2 files changed

+34
-1
lines changed

llvm/lib/Target/AArch64/AArch64LoadStoreOptimizer.cpp

+7-1
Original file line numberDiff line numberDiff line change
@@ -1610,7 +1610,13 @@ AArch64LoadStoreOpt::findMatchingInsn(MachineBasicBlock::iterator I,
16101610
!UsedRegUnits.available(getLdStBaseOp(MI).getReg());
16111611
bool IsBaseRegModified =
16121612
!ModifiedRegUnits.available(getLdStBaseOp(MI).getReg());
1613-
if (IsOutOfBounds || IsBaseRegUsed || IsBaseRegModified) {
1613+
// If the stored value and the address of the second instruction is
1614+
// the same, it needs to be using the updated register and therefore
1615+
// it must not be folded.
1616+
bool IsMIRegTheSame =
1617+
getLdStRegOp(MI).getReg() == getLdStBaseOp(MI).getReg();
1618+
if (IsOutOfBounds || IsBaseRegUsed || IsBaseRegModified ||
1619+
IsMIRegTheSame) {
16141620
LiveRegUnits::accumulateUsedDefed(MI, ModifiedRegUnits,
16151621
UsedRegUnits, TRI);
16161622
MemInsns.push_back(&MI);

llvm/test/CodeGen/AArch64/strpre-str-merge.mir

+27
Original file line numberDiff line numberDiff line change
@@ -424,3 +424,30 @@ body: |
424424
STRSui killed renamable $s1, renamable $x0, 1 :: (store 4)
425425
RET undef $lr, implicit $x0
426426
...
427+
428+
---
429+
name: 16-strxpre-strxui-same-reg-no-merge
430+
alignment: 4
431+
tracksRegLiveness: true
432+
liveins:
433+
- { reg: '$x0' }
434+
- { reg: '$x1' }
435+
- { reg: '$x2' }
436+
frameInfo:
437+
maxAlignment: 1
438+
maxCallFrameSize: 0
439+
machineFunctionInfo:
440+
hasRedZone: false
441+
body: |
442+
bb.0.entry:
443+
liveins: $x0, $x1, $x2
444+
; CHECK-LABEL: name: 16-strxpre-strxui-same-reg-no-merge
445+
; CHECK: liveins: $x0, $x1, $x2
446+
; CHECK: early-clobber renamable $x0 = STRXpre renamable $x1, renamable $x0, 24, implicit $w0 :: (store 8)
447+
; CHECK: STRXui renamable $x0, renamable $x0, 1 :: (store 8)
448+
; CHECK: RET undef $lr, implicit $x0
449+
early-clobber renamable $x0 = STRXpre killed renamable $x1, killed renamable $x0, 24 :: (store 8)
450+
STRXui renamable $x0, renamable $x0, 1 :: (store 8)
451+
RET undef $lr, implicit $x0
452+
453+
...

0 commit comments

Comments
 (0)