Skip to content

Commit 80f337f

Browse files
committed
Auto merge of rust-lang#122568 - RalfJung:mentioned-items, r=<try>
recursively evaluate the constants in everything that is 'mentioned' This is another attempt at fixing rust-lang#107503. The previous attempt at rust-lang#112879 seems stuck in figuring out where the [perf regression](https://perf.rust-lang.org/compare.html?start=c55d1ee8d4e3162187214692229a63c2cc5e0f31&end=ec8de1ebe0d698b109beeaaac83e60f4ef8bb7d1&stat=instructions:u) comes from. In rust-lang#122258 I learned some things, which informed the approach this PR is taking. Quoting from the new collector docs, which explain the high-level idea: ```rust //! One important role of collection is to evaluate all constants that are used by all the items //! which are being collected. Codegen can then rely on only encountering constants that evaluate //! successfully, and if a constant fails to evaluate, the collector has much better context to be //! able to show where this constant comes up. //! //! However, the exact set of "used" items (collected as described above), and therefore the exact //! set of used constants, can depend on optimizations. Optimizing away dead code may optimize away //! a function call that uses a failing constant, so an unoptimized build may fail where an //! optimized build succeeds. This is undesirable. //! //! To fix this, the collector has the concept of "mentioned" items. Some time during the MIR //! pipeline, before any optimization-level-dependent optimizations, we compute a list of all items //! that syntactically appear in the code. These are considered "mentioned", and even if they are in //! dead code and get optimized away (which makes them no longer "used"), they are still //! "mentioned". For every used item, the collector ensures that all mentioned items, recursively, //! do not use a failing constant. This is reflected via the [`CollectionMode`], which determines //! whether we are visiting a used item or merely a mentioned item. //! //! The collector and "mentioned items" gathering (which lives in `rustc_mir_transform::mentioned_items`) //! need to stay in sync in the following sense: //! //! - For every item that the collector gather that could eventually lead to build failure (most //! likely due to containing a constant that fails to evaluate), a corresponding mentioned item //! must be added. This should use the exact same strategy as the ecollector to make sure they are //! in sync. However, while the collector works on monomorphized types, mentioned items are //! collected on generic MIR -- so any time the collector checks for a particular type (such as //! `ty::FnDef`), we have to just onconditionally add this as a mentioned item. //! - In `visit_mentioned_item`, we then do with that mentioned item exactly what the collector //! would have done during regular MIR visiting. Basically you can think of the collector having //! two stages, a pre-monomorphization stage and a post-monomorphization stage (usually quite //! literally separated by a call to `self.monomorphize`); the pre-monomorphizationn stage is //! duplicated in mentioned items gathering and the post-monomorphization stage is duplicated in //! `visit_mentioned_item`. //! - Finally, as a performance optimization, the collector should fill `used_mentioned_item` during //! its MIR traversal with exactly what mentioned item gathering would have added in the same //! situation. This detects mentioned items that have *not* been optimized away and hence don't //! need a dedicated traversal. enum CollectionMode { /// Collect items that are used, i.e., actually needed for codegen. /// /// Which items are used can depend on optimization levels, as MIR optimizations can remove /// uses. UsedItems, /// Collect items that are mentioned. The goal of this mode is that it is independent of /// optimizations: the set of "mentioned" items is computed before optimizations are run. /// /// The exact contents of this set are *not* a stable guarantee. (For instance, it is currently /// computed after drop-elaboration. If we ever do some optimizations even in debug builds, we /// might decide to run them before computing mentioned items.) The key property of this set is /// that it is optimization-independent. MentionedItems, } ``` And the `mentioned_items` MIR body field docs: ```rust /// Further items that were mentioned in this function and hence *may* become monomorphized, /// depending on optimizations. We use this to avoid optimization-dependent compile errors: the /// collector recursively traverses all "mentioned" items and evaluates all their /// `required_consts`. /// /// This is *not* soundness-critical and the contents of this list are *not* a stable guarantee. /// All that's relevant is that this set is optimization-level-independent, and that it includes /// everything that the collector would consider "used". (For example, we currently compute this /// set after drop elaboration, so some drop calls that can never be reached are not considered /// "mentioned".) See the documentation of `CollectionMode` in /// `compiler/rustc_monomorphize/src/collector.rs` for more context. pub mentioned_items: Vec<Spanned<MentionedItem<'tcx>>>, ``` Fixes rust-lang#107503
2 parents 0f706af + 4cee6b3 commit 80f337f

File tree

59 files changed

+1582
-400
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

59 files changed

+1582
-400
lines changed

compiler/rustc_data_structures/src/sync.rs

+6-3
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@
2020
//! | ----------------------- | ------------------- | ------------------------------- |
2121
//! | `Lrc<T>` | `rc::Rc<T>` | `sync::Arc<T>` |
2222
//! |` Weak<T>` | `rc::Weak<T>` | `sync::Weak<T>` |
23+
//! | `LRef<'a, T>` [^2] | `&'a mut T` | `&'a T` |
2324
//! | | | |
2425
//! | `AtomicBool` | `Cell<bool>` | `atomic::AtomicBool` |
2526
//! | `AtomicU32` | `Cell<u32>` | `atomic::AtomicU32` |
@@ -38,7 +39,7 @@
3839
//! of a `RefCell`. This is appropriate when interior mutability is not
3940
//! required.
4041
//!
41-
//! [^2] `MTLockRef` is a typedef.
42+
//! [^2] `MTRef`, `MTLockRef` are type aliases.
4243
4344
pub use crate::marker::*;
4445
use std::collections::HashMap;
@@ -208,7 +209,7 @@ cfg_match! {
208209

209210
use std::cell::RefCell as InnerRwLock;
210211

211-
pub type MTLockRef<'a, T> = &'a mut MTLock<T>;
212+
pub type LRef<'a, T> = &'a mut T;
212213

213214
#[derive(Debug, Default)]
214215
pub struct MTLock<T>(T);
@@ -274,7 +275,7 @@ cfg_match! {
274275
pub use std::sync::Arc as Lrc;
275276
pub use std::sync::Weak as Weak;
276277

277-
pub type MTLockRef<'a, T> = &'a MTLock<T>;
278+
pub type LRef<'a, T> = &'a T;
278279

279280
#[derive(Debug, Default)]
280281
pub struct MTLock<T>(Lock<T>);
@@ -314,6 +315,8 @@ cfg_match! {
314315
}
315316
}
316317

318+
pub type MTLockRef<'a, T> = LRef<'a, MTLock<T>>;
319+
317320
#[derive(Default)]
318321
#[cfg_attr(parallel_compiler, repr(align(64)))]
319322
pub struct CacheAligned<T>(pub T);

compiler/rustc_middle/src/mir/mod.rs

+46
Original file line numberDiff line numberDiff line change
@@ -20,6 +20,7 @@ use rustc_hir::def_id::{DefId, CRATE_DEF_ID};
2020
use rustc_hir::{self, CoroutineDesugaring, CoroutineKind, ImplicitSelfKind};
2121
use rustc_hir::{self as hir, HirId};
2222
use rustc_session::Session;
23+
use rustc_span::source_map::Spanned;
2324
use rustc_target::abi::{FieldIdx, VariantIdx};
2425

2526
use polonius_engine::Atom;
@@ -44,6 +45,7 @@ use std::ops::{Index, IndexMut};
4445
use std::{iter, mem};
4546

4647
pub use self::query::*;
48+
use self::visit::TyContext;
4749
pub use basic_blocks::BasicBlocks;
4850

4951
mod basic_blocks;
@@ -312,6 +314,21 @@ impl<'tcx> CoroutineInfo<'tcx> {
312314
}
313315
}
314316

317+
/// Some item that needs to monomorphize successfully for a MIR body to be considered well-formed.
318+
#[derive(Copy, Clone, PartialEq, Eq, Debug, Hash, HashStable, TyEncodable, TyDecodable)]
319+
#[derive(TypeFoldable, TypeVisitable)]
320+
pub enum MentionedItem<'tcx> {
321+
/// A function that gets called. We don't necessarily know its precise type yet, since it can be
322+
/// hidden behind a generic.
323+
Fn(Ty<'tcx>),
324+
/// A type that has its drop shim called.
325+
Drop(Ty<'tcx>),
326+
/// Unsizing casts might require vtables, so we have to record them.
327+
UnsizeCast { source_ty: Ty<'tcx>, target_ty: Ty<'tcx> },
328+
/// A closure that is coerced to a function pointer.
329+
Closure(Ty<'tcx>),
330+
}
331+
315332
/// The lowered representation of a single function.
316333
#[derive(Clone, TyEncodable, TyDecodable, Debug, HashStable, TypeFoldable, TypeVisitable)]
317334
pub struct Body<'tcx> {
@@ -375,8 +392,24 @@ pub struct Body<'tcx> {
375392

376393
/// Constants that are required to evaluate successfully for this MIR to be well-formed.
377394
/// We hold in this field all the constants we are not able to evaluate yet.
395+
///
396+
/// This is soundness-critical, we make a guarantee that all consts syntactically mentioned in a
397+
/// function have successfully evaluated if the function ever gets executed at runtime.
378398
pub required_consts: Vec<ConstOperand<'tcx>>,
379399

400+
/// Further items that were mentioned in this function and hence *may* become monomorphized,
401+
/// depending on optimizations. We use this to avoid optimization-dependent compile errors: the
402+
/// collector recursively traverses all "mentioned" items and evaluates all their
403+
/// `required_consts`.
404+
///
405+
/// This is *not* soundness-critical and the contents of this list are *not* a stable guarantee.
406+
/// All that's relevant is that this set is optimization-level-independent, and that it includes
407+
/// everything that the collector would consider "used". (For example, we currently compute this
408+
/// set after drop elaboration, so some drop calls that can never be reached are not considered
409+
/// "mentioned".) See the documentation of `CollectionMode` in
410+
/// `compiler/rustc_monomorphize/src/collector.rs` for more context.
411+
pub mentioned_items: Vec<Spanned<MentionedItem<'tcx>>>,
412+
380413
/// Does this body use generic parameters. This is used for the `ConstEvaluatable` check.
381414
///
382415
/// Note that this does not actually mean that this body is not computable right now.
@@ -453,6 +486,7 @@ impl<'tcx> Body<'tcx> {
453486
var_debug_info,
454487
span,
455488
required_consts: Vec::new(),
489+
mentioned_items: Vec::new(),
456490
is_polymorphic: false,
457491
injection_phase: None,
458492
tainted_by_errors,
@@ -482,6 +516,7 @@ impl<'tcx> Body<'tcx> {
482516
spread_arg: None,
483517
span: DUMMY_SP,
484518
required_consts: Vec::new(),
519+
mentioned_items: Vec::new(),
485520
var_debug_info: Vec::new(),
486521
is_polymorphic: false,
487522
injection_phase: None,
@@ -576,6 +611,17 @@ impl<'tcx> Body<'tcx> {
576611
}
577612
}
578613

614+
pub fn span_for_ty_context(&self, ty_context: TyContext) -> Span {
615+
match ty_context {
616+
TyContext::UserTy(span) => span,
617+
TyContext::ReturnTy(source_info)
618+
| TyContext::LocalDecl { source_info, .. }
619+
| TyContext::YieldTy(source_info)
620+
| TyContext::ResumeTy(source_info) => source_info.span,
621+
TyContext::Location(loc) => self.source_info(loc).span,
622+
}
623+
}
624+
579625
/// Returns the return type; it always return first element from `local_decls` array.
580626
#[inline]
581627
pub fn return_ty(&self) -> Ty<'tcx> {

compiler/rustc_mir_build/src/build/custom/mod.rs

+1
Original file line numberDiff line numberDiff line change
@@ -56,6 +56,7 @@ pub(super) fn build_custom_mir<'tcx>(
5656
var_debug_info: Vec::new(),
5757
span,
5858
required_consts: Vec::new(),
59+
mentioned_items: Vec::new(),
5960
is_polymorphic: false,
6061
tainted_by_errors: None,
6162
injection_phase: None,

compiler/rustc_mir_transform/src/inline.rs

+20-1
Original file line numberDiff line numberDiff line change
@@ -565,7 +565,8 @@ impl<'tcx> Inliner<'tcx> {
565565
mut callee_body: Body<'tcx>,
566566
) {
567567
let terminator = caller_body[callsite.block].terminator.take().unwrap();
568-
let TerminatorKind::Call { args, destination, unwind, target, .. } = terminator.kind else {
568+
let TerminatorKind::Call { func, args, destination, unwind, target, .. } = terminator.kind
569+
else {
569570
bug!("unexpected terminator kind {:?}", terminator.kind);
570571
};
571572

@@ -717,6 +718,24 @@ impl<'tcx> Inliner<'tcx> {
717718
Const::Val(..) | Const::Unevaluated(..) => true,
718719
},
719720
));
721+
// Now that we incorporated the callee's `required_consts`, we can remove the callee from
722+
// `mentioned_items` -- but we have to take their `mentioned_items` in return. This does
723+
// some extra work here to save the monomorphization collector work later. It helps a lot,
724+
// since monomorphization can avoid a lot of work when the "mentioned items" are similar to
725+
// the actually used items. By doing this we can entirely avoid visiting the callee!
726+
// We need to reconstruct the `required_item` for the callee so that we can find and
727+
// remove it.
728+
let callee_item = MentionedItem::Fn(func.ty(caller_body, self.tcx));
729+
if let Some(idx) =
730+
caller_body.mentioned_items.iter().position(|item| item.node == callee_item)
731+
{
732+
// We found the callee, so remove it and add its items instead.
733+
caller_body.mentioned_items.remove(idx);
734+
caller_body.mentioned_items.extend(callee_body.mentioned_items);
735+
} else {
736+
// If we can't find the callee, there's no point in adding its items.
737+
// Probably it already got removed by being inlined elsewhere in the same function.
738+
}
720739
}
721740

722741
fn make_call_args(

compiler/rustc_mir_transform/src/lib.rs

+5
Original file line numberDiff line numberDiff line change
@@ -89,6 +89,7 @@ mod lint;
8989
mod lower_intrinsics;
9090
mod lower_slice_len;
9191
mod match_branches;
92+
mod mentioned_items;
9293
mod multiple_return_terminators;
9394
mod normalize_array_len;
9495
mod nrvo;
@@ -566,6 +567,10 @@ fn run_optimization_passes<'tcx>(tcx: TyCtxt<'tcx>, body: &mut Body<'tcx>) {
566567
tcx,
567568
body,
568569
&[
570+
// Before doing anything, remember which items are being mentioned so that the set of items
571+
// visited does not depend on the optimization level.
572+
&mentioned_items::MentionedItems,
573+
// Add some UB checks before any UB gets optimized away.
569574
&check_alignment::CheckAlignment,
570575
// Before inlining: trim down MIR with passes to reduce inlining work.
571576

Original file line numberDiff line numberDiff line change
@@ -0,0 +1,117 @@
1+
use rustc_middle::mir::visit::Visitor;
2+
use rustc_middle::mir::{self, Location, MentionedItem, MirPass};
3+
use rustc_middle::ty::{self, adjustment::PointerCoercion, TyCtxt};
4+
use rustc_session::Session;
5+
use rustc_span::source_map::Spanned;
6+
7+
pub struct MentionedItems;
8+
9+
struct MentionedItemsVisitor<'a, 'tcx> {
10+
tcx: TyCtxt<'tcx>,
11+
body: &'a mir::Body<'tcx>,
12+
mentioned_items: &'a mut Vec<Spanned<MentionedItem<'tcx>>>,
13+
}
14+
15+
impl<'tcx> MirPass<'tcx> for MentionedItems {
16+
fn is_enabled(&self, _sess: &Session) -> bool {
17+
// If this pass is skipped the collector assume that nothing got mentioned! We could
18+
// potentially skip it in opt-level 0 if we are sure that opt-level will never *remove* uses
19+
// of anything, but that still seems fragile. Furthermore, even debug builds use level 1, so
20+
// special-casing level 0 is just not worth it.
21+
true
22+
}
23+
24+
fn run_pass(&self, tcx: TyCtxt<'tcx>, body: &mut mir::Body<'tcx>) {
25+
debug_assert!(body.mentioned_items.is_empty());
26+
let mut mentioned_items = Vec::new();
27+
MentionedItemsVisitor { tcx, body, mentioned_items: &mut mentioned_items }.visit_body(body);
28+
body.mentioned_items = mentioned_items;
29+
}
30+
}
31+
32+
// This visitor is carefully in sync with the one in `rustc_monomorphize::collector`. We are
33+
// visiting the exact same places but then instead of monomorphizing and creating `MonoItems`, we
34+
// have to remain generic and just recording the relevant information in `mentioned_items`, where it
35+
// will then be monomorphized later during "mentioned items" collection.
36+
impl<'tcx> Visitor<'tcx> for MentionedItemsVisitor<'_, 'tcx> {
37+
fn visit_terminator(&mut self, terminator: &mir::Terminator<'tcx>, location: Location) {
38+
self.super_terminator(terminator, location);
39+
let span = || self.body.source_info(location).span;
40+
match &terminator.kind {
41+
mir::TerminatorKind::Call { func, .. } => {
42+
let callee_ty = func.ty(self.body, self.tcx);
43+
self.mentioned_items
44+
.push(Spanned { node: MentionedItem::Fn(callee_ty), span: span() });
45+
}
46+
mir::TerminatorKind::Drop { place, .. } => {
47+
let ty = place.ty(self.body, self.tcx).ty;
48+
self.mentioned_items.push(Spanned { node: MentionedItem::Drop(ty), span: span() });
49+
}
50+
mir::TerminatorKind::InlineAsm { ref operands, .. } => {
51+
for op in operands {
52+
match *op {
53+
mir::InlineAsmOperand::SymFn { ref value } => {
54+
self.mentioned_items.push(Spanned {
55+
node: MentionedItem::Fn(value.const_.ty()),
56+
span: span(),
57+
});
58+
}
59+
_ => {}
60+
}
61+
}
62+
}
63+
_ => {}
64+
}
65+
}
66+
67+
fn visit_rvalue(&mut self, rvalue: &mir::Rvalue<'tcx>, location: Location) {
68+
self.super_rvalue(rvalue, location);
69+
let span = || self.body.source_info(location).span;
70+
match *rvalue {
71+
// We need to detect unsizing casts that required vtables.
72+
mir::Rvalue::Cast(
73+
mir::CastKind::PointerCoercion(PointerCoercion::Unsize),
74+
ref operand,
75+
target_ty,
76+
)
77+
| mir::Rvalue::Cast(mir::CastKind::DynStar, ref operand, target_ty) => {
78+
// This isn't monomorphized yet so we can't tell what the actual types are -- just
79+
// add everything that may involve a vtable.
80+
let source_ty = operand.ty(self.body, self.tcx);
81+
let may_involve_vtable = match (
82+
source_ty.builtin_deref(true).map(|t| t.ty.kind()),
83+
target_ty.builtin_deref(true).map(|t| t.ty.kind()),
84+
) {
85+
(Some(ty::Array(..)), Some(ty::Str | ty::Slice(..))) => false, // &str/&[T] unsizing
86+
_ => true,
87+
};
88+
if may_involve_vtable {
89+
self.mentioned_items.push(Spanned {
90+
node: MentionedItem::UnsizeCast { source_ty, target_ty },
91+
span: span(),
92+
});
93+
}
94+
}
95+
// Similarly, record closures that are turned into function pointers.
96+
mir::Rvalue::Cast(
97+
mir::CastKind::PointerCoercion(PointerCoercion::ClosureFnPointer(_)),
98+
ref operand,
99+
_,
100+
) => {
101+
let source_ty = operand.ty(self.body, self.tcx);
102+
self.mentioned_items
103+
.push(Spanned { node: MentionedItem::Closure(source_ty), span: span() });
104+
}
105+
// And finally, function pointer reification casts.
106+
mir::Rvalue::Cast(
107+
mir::CastKind::PointerCoercion(PointerCoercion::ReifyFnPointer),
108+
ref operand,
109+
_,
110+
) => {
111+
let fn_ty = operand.ty(self.body, self.tcx);
112+
self.mentioned_items.push(Spanned { node: MentionedItem::Fn(fn_ty), span: span() });
113+
}
114+
_ => {}
115+
}
116+
}
117+
}

compiler/rustc_mir_transform/src/shim.rs

+3-1
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ use std::iter;
1717

1818
use crate::{
1919
abort_unwinding_calls, add_call_guards, add_moves_for_packed_drops, deref_separator,
20-
pass_manager as pm, remove_noop_landing_pads, simplify,
20+
mentioned_items, pass_manager as pm, remove_noop_landing_pads, simplify,
2121
};
2222
use rustc_middle::mir::patch::MirPatch;
2323
use rustc_mir_dataflow::elaborate_drops::{self, DropElaborator, DropFlagMode, DropStyle};
@@ -147,6 +147,7 @@ fn make_shim<'tcx>(tcx: TyCtxt<'tcx>, instance: ty::InstanceDef<'tcx>) -> Body<'
147147
tcx,
148148
&mut body,
149149
&[
150+
&mentioned_items::MentionedItems,
150151
&abort_unwinding_calls::AbortUnwindingCalls,
151152
&add_call_guards::CriticalCallEdges,
152153
],
@@ -178,6 +179,7 @@ fn make_shim<'tcx>(tcx: TyCtxt<'tcx>, instance: ty::InstanceDef<'tcx>) -> Body<'
178179
tcx,
179180
&mut result,
180181
&[
182+
&mentioned_items::MentionedItems,
181183
&add_moves_for_packed_drops::AddMovesForPackedDrops,
182184
&deref_separator::Derefer,
183185
&remove_noop_landing_pads::RemoveNoopLandingPads,

0 commit comments

Comments
 (0)