-
Notifications
You must be signed in to change notification settings - Fork 13.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Make metadata a workproduct and reuse it #114669
base: master
Are you sure you want to change the base?
Conversation
@bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
⌛ Trying commit 3895d701e4a4c854cc003e599ac6efd49435aaef with merge 6a3ac25372eb1f930be018a4061b9b281bb3f42f... |
This comment has been minimized.
This comment has been minimized.
☀️ Try build successful - checks-actions |
This comment has been minimized.
This comment has been minimized.
Finished benchmarking commit (6a3ac25372eb1f930be018a4061b9b281bb3f42f): comparison URL. Overall result: ❌✅ regressions and improvements - ACTION NEEDEDBenchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never Warning ⚠: The following benchmark(s) failed to build:
Instruction countThis is a highly reliable metric that was used to determine the overall result at the top of this comment.
Max RSS (memory usage)ResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
CyclesResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
Binary sizeResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
Bootstrap: 632.289s -> 632.478s (0.03%) |
} | ||
//for (def_id, traits) in &tcx.resolutions(()).doc_link_traits_in_scope { | ||
// record_array!(self.tables.doc_link_traits_in_scope[def_id.to_def_id()] <- traits); | ||
//} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@petrochenkov I've been trying to avoid the dependency on resolutions
by using doc_link_resolutions
and doc_link_traits_in_scope
queries. But I could not understand in which case a definition would have an entry in those, and in which case they would ICE.
Is there a general rule?
Query description says "for a module". In which sense? resolve or mod
items?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Query description says "for a module". In which sense? resolve or
mod
items?
mod
items (or crate root).
There will be an entry if a module has doc links on items inside it (only exported items or all items, depending on options, resolve/late.rs determines the precise logic).
This is the data that rustdoc currently wants from us.
The query ICEs instead of producing None
because rustdoc is unreliable, and will have future bugs and unpredictable changes with high probability, this ICE prevents it from going out of sync with the logic in rustc.
Maybe you could use a "private" None
-returning query used for metadata, and a public ICEing wrapper around it used by rustdoc.
@bors try @rust-timer queue |
This comment has been minimized.
This comment has been minimized.
⌛ Trying commit 5f13bb1 with merge 2fce67ca52380d54b739a68cac1efc7c468cbaa6... |
☀️ Try build successful - checks-actions |
This comment has been minimized.
This comment has been minimized.
Finished benchmarking commit (2fce67ca52380d54b739a68cac1efc7c468cbaa6): comparison URL. Overall result: ❌✅ regressions and improvements - ACTION NEEDEDBenchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never Instruction countThis is a highly reliable metric that was used to determine the overall result at the top of this comment.
Max RSS (memory usage)ResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
CyclesResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
Binary sizeResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
Bootstrap: 632.009s -> 632.421s (0.07%) |
Make metadata a workproduct and reuse it This PR aims to skip the generation of metadata by reusing the infrastructure that already exists for compiled codegen-units, namely "workproducts". This can yield substantial gains (~10%) when we can demonstrate that metadata does not change between an incremental session and the next. This is the case if the crate is unchanged, or if all the changes are in upstream crates and have no effect on it. This latter case is most interesting, as it arises regularly for users with several crates in their workspace. TODO: - [ ] Materialize the fact that metadata encoding relies on the relative order of definitions; - [ ] Refactor the handling of doc links.
☀️ Try build successful - checks-actions |
This comment has been minimized.
This comment has been minimized.
Finished benchmarking commit (9310a5e): comparison URL. Overall result: ❌✅ regressions and improvements - ACTION NEEDEDBenchmarking this pull request likely means that it is perf-sensitive, so we're automatically marking it as not fit for rolling up. While you can manually mark this PR as fit for rollup, we strongly recommend not doing so since this PR may lead to changes in compiler perf. Next Steps: If you can justify the regressions found in this try perf run, please indicate this with @bors rollup=never Instruction countThis is a highly reliable metric that was used to determine the overall result at the top of this comment.
Max RSS (memory usage)ResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
CyclesResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
Binary sizeResultsThis is a less reliable metric that may be of interest but was not used to determine the overall result at the top of this comment.
Bootstrap: 630.664s -> 631.304s (0.10%) |
@@ -56,6 +56,21 @@ impl OngoingCodegen { | |||
backend_config: &BackendConfig, | |||
) -> (CodegenResults, FxIndexMap<WorkProductId, WorkProduct>) { | |||
let mut work_products = FxIndexMap::default(); | |||
|
|||
if !backend_config.disable_incr_cache { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reusing the crate metadata when disable_incr_cache is set is fine. This option exists to allow using incr comp while still recreating all object files. This allows for faster iteration of cg_clif changes by allowing the frontend results to be cached while still respecting changes to cg_clif.
{ | ||
work_products.insert(id, product); | ||
} | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This should be done for all codegen backends. Maybe move it to rustc_incremental or rustc_interface? rustc_interface is responsible for ensuring that the .rmeta file is written in the first place.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
rustc_interface
has the .rmeta
file written by calling encode_and_write_metadata(tcx)
. This specific code path fetches the metadata to put into the rlib
file.
These commits modify the If this was unintentional then you should revert the changes before this PR is merged. Some changes occurred in compiler/rustc_codegen_cranelift cc @bjorn3 |
From triage. @cjgillot Am I right that comments were addressed and it's ready for review? |
☔ The latest upstream changes (presumably #130724) made this pull request unmergeable. Please resolve the merge conflicts. |
☔ The latest upstream changes (presumably #135592) made this pull request unmergeable. Please resolve the merge conflicts. |
Status is not very clear, still waiting on author. |
This PR aims to skip the generation of metadata by reusing the infrastructure that already exists for compiled codegen-units, namely "workproducts".
This can yield substantial gains (~10%) when we can demonstrate that metadata does not change between an incremental session and the next. This is the case if the crate is unchanged, or if all the changes are in upstream crates and have no effect on it. This latter case is most interesting, as it arises regularly for users with several crates in their workspace.
TODO: