Skip to content

Latest commit

 

History

History
198 lines (156 loc) · 5.85 KB

0000-annotated-diagnostic-json-output.md

File metadata and controls

198 lines (156 loc) · 5.85 KB
  • Feature Name: annotated_diagnostic_json_output
  • Start Date: 2017-01-18
  • RFC PR:
  • Rust Issue:

Summary

Allow the compiler to annotate DiagnosticBuilder messages with metadata related to the contents being displayed, to allow external tooling more control over the displayed format, as well as allowing deeper integration with the type system.

Motivation

The compiler currently allows diagnostic messages to have highlighted parts:

These highlights are purely presentational, with no semantic meaning. By exposing these annotations, external tools can make use of it.

By changing it to have semantic meaning, these tools can take advantage of this extra information for deeper integration, like allowing navigation to type definition for portions of text that have been annotated as kind type, as well as proper highlighting of sections of the text.

Detailed design

The JsonEmitter Diagnostic Emitter shall be expanded to include a new field message_annotations in its output. This field would contain a list of objects with metadata about portions of the diagnostic's message field, in a similar way as Spans reffer to portions of a source code file.

The metadata shall include:

  • the kind of annotation,
  • the region of the message that it spans
  • the text content, (Maybe?, this information can be gotten from the message itself. The biggest benefits I see are that it makes it easier for tool makers to be lazy and that we could include different text from what is actually in the message, but both of these points may prove troublesome.)
  • an important boolean flag to allow us to communicate wether this annotation should be treated in such a way as to increase its visibility, i.e., bolded or highlighted in the presentation.

The kind of annotations should be:

  • type
  • highlight (alternative name: bold)
  • statement (alternative name: code)
  • file (alternative name: path)
  • url
  • ...

The existing json output of the compiler (rustc -Z unstable-options --error-format=json) for messages only includes the formatted text of the diagnostic message:

{
    "message": "expected type `usize`\n   found type `&'static str`",
    "code": null,
    "level": "note",
    "spans": [],
    "children": [],
    "rendered": null
}

After the proposed change the output would include new field containing semantic information for the content in the message:

{
  "message": "expected type `usize`\n   found type `&'static str`",
  "message_annotations": [
    {
      "start": 15,
      "end": 20,
      "kind": "type",
      "content": "usize",
      "important": true
    },
    {
      "start": 37,
      "end": 49,
      "kind":"type",
      "content": "&'static str"
      "highlighted": true
    },
  ],
  "code": null,
  "level": "note",
  "spans": [],
  "children": [],
  "rendered": null
}

On rustc, these same annotations presented to tools through the json error-format shall be used for colorizing and highlighting the output when possible.

How We Teach This

What names and terminology work best for these concepts and why?

Annotation: A message annotation is equivalent to a Span, only instead of code they carry information contained in a diagnostic message.

How is this idea best presented—as a continuation of existing Rust patterns, or as a wholly new one?

This is a continuation of the work started to provide highlights for diagnostic messages.

Would the acceptance of this proposal change how Rust is taught to new users at any level?

Not for end users, only for tool makers integrating error messages, and even then the change is only additive.

How should this feature be introduced and taught to existing Rust users?

What additions or changes to the Rust Reference, The Rust Programming Language, and/or Rust by Example does it entail?

Documenting the new output (I don't think there's any official documentation about this flag anywhere outside of internals and the bug tracker).

Drawbacks

Expanding the amount of information being outputted by the compiler could get unwildy for larger projects.

Introducing this can be seen by contributors as an encouragement to add tons of annotations to diagnostic messages, while restraint will probably be needed to avoid unnecessary extra information being added.

Alternatives

The current alternative that tools have as an option is trying to parse common diagnostic messages in search for interesting parts, like in the case of "expected"/"found" messages. This is not ideal, as it can lead to brittle interactions that can break often as the diagnostic messages have their wording and presentation refined.

A variation on this proposal s to modify the message field itself to contain the annotations:

{
  "message": [
    {"content": "expected type `"},
    {
      "start": 15,
      "end": 20,
      "kind": "type",
      "content": "usize",
      "highlighted": true
    },
    {"content": "`\n   found type `"},
    {
      "start": 37,
      "end": 49,
      "kind":"type",
      "content": "&'static str",
      "highlighted": true
    },
    {"content": "`"}
  ],
  "code":null,
  "level":"note",
  "spans":[],
  "children":[],
  "rendered":null
}

I find this output to be less ergonomic, as tools that might not want to bother with the new metadata being exposed to explicitly have to deal with it.

Unresolved questions

What parts of the design are still TBD?