Skip to content

Add --fail-under flag for minimum coverage threshold #2075

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 3 commits into
base: main
Choose a base branch
from

Conversation

vvs-personalstash
Copy link

@vvs-personalstash vvs-personalstash commented Apr 18, 2025

Implement --fail-under Flag for Enforcing Minimum Code Coverage Thresholds

This pull request introduces a new command-line flag, --fail-under, to the package:coverage tool. The feature is modeled after the behavior of the --fail-under flag in Python’s coverage.py library.

Overview of Changes

New Feature: --fail-under

  • Accepts a numeric percentage value representing the minimum acceptable code coverage.
  • If the actual test coverage falls below this threshold, the tool will return a non-zero exit code (exit(1)), signaling failure.

Optional Precision Control: --precision

  • An additional --precision flag has been introduced to control the number of decimal places shown in the computed coverage percentage.

Need Clarity

-Currently i have only added tests for directly testing the coverage percentage calculator,Is it sufficient or should I modify the test_with_coverage_test to test the flag also

  • I’ve reviewed the contributor guide and applied the relevant portions to this PR.

Fixes #514

Signed-off-by: vvs-personalstash <[email protected]>
}
}

int precision;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What's this option for? Was there a reason you decided to add it?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

It was present in the coverage.py implementation of fail-under to accurately match the coverage percentage in decimals while printing and therefore i added it as it seemed like a good feature for better cosmetics.If its not required i can remove it. An example is like if we have a 93.267% of actual coverage and we do --fail-under=93.3 it will print Error: Coverage 93.0% is less than required 93.3% but if we do --precision=1 we will have an output of Error: Coverage 93.2% is less than required 93.3%

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems unnecessary to me. I think just printing 2 decimal places should be fine.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hmm it seems you are right I will just round it up to 2 places then, I shall begin working on your proposed changes.Thank You.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No, I mean I wouldn't really bother rounding at all. Just print 2 decimal places when you're printing the unrounded number

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Oh yes, that makes more sense — I will do that. Thank you.

Copy link
Author

@vvs-personalstash vvs-personalstash Apr 22, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Hi Liam, Sorry for asking you about this again but I have thought about it more and think that isn't it more convenient to use the --precision as well as rounding off upto our precision i.e. round up to the precision value for the users of the package because:-

  1. If we compare hard values while enforcing threshold then in cases such as lets say e.g 94.999 and 95 are almost same and we wont need our CI to fail because of this.
  2. In medium and small projects 1 extra line covered leads to drastic change in percentages hence we can keep precision on a low or =0 value as even a difference of 0.1 maybe acceptable to users in CI i.e they would want a 95.4 to pass if they keep the threshold 95.5
  3. In large projects we can keep a more precise threshold but still we wont want our CI to fail at 94.4999 if our threshold is 95.5.
    Therefore I still propose to keep the --precision and rounding off to precision in the implementation as thresholding will provide users more explicit control on how strongly they want to enforce their threshold for nearby values rather than them adding tests to cover 1 more line or constantly changing the -fail-under value.
    [I just saw I didn't mention about the rounding off upto precision in the first comment sorry about that]
    I maybe making some wrong assumptions or inferences in this and would really like your opinion on this matter again.Thank you.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't choosing a threshold of X with a precision of Y is the same as choosing a threshold of X-Y?

  1. If we compare hard values while enforcing threshold then in cases such as lets say e.g 94.999 and 95 are almost same and we wont need our CI to fail because of this.

You'd still have a hard threshold. If precision=0.1, you're just shifting the hard threshold from 95 to 94.9

If the project is healthy, then the coverage should be significantly above whatever threshold they've set, so a change of a few lines won't matter. I think of this flag as a safety net to warn them when they've been lax in their testing for a while, and need a concerted effort of test writing to bring their coverage ratio back up. If users want more fine grained monitoring they should use Coveralls.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks For Your explanation. I have implemented the requested changes and I am working on the integration tests

@@ -160,6 +173,29 @@ ${parser.usage}
);
}

double? failUnder;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is duplicated logic with the formatter. Maybe remove it and pass through the flag string unparsed? The formatter step will take care of parsing and validation.

var coveredLines = 0;

for (final entry in hitmap.entries) {
final lineHits = entry.value.lineHits;
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you also include the branchHits, if any? It's possible to have 100% line coverage but miss some branches. No need to include the funcHits as they are a subset of the lineHits.

Copy link
Contributor

@liamappelbe liamappelbe left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The unit test that checks the calculateCoveragePercentage function is a good start, but we'll also need an integration test to verify that the check's logic works: one test that verifies that the command succeeds when the coverage percentage is above the threshold, and test that verifies it fails when the coverage is below the threshold. There's a few existing integration tests that you can piggy back on. Maybe test_with_coverage_test.dart, but there's others too.

Copy link

Package publishing

Package Version Status Publish tag (post-merge)
package:bazel_worker 1.1.3 already published at pub.dev
package:benchmark_harness 2.3.1 already published at pub.dev
package:boolean_selector 2.1.2 already published at pub.dev
package:browser_launcher 1.1.3 already published at pub.dev
package:cli_config 0.2.1-wip WIP (no publish necessary)
package:cli_util 0.4.2 already published at pub.dev
package:clock 1.1.2 already published at pub.dev
package:code_builder 4.10.2-wip WIP (no publish necessary)
package:coverage 1.13.0-wip (error) pubspec version (1.13.0-wip) and changelog (1.13.0) don't agree
package:csslib 1.0.2 already published at pub.dev
package:extension_discovery 2.1.0 already published at pub.dev
package:file 7.0.2-wip WIP (no publish necessary)
package:file_testing 3.1.0-wip WIP (no publish necessary)
package:glob 2.1.3 already published at pub.dev
package:graphs 2.3.3-wip WIP (no publish necessary)
package:html 0.15.5+1 already published at pub.dev
package:io 1.1.0-wip WIP (no publish necessary)
package:json_rpc_2 3.0.3 already published at pub.dev
package:markdown 7.3.1-wip WIP (no publish necessary)
package:mime 2.0.0 already published at pub.dev
package:oauth2 2.0.4-wip WIP (no publish necessary)
package:package_config 2.3.0-wip WIP (no publish necessary)
package:pool 1.5.2-wip WIP (no publish necessary)
package:pub_semver 2.2.0 already published at pub.dev
package:pubspec_parse 1.5.0 already published at pub.dev
package:source_map_stack_trace 2.1.3-wip WIP (no publish necessary)
package:source_maps 0.10.14-wip WIP (no publish necessary)
package:source_span 1.10.1 already published at pub.dev
package:sse 4.1.8 ready to publish sse-v4.1.8
package:stack_trace 1.12.1 already published at pub.dev
package:stream_channel 2.1.4 already published at pub.dev
package:stream_transform 2.1.2-wip WIP (no publish necessary)
package:string_scanner 1.4.1 already published at pub.dev
package:term_glyph 1.2.3-wip WIP (no publish necessary)
package:test_reflective_loader 0.2.3 already published at pub.dev
package:timing 1.0.2 already published at pub.dev
package:unified_analytics 8.0.1 already published at pub.dev
package:watcher 1.1.1 already published at pub.dev
package:yaml 3.1.3 already published at pub.dev
package:yaml_edit 2.2.2 already published at pub.dev

Documentation at https://github.com/dart-lang/ecosystem/wiki/Publishing-automation.

Copy link

PR Health

Breaking changes ✔️
Package Change Current Version New Version Needed Version Looking good?
coverage None 1.12.0 1.13.0-wip 1.12.0 ✔️
Changelog Entry ✔️
Package Changed Files

Changes to files need to be accounted for in their respective changelogs.

Coverage ⚠️
File Coverage
pkgs/coverage/bin/format_coverage.dart 💔 45 % ⬇️ 8 %
pkgs/coverage/bin/test_with_coverage.dart 💚 36 % ⬆️ 1 %
pkgs/coverage/lib/src/coverage_percentage.dart 💚 100 %

This check for test coverage is informational (issues shown here will not fail the PR).

This check can be disabled by tagging the PR with skip-coverage-check.

API leaks ✔️

The following packages contain symbols visible in the public API, but not exported by the library. Export these symbols or remove them from your publicly visible API.

Package Leaked API symbols
License Headers ✔️
// Copyright (c) 2025, the Dart project authors. Please see the AUTHORS file
// for details. All rights reserved. Use of this source code is governed by a
// BSD-style license that can be found in the LICENSE file.
Files
no missing headers

All source files should start with a license header.

Unrelated files missing license headers
Files
pkgs/bazel_worker/benchmark/benchmark.dart
pkgs/bazel_worker/example/client.dart
pkgs/bazel_worker/example/worker.dart
pkgs/benchmark_harness/integration_test/perf_benchmark_test.dart
pkgs/boolean_selector/example/example.dart
pkgs/clock/lib/clock.dart
pkgs/clock/lib/src/clock.dart
pkgs/clock/lib/src/default.dart
pkgs/clock/lib/src/stopwatch.dart
pkgs/clock/lib/src/utils.dart
pkgs/clock/test/clock_test.dart
pkgs/clock/test/default_test.dart
pkgs/clock/test/stopwatch_test.dart
pkgs/clock/test/utils.dart
pkgs/coverage/lib/src/coverage_options.dart
pkgs/coverage/test/collect_coverage_config_test.dart
pkgs/coverage/test/config_file_locator_test.dart
pkgs/html/example/main.dart
pkgs/html/lib/dom.dart
pkgs/html/lib/dom_parsing.dart
pkgs/html/lib/html_escape.dart
pkgs/html/lib/parser.dart
pkgs/html/lib/src/constants.dart
pkgs/html/lib/src/encoding_parser.dart
pkgs/html/lib/src/html_input_stream.dart
pkgs/html/lib/src/list_proxy.dart
pkgs/html/lib/src/query_selector.dart
pkgs/html/lib/src/token.dart
pkgs/html/lib/src/tokenizer.dart
pkgs/html/lib/src/treebuilder.dart
pkgs/html/lib/src/utils.dart
pkgs/html/test/dom_test.dart
pkgs/html/test/parser_feature_test.dart
pkgs/html/test/parser_test.dart
pkgs/html/test/query_selector_test.dart
pkgs/html/test/selectors/level1_baseline_test.dart
pkgs/html/test/selectors/level1_lib.dart
pkgs/html/test/selectors/selectors.dart
pkgs/html/test/support.dart
pkgs/html/test/tokenizer_test.dart
pkgs/pubspec_parse/test/git_uri_test.dart
pkgs/stack_trace/example/example.dart
pkgs/watcher/test/custom_watcher_factory_test.dart
pkgs/yaml_edit/example/example.dart

@coveralls
Copy link

Pull Request Test Coverage Report for Build 14562796465

Warning: This coverage report may be inaccurate.

This pull request's base commit is no longer the HEAD commit of its target branch. This means it includes changes from outside the original pull request, including, potentially, unrelated coverage changes.

Details

  • 12 of 12 (100.0%) changed or added relevant lines in 1 file are covered.
  • No unchanged relevant lines lost coverage.
  • Overall coverage increased (+0.1%) to 93.57%

Totals Coverage Status
Change from base Build 14554281888: 0.1%
Covered Lines: 684
Relevant Lines: 731

💛 - Coveralls

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Add --fail-under option
3 participants