[ET-VK] Modify quantized linear tiling shader to linearly dispatch work to improve thread occupancy and performance. #10508

trivedivivek · 2025-04-28T03:43:39Z

Stack from ghstack (oldest at bottom):

[ET-VK] Using uint16 for quantized linear tiling shader to reduce register pressure and improve performance. #10509
-> [ET-VK] Modify quantized linear tiling shader to linearly dispatch work to improve thread occupancy and performance. #10508

This diff changes the tiled 8-bit quantized linear matmul op to dispatch work linearly, which increases thread occupancy and improves performance.
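To illustrate why a linear dispatch can raise occupancy, here is a minimal Python sketch, not the shader code from this diff. It assumes each invocation computes one output tile, and it compares a 2D dispatch (workgroup of `wg_x` by `wg_y` invocations) against a 1D dispatch in which each invocation recovers its tile coordinates from a flat index. All function names, the workgroup sizes, and the tile counts are hypothetical and chosen only for the example.

```python
import math


def linear_to_tile(idx, tiles_x, tiles_y):
    """Map a flat invocation index to (tile_x, tile_y).

    Returns None for padding invocations past the last tile, which
    would exit early in a shader.
    """
    if idx >= tiles_x * tiles_y:
        return None
    return idx % tiles_x, idx // tiles_x


def occupancy_2d(tiles_x, tiles_y, wg_x, wg_y):
    """Fraction of launched invocations doing useful work in a 2D dispatch."""
    # Each axis is rounded up to a whole number of workgroups separately.
    launched = (math.ceil(tiles_x / wg_x) * wg_x) * (math.ceil(tiles_y / wg_y) * wg_y)
    return tiles_x * tiles_y / launched


def occupancy_linear(tiles_x, tiles_y, wg_size):
    """Fraction of launched invocations doing useful work in a 1D dispatch."""
    total = tiles_x * tiles_y
    # Only the flat total is rounded up to a whole number of workgroups.
    launched = math.ceil(total / wg_size) * wg_size
    return total / launched


if __name__ == "__main__":
    # Hypothetical shape: 100 tiles along the output width, 1 along the height.
    print(occupancy_2d(100, 1, 8, 8))
    print(occupancy_linear(100, 1, 64))
    print(linear_to_tile(205, 100, 3))
```

Under these assumptions the 2D dispatch pads both axes independently, so a thin output (for example, a single row of tiles) leaves most of each workgroup idle, while the linear dispatch pads only the final workgroup.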

Differential Revision: D73751979


pytorch-bot · 2025-04-28T03:43:43Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/10508

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

⏳ No Failures, 69 Pending

As of commit 6bac017 with merge base df75088:
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2025-04-28T03:43:57Z

This pull request was exported from Phabricator. Differential Revision: D73751979


facebook-github-bot · 2025-04-28T13:53:20Z

This pull request was exported from Phabricator. Differential Revision: D73751979


facebook-github-bot · 2025-04-28T17:52:41Z

This pull request was exported from Phabricator. Differential Revision: D73751979

trivedivivek requested a review from SS-JIA as a code owner April 28, 2025 03:43

facebook-github-bot added the CLA Signed label Apr 28, 2025

trivedivivek mentioned this pull request Apr 28, 2025

[ET-VK] Using uint16 for quantized linear tiling shader to reduce register pressure and improve performance. #10509

Open

facebook-github-bot added the fb-exported label Apr 28, 2025

trivedivivek added the topic: not user facing label Apr 28, 2025
