pytorch / ao Public

Notifications You must be signed in to change notification settings
Fork 252
Star 2k

Code
Issues 233
Pull requests 115
Discussions
Actions
Projects
Security
Insights

Additional navigation options

Code
Issues
Pull requests
Discussions
Actions
Projects
Security
Insights

Issues: pytorch/ao

[RFC] torchao Contributor Guide

#391 opened Jun 18, 2024 by jerryzh168

Open 16

low precision training upcoming feature tracker

#556 opened Jul 30, 2024 by vkuzo

Open 2

Multibackend tracker

#1082 opened Oct 15, 2024 by msaroufim

Open

Beta

Labels 55 Milestones 0

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

233 Open 223 Closed

Author

Filter by author

Label

Filter by label

Use alt + click/return to exclude labels

or ⇧ + click/return for logical OR

Projects

Filter by project

Milestones

Filter by milestone

Assignee

Filter by who’s assigned

Assigned to nobody

Sort

Sort by

Newest Oldest Most commented Least commented Recently updated Least recently updated Best match

Most reactions

Issues list

AO/GemLite tensors produce incorrect outputs in vLLM

#2141 opened Apr 28, 2025 by mobicham

QAT model drops accuracy after converting with torch.ao.quantization.convert

#2138 opened Apr 28, 2025 by tranngocduvnvp

Conda recipe? binaries

#2137 opened Apr 27, 2025 by sarthakpati

int8 quantization with FSDP for inference error

#2127 opened Apr 25, 2025 by Andy0422

Question about dtype check in marlin_qqq validation for w4a8 functionality

#2115 opened Apr 23, 2025 by xxw11

[PT2E] observers do not handle inputs with different shapes correctly

#2112 opened Apr 23, 2025 by Xia-Weiwen

Got unexpected low speed using quantization inference on qwen models.

#2102 opened Apr 22, 2025 by HaoKang-Timmy

[Tracker] TorchAO activation sparsity acceleration 🚀

#2095 opened Apr 22, 2025 by jcaip

2 of 9 tasks

[Quant][PT2E] AffineQuantized observers failed Resnet18

#2094 opened Apr 22, 2025 by Xia-Weiwen

Deprecate/Remove GPTQ.py

#2089 opened Apr 21, 2025 by jerryzh168

How to automatically install the latest TorchAO nightly wheel

#2086 opened Apr 21, 2025 by MingxuZh

Refactor torchao and tests to use model architectures from torchao.testing.model_architectures good first issue

Good for newcomers

#2078 opened Apr 18, 2025 by jainapurva

Dynamo error with large mesh + AdamWFp8 + bf16 stochastic rounding bug

Something isn't working

distributed optimizer

#2074 opened Apr 18, 2025 by cassanof

Make lm_eval optional dependency

#2073 opened Apr 18, 2025 by jainapurva

Remove old subclass implementation to reduce maintainence cost topic: deprecation

Use this tag if this PR deprecates a feature

#2056 opened Apr 14, 2025 by jerryzh168

Making RCEIL the default for MXFP scale derivation mx

#2035 opened Apr 10, 2025 by frsun-nvda

Fix remaining issues when running on H100 machines

#2028 opened Apr 8, 2025 by jerryzh168

Failed to save the static quantized model quantize

#1950 opened Mar 25, 2025 by yiliu30

cast to mxfp8 across dim1 should be performant float8

#1945 opened Mar 24, 2025 by vkuzo

Torchao import time

#1944 opened Mar 24, 2025 by felipemello1

[Bug] FSDP2 FP8 compatibility problem with nn.Linear layers (GPU count > out_features) distributed float8

#1938 opened Mar 24, 2025 by HIT-cwh

FSDP2 + CPU Offload + AdamW8bit issue

#1931 opened Mar 21, 2025 by psinger

Torchao's CPU overhead counteracts the performance benefit of using quantization kernel.

#1930 opened Mar 21, 2025 by LuFinch

fp8 quantization with FSDP2 error

#1929 opened Mar 20, 2025 by happynear

Does torchao support FP8 Grouped GEMM? float8

#1928 opened Mar 20, 2025 by zigzagcai

Previous 1 2 3 4 5 … 9 10 Next

Previous Next

ProTip! Type g i on any issue or pull request to go back to the issue listing page.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly