Issues: pytorch/ao
QAT model drops accuracy after converting with torch.ao.quantization.convert
#2138 opened Apr 28, 2025 by tranngocduvnvp
Question about dtype check in marlin_qqq validation for w4a8 functionality
#2115 opened Apr 23, 2025 by xxw11
[PT2E] observers do not handle inputs with different shapes correctly
#2112 opened Apr 23, 2025 by Xia-Weiwen
Got unexpected low speed using quantization inference on qwen models.
#2102 opened Apr 22, 2025 by HaoKang-Timmy
[Tracker] TorchAO activation sparsity acceleration 🚀
#2095 opened Apr 22, 2025 by jcaip
2 of 9 tasks
Refactor torchao and tests to use model architectures from torchao.testing.model_architectures
good first issue
#2078 opened Apr 18, 2025 by jainapurva
Dynamo error with large mesh + AdamWFp8 + bf16 stochastic rounding
bug
distributed
optimizer
#2074 opened Apr 18, 2025 by cassanof
Remove old subclass implementation to reduce maintenance cost
topic: deprecation
#2056 opened Apr 14, 2025 by jerryzh168
[Bug] FSDP2 FP8 compatibility problem with nn.Linear layers (GPU count > out_features)
distributed
float8
#1938 opened Mar 24, 2025 by HIT-cwh
Torchao's CPU overhead counteracts the performance benefit of using quantization kernel.
#1930 opened Mar 21, 2025 by LuFinch