QAT model drops accuracy after converting with torch.ao.quantization.convert #2138

tranngocduvnvp · 2025-04-28T01:39:36Z

Hello everyone.

I am implementing QAT model yolov8 in 4bit mode for weight and 8bit for activation by setting quant_min, quant_max in config. The model when training and eval gives quite good results, however when I convert using torch.ao.quantization.convert method, the model gives very bad evaluation results. Does anyone know how to solve this problem?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QAT model drops accuracy after converting with torch.ao.quantization.convert #2138

QAT model drops accuracy after converting with torch.ao.quantization.convert #2138

tranngocduvnvp commented Apr 28, 2025

QAT model drops accuracy after converting with torch.ao.quantization.convert #2138

QAT model drops accuracy after converting with torch.ao.quantization.convert #2138

Comments

tranngocduvnvp commented Apr 28, 2025