Improve Fused8BitRowwiseQuantizedSBFloatToFloatOrHalfNeon by 5%-15% #218
fbgemm_gpu_benchmark_cuda.yml
on: pull_request
Matrix: build_artifact
Matrix: benchmark_artifact
Artifacts
Produced during runtime
Name | Size | Digest | |
---|---|---|---|
fbgemm_gpu_nightly_cuda_x86_gcc_py3.13_cu12.8.0.whl
|
962 MB |
sha256:8d6e4ca22bd186ff6919266685275dbe48d1aa2cc00753d65dc319a4e76ca1f3
|
|