Skip to content

Enable FP4 CUTLASS GEMM and CUDA quantization kernels (#4004) #5949

Enable FP4 CUTLASS GEMM and CUDA quantization kernels (#4004)

Enable FP4 CUTLASS GEMM and CUDA quantization kernels (#4004) #5949

Triggered via push April 29, 2025 00:11
Status Cancelled
Total duration 27m 54s
Artifacts 24
Matrix: build_artifact
Matrix: test_and_publish_artifact
Fit to window
Zoom out
Zoom in

Annotations

62 errors
build_artifact (x86, linux.24xlarge, 3.9, 12.8.0, clang)
The process '/usr/bin/git' failed with exit code 128
build_artifact (x86, linux.24xlarge, 3.13, 12.8.0, gcc)
The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
build_artifact (x86, linux.24xlarge, 3.10, 12.8.0, gcc)
The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
build_artifact (x86, linux.24xlarge, 3.12, 12.8.0, clang)
The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
build_artifact (x86, linux.24xlarge, 3.11, 12.8.0, gcc)
The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
build_artifact (x86, linux.24xlarge, 3.11, 12.8.0, clang)
The self-hosted runner lost communication with the server. Verify the machine is running and has a healthy network connection. Anything in your workflow that terminates the runner process, starves it for CPU/Memory, or blocks its network access can cause this error.
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.11, 12.8.0, 12.6.3, gcc)
Unable to download artifact(s): Artifact not found for name: fbgemm_gpu_nightly_genai_x86_gcc_py3.11_cu12.8.0.whl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.12, 12.8.0, 12.6.3, clang)
Unable to download artifact(s): Artifact not found for name: fbgemm_gpu_nightly_genai_x86_clang_py3.12_cu12.8.0.whl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.10, 12.8.0, 12.6.3, gcc)
Unable to download artifact(s): Artifact not found for name: fbgemm_gpu_nightly_genai_x86_gcc_py3.10_cu12.8.0.whl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.9, 12.8.0, 12.6.3, clang)
Unable to download artifact(s): Artifact not found for name: fbgemm_gpu_nightly_genai_x86_clang_py3.9_cu12.8.0.whl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.13, 12.8.0, 12.6.3, gcc)
Unable to download artifact(s): Artifact not found for name: fbgemm_gpu_nightly_genai_x86_gcc_py3.13_cu12.8.0.whl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.11, 12.8.0, 12.6.3, clang)
Unable to download artifact(s): Artifact not found for name: fbgemm_gpu_nightly_genai_x86_clang_py3.11_cu12.8.0.whl Please ensure that your artifact is not expired and the artifact was uploaded using a compatible version of toolkit/upload-artifact. For more information, visit the GitHub Artifacts FAQ: https://github.com/actions/toolkit/blob/main/packages/artifact/docs/faq.md
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.12, 12.6.3, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.9, 12.6.3, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.9, 11.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.9, 12.6.3, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.9, 12.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.10, 12.6.3, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.9, 11.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.10, 11.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.12, 11.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.10, 12.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.11, 12.6.3, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.12, 12.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.11, 11.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.13, 11.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.13, 11.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.12, 11.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.10, 11.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.10, 12.6.3, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.11, 11.8.0, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.12, 12.6.3, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.13, 12.6.3, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.11, 12.6.3, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.13, 12.6.3, 12.6.3, gcc)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
FBGEMM_GPU-GenAI CI
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists
test_and_publish_artifact (x86, linux.g5.4xlarge.nvidia.gpu, 3.13, 12.8.0, 12.6.3, clang)
Canceling since a higher priority waiting request for FBGEMM_GPU-GenAI CI-refs/heads/main exists

Artifacts

Produced during runtime
Name Size Digest
fbgemm_gpu_nightly_genai_x86_clang_py3.10_cu11.8.0.whl
5.25 MB
sha256:06402937e8682fdec27e142bdd8f0ae18f583ab9f8bc6892bb3375a43bd8c4ba
fbgemm_gpu_nightly_genai_x86_clang_py3.10_cu12.6.3.whl
11.9 MB
sha256:d707376e3c1d1f9de4f456240faa6e0c47f5f37fd834ced306e8bdab4272f4d5
fbgemm_gpu_nightly_genai_x86_clang_py3.10_cu12.8.0.whl
17.5 MB
sha256:1ec2e0612103b1e6b4f4305bd33d690bc9f6f5e733508c1040ee85279e1629fb
fbgemm_gpu_nightly_genai_x86_clang_py3.11_cu11.8.0.whl
5.25 MB
sha256:cbba2b078b68a8859c8734efab68129b267d2f0950c54ea59afbd1d3db5a3d3a
fbgemm_gpu_nightly_genai_x86_clang_py3.11_cu12.6.3.whl
11.9 MB
sha256:2ccb11ea8726c2622e1252575d179a3ef16fcbe51b72941473ad4602051a8ec6
fbgemm_gpu_nightly_genai_x86_clang_py3.12_cu11.8.0.whl
5.25 MB
sha256:d3de5f02c8a725a337e8875383a223ed695b6c26bee4c686bc6327a9f83f1e5b
fbgemm_gpu_nightly_genai_x86_clang_py3.12_cu12.6.3.whl
11.9 MB
sha256:7d7c4f3e85c70694da152e08e406f9bf657a9f6395daa54bb1598e6fcd1eda0a
fbgemm_gpu_nightly_genai_x86_clang_py3.13_cu11.8.0.whl
5.25 MB
sha256:929857162b57e1e0b3854e44da4b369c47e1748d6af1641a96dd2e04cda72776
fbgemm_gpu_nightly_genai_x86_clang_py3.13_cu12.6.3.whl
11.9 MB
sha256:041aedefd30f5573600868e0a2f57736192b466ec953d92d168e10c890918d42
fbgemm_gpu_nightly_genai_x86_clang_py3.13_cu12.8.0.whl
17.5 MB
sha256:d34bab703913fe3891ce74f6260033c2007eab21c32121f8d3e959c8eac11376
fbgemm_gpu_nightly_genai_x86_clang_py3.9_cu11.8.0.whl
5.25 MB
sha256:0e695c332004f6548b4993dc26a9a5472c1d7141fc9b7421a94d5f668c187ebd
fbgemm_gpu_nightly_genai_x86_clang_py3.9_cu12.6.3.whl
11.9 MB
sha256:0de79d868475d131031d70445b8f923f78b558756eb9a5724fbe1deb995fa079
fbgemm_gpu_nightly_genai_x86_gcc_py3.10_cu11.8.0.whl
5.15 MB
sha256:669d7c088ef0a84ccd3a79c6a960cf5bd1252c6cc8bca5c37100efe3b82f740f
fbgemm_gpu_nightly_genai_x86_gcc_py3.10_cu12.6.3.whl
11.8 MB
sha256:4fbfb0529ac695e9e04314bae13ea032bbbf84cd25be3770659802f54ef7dfaf
fbgemm_gpu_nightly_genai_x86_gcc_py3.11_cu11.8.0.whl
5.15 MB
sha256:e39f9302c2e9ad302cb4c9063160ca5531afc7bd211f170d213fc084c3a80f3e
fbgemm_gpu_nightly_genai_x86_gcc_py3.11_cu12.6.3.whl
11.8 MB
sha256:de040c71f957c13c747467c3012e4e5ba71e1c883cc99e09053ee9485d19629d
fbgemm_gpu_nightly_genai_x86_gcc_py3.12_cu11.8.0.whl
5.15 MB
sha256:a0ab2e1044ae07aee7347c7a4a6fa8fb51abf5afe342a17633bb27e9a0a7fc4b
fbgemm_gpu_nightly_genai_x86_gcc_py3.12_cu12.6.3.whl
11.8 MB
sha256:afcc5a50da05438fde3c61553d16dcfc72ca92af620931acda92813a00e6cf3e
fbgemm_gpu_nightly_genai_x86_gcc_py3.12_cu12.8.0.whl
17.5 MB
sha256:2ffce32ae9de9f945476d48432804b3b75ddc8c543183cecea7f0d0c6963e728
fbgemm_gpu_nightly_genai_x86_gcc_py3.13_cu11.8.0.whl
5.15 MB
sha256:d6c3b0618335de7b26020adeab4a9a02393041505e133d72d62eb7c2ec02b5cc
fbgemm_gpu_nightly_genai_x86_gcc_py3.13_cu12.6.3.whl
11.8 MB
sha256:dd811313b07e9bf776e21a0de9cec7ec226f5b78c615c042986ce1c00e9c3908
fbgemm_gpu_nightly_genai_x86_gcc_py3.9_cu11.8.0.whl
5.15 MB
sha256:3eaceeacf791fcc76e492d6e49df8ed0332b58bc22cb941cb79967b269bde4e4
fbgemm_gpu_nightly_genai_x86_gcc_py3.9_cu12.6.3.whl
11.8 MB
sha256:05ed393c5da0cde486c1499a8204a5ff0d957e7951675ea52cec09b8c14f4c92
fbgemm_gpu_nightly_genai_x86_gcc_py3.9_cu12.8.0.whl
17.5 MB
sha256:81172c4efb6a64358ebddf6fca86a8ae37bb754d8639e148e60e94397e7b1bfc