-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Issues: NVIDIA/cutlass
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
[QST] Adding new parameter to Conv2dFprop in Python
? - Needs Triage
question
Question
#2166
opened Mar 12, 2025 by
IzanCatalan
[QST] Variable size Gemm that can be Cuda graphed
? - Needs Triage
question
Question
#2164
opened Mar 12, 2025 by
otropp
[QST] if(runtime_value) (cute::copy or cute::gemm) Generates NaNs
? - Needs Triage
question
Question
#2163
opened Mar 12, 2025 by
phantaurus
[BUG] Possible race condition in sm90_gemm_array_tma_warpspecialized_cooperative
? - Needs Triage
bug
Something isn't working
#2162
opened Mar 11, 2025 by
thefacetakt
[DOC] CUTLASS INT4 GEMM: Missing SM89 Dispatch Configuration for L40S/4090
? - Needs Triage
documentation
Documentation
#2158
opened Mar 8, 2025 by
hxu296
[BUG] "Got cutlass error: Error Internal at: " even though compilation is successful
? - Needs Triage
bug
Something isn't working
#2157
opened Mar 8, 2025 by
henrylhtsang
[BUG] BF16 * BF16 GEMM with pingpong schedule and non-zero beta hangs
? - Needs Triage
bug
Something isn't working
#2152
opened Mar 6, 2025 by
manishucsd
[BUG] CUTLASS Python Interface nvrtc fails on Hopper
? - Needs Triage
bug
Something isn't working
#2150
opened Mar 5, 2025 by
sommerlukas
[BUG] Possible numerical problems with a warpspecialized_cooperative_epi_tma op
? - Needs Triage
bug
Something isn't working
#2147
opened Mar 4, 2025 by
henrylhtsang
[QST] which is optimised way to iterate over the conv2d filter tensor
? - Needs Triage
question
Question
#2146
opened Mar 4, 2025 by
IzanCatalan
[QST] Bitwise Operations with Cutlass datatypes
? - Needs Triage
question
Question
#2145
opened Mar 4, 2025 by
IzanCatalan
[QST] Why does GenerateSM90_TensorOp_16b_WGMMA_alignx_gemm not generate C.element = DataType.void?
? - Needs Triage
question
Question
#2144
opened Mar 4, 2025 by
henrylhtsang
[QST] Permute in K mode for consistent LDSM results
? - Needs Triage
question
Question
#2140
opened Feb 26, 2025 by
capybara-club
[BUG] EpilogueTileAuto doesn't work when tile shape is (128, 112, 64)
? - Needs Triage
bug
Something isn't working
#2133
opened Feb 25, 2025 by
henrylhtsang
[QST] Permutation layout for contiguous stores
? - Needs Triage
question
Question
#2127
opened Feb 22, 2025 by
capybara-club
[BUG] Mixed Input H100 Kernel Hangs
? - Needs Triage
bug
Something isn't working
#2121
opened Feb 20, 2025 by
manishucsd
[QST] Utilizing both Tensor Cores and Cuda Cores, Possible to overlay GEMM calls?
? - Needs Triage
question
Question
#2117
opened Feb 17, 2025 by
zzhou292
I believe the layout composition in CUTLASS is not so robust
#2113
opened Feb 14, 2025 by
seemingwang
[BUG] sm_count may be ignored in persistent GEMMs
? - Needs Triage
bug
Something isn't working
#2108
opened Feb 13, 2025 by
milesvant
[QST]when will DelayTmaStore be important?
? - Needs Triage
question
Question
#2106
opened Feb 13, 2025 by
ziyuhuang123
[BUG] Wrong default reduction_identity on Epilogue Reduction Store operations
? - Needs Triage
bug
Something isn't working
#2105
opened Feb 13, 2025 by
bdh0404
[QST] Does fp8 on Ada Lovelace require CUDA 12.4 (>=550) driver?
? - Needs Triage
question
Question
#2102
opened Feb 12, 2025 by
yuc8939
[QST] Question about UniversalFMA
? - Needs Triage
question
Question
#2101
opened Feb 12, 2025 by
leven-comeon
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-02-12.