Skip to content

Commit d18ecd5

Browse files
committed
make mmq gen faster for amd
1 parent 243894a commit d18ecd5

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

ggml-cuda.cu

+1-1
Original file line numberDiff line numberDiff line change
@@ -72,7 +72,7 @@
7272
#include "ggml.h"
7373

7474
#define MIN_CC_DP4A 610 // minimum compute capability for __dp4a, an intrinsic for byte-wise dot products
75-
#define CC_TURING 700
75+
#define CC_TURING 10000
7676

7777
#if defined(GGML_USE_HIPBLAS)
7878
#define __CUDA_ARCH__ 1300

0 commit comments

Comments
 (0)