ci: add linux binaries to release build #1505

Green-Sky · 2023-05-17T16:50:39Z

adds ubuntu20.04 binaries to the releases.
and also cublas linux builds.

I changed the path for where the dynamic library is put. It was in the cmake build directory before, now its next to the executables (down into bin/).

I always build shared, with relative rpath (so no export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:. for libllama.so)
Distributing the lib makes the life for wrappers (eg python libs) easier.

not in release yet:

avx512 (i can't test this)
openblas (system lib, maybe ship?)
clbast (system lib, maybe ship?)

example release: https://github.com/Green-Sky/llama.cpp/releases/tag/ci_cublas_linux-e344540

CMakeLists.txt

SlyEcho · 2023-05-18T14:10:22Z

I think we should just add GNUInstallDirs in the CMakeLists.txt, that way distributors can configure the paths they want to install the files. cmake --install will also do strip and RPATH config, also.

Green-Sky · 2023-05-18T17:21:39Z

I think we should just add GNUInstallDirs in the CMakeLists.txt, that way distributors can configure the paths they want to install the files. cmake --install will also do strip and RPATH config, also.

will look into that tomorrow

Green-Sky · 2023-06-09T13:36:15Z

update: The cuda toolkit install now nukes the github action runners. they use too much disk space.

SlyEcho · 2023-06-09T14:10:26Z

Maybe we can keep just one CUDA version?

Green-Sky · 2023-06-09T14:31:25Z

Maybe we can keep just one CUDA version?

I think 1 runner does 1 job at a time. So I don't think that would make a difference.
Going to play around with selective installs again <.<

also, I once before saw a gh workflow where some non essentials where deleted, to make some space...

SlyEcho · 2023-06-13T14:15:34Z

OK, managed to download everything and even run main but I get this error:

CUDA error 222 at /home/runner/work/llama.cpp/llama.cpp/ggml-cuda.cu:1244: the provided PTX was compiled with an unsupported toolchain.

This is with release fafc8ae and the CUDA 12 version. The machine also had 12 and 2080 Ti.

Green-Sky · 2023-06-13T20:01:46Z

@SlyEcho btw, I switched to the "networked" installer, which is just setting up an apt repo ...
but that works for us.

OK, managed to download everything and even run main but I get this error:
CUDA error 222 at /home/runner/work/llama.cpp/llama.cpp/ggml-cuda.cu:1244: the provided PTX was compiled with an unsupported toolchain.
This is with release fafc8ae and the CUDA 12 version. The machine also had 12 and 2080 Ti.

this looks very weird, no idea what is happening here. since I don't use nvprune, I thought it just works. I can run the cuda11.7 just fine on my system. My driver is too old for 12...

SlyEcho · 2023-07-08T10:10:27Z

Isn’t there an image with CUDA already installed?

I plan to go with that approach for ROCm.

Green-Sky · 2023-07-08T10:14:33Z

Image, hmm. installing cuda now only takes as long as the compile ~1min. so i dont really see the point of using docker (im assuming thats what you mean with image)

Green-Sky · 2023-07-08T10:15:42Z

Jimver/cuda-toolkit#249

this made installing not the full toolkit viable (without me manually installing the apt sources 😄 )

SlyEcho · 2023-07-08T10:20:29Z

Yeah, I meant Docker. AMD publishes their images with everything installed already. Although I don’t know if it’s possible to redistribute some of those runtime components

Green-Sky · 2023-07-08T10:29:12Z

It would be cool for windows build, those take for ever. but for linux builds is now <50% of total build time.

there is still the problem of distributing the binaries NOT every release, those uploads now take up a significant amount of time (comparatively)

Green-Sky marked this pull request as ready for review May 17, 2023 20:40

SlyEcho reviewed May 18, 2023

View reviewed changes

CMakeLists.txt Show resolved Hide resolved

Green-Sky force-pushed the ci_cublas_linux branch from 99b7d15 to 3f008ca Compare May 20, 2023 16:35

Green-Sky force-pushed the ci_cublas_linux branch from 3f008ca to a150e6b Compare June 9, 2023 12:40

Green-Sky force-pushed the ci_cublas_linux branch 8 times, most recently from dad9e66 to fafc8ae Compare June 12, 2023 19:59

Green-Sky force-pushed the ci_cublas_linux branch from fafc8ae to d9f3846 Compare June 13, 2023 19:47

Green-Sky force-pushed the ci_cublas_linux branch 2 times, most recently from a6c5f59 to e344540 Compare July 8, 2023 10:02

Green-Sky force-pushed the ci_cublas_linux branch 3 times, most recently from b41deaf to ed418de Compare July 30, 2023 14:09

Green-Sky force-pushed the ci_cublas_linux branch from ed418de to fc60a27 Compare August 11, 2023 18:38

Green-Sky force-pushed the ci_cublas_linux branch 4 times, most recently from c5d7b68 to 20f7f4c Compare August 25, 2023 22:33

Green-Sky added 2 commits August 28, 2023 14:42

ci: add linux binaries to release build

0e1730a

temporarily disable broken 512 build

cec628e

Green-Sky force-pushed the ci_cublas_linux branch from 5562e3e to cec628e Compare August 28, 2023 12:42

Bearsaerker mentioned this pull request Mar 12, 2025

Eval bug: Gemma 3 extremly slow prompt processing when using quantized kv cache. #12352

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ci: add linux binaries to release build #1505

ci: add linux binaries to release build #1505

Green-Sky commented May 17, 2023 •

edited

Loading

SlyEcho commented May 18, 2023

Green-Sky commented May 18, 2023

Green-Sky commented Jun 9, 2023

SlyEcho commented Jun 9, 2023

Green-Sky commented Jun 9, 2023

SlyEcho commented Jun 13, 2023 •

edited

Loading

Green-Sky commented Jun 13, 2023

SlyEcho commented Jul 8, 2023

Green-Sky commented Jul 8, 2023

Green-Sky commented Jul 8, 2023

SlyEcho commented Jul 8, 2023

Green-Sky commented Jul 8, 2023

ci: add linux binaries to release build #1505

Are you sure you want to change the base?

ci: add linux binaries to release build #1505

Conversation

Green-Sky commented May 17, 2023 • edited Loading

SlyEcho commented May 18, 2023

Green-Sky commented May 18, 2023

Green-Sky commented Jun 9, 2023

SlyEcho commented Jun 9, 2023

Green-Sky commented Jun 9, 2023

SlyEcho commented Jun 13, 2023 • edited Loading

Green-Sky commented Jun 13, 2023

SlyEcho commented Jul 8, 2023

Green-Sky commented Jul 8, 2023

Green-Sky commented Jul 8, 2023

SlyEcho commented Jul 8, 2023

Green-Sky commented Jul 8, 2023

Green-Sky commented May 17, 2023 •

edited

Loading

SlyEcho commented Jun 13, 2023 •

edited

Loading