# Build with AMD AOCC + AOCL (CPU only) #5005
Hi, on my Debian 11 system (AMD EPYC 75F3 32-core processor, 64 GB RAM) I've just installed AMD AOCC and AOCL:

https://www.amd.com/en/developer/aocc.html
https://www.amd.com/en/developer/aocl.html

How can I build llama.cpp with them, and which arguments give the best optimization (BLAS, LAPACK, ...)?

Any suggestions are super appreciated!

## Comments
I haven't tried AOCC, but AOCL works fine and produces a small speedup versus no BLAS library, though I haven't compared it to other BLAS libraries. You can compile like this, more or less (based on my bash history from November):

```bash
# Assuming you downloaded AOCL from https://www.amd.com/en/developer/aocl.html and put it in ~
cd
tar xf aocl-linux-gcc-4.1.0.tar.gz
cd aocl-linux-gcc-4.1.0
./install.sh
cd
git clone https://github.com/ggerganov/llama.cpp.git
cd llama.cpp
mkdir build && cd build
source ~/aocl/4.1.0/gcc/amd-libs.cfg
cmake .. -DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=AOCL -DBLAS_INCLUDE_DIRS=~/aocl/4.1.0/gcc/include
make -j
```

This is assuming Linux.
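A quick way to confirm the BLAS backend was actually picked up (a sketch; the model path is a placeholder and the binary location depends on your build):

```bash
# From the build directory: llama.cpp prints a system_info line at startup;
# "BLAS = 1" indicates a BLAS backend (here AOCL) was linked in.
./bin/main -m /path/to/model.gguf -p "hello" -n 8 2>&1 | grep "BLAS"
```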
---

Thank you, Nigel! Now I'll try to complete the commands with AOCC's clang++ and Zen 3 specific optimizations; if everything works, I'll share the results here.
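For reference, a minimal sketch of what such a build might look like, assuming AOCC was installed with its setenv_AOCC.sh script and that AOCL was built for AOCC under ~/aocl/4.1.0/aocc (the paths, versions, and AOCC-flavoured AOCL directory are assumptions, not tested):

```bash
# Hypothetical AOCC (clang/clang++) build targeting Zen 3 (EPYC 75F3)
source ~/aocc-compiler-4.1.0/setenv_AOCC.sh   # sets up PATH/LD_LIBRARY_PATH for AOCC
source ~/aocl/4.1.0/aocc/amd-libs.cfg         # AOCL libraries built with AOCC
cd ~/llama.cpp && mkdir -p build && cd build
cmake .. -DCMAKE_C_COMPILER=clang -DCMAKE_CXX_COMPILER=clang++ \
         -DCMAKE_C_FLAGS="-march=znver3" -DCMAKE_CXX_FLAGS="-march=znver3" \
         -DLLAMA_BLAS=ON -DLLAMA_BLAS_VENDOR=AOCL \
         -DBLAS_INCLUDE_DIRS=~/aocl/4.1.0/aocc/include
make -j
```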
---

Exactly. If I set ENV cmake_cxx_flags="-march=znver2" in the Dockerfile, the logs show that make later appends -march=native, which overrides my -march=znver2 directive. The end result is a SIGILL on the cloud Linux host. How do you cross-compile? How do you adapt the Dockerfile so that gcc compiles for the target AMD EPYC processor rather than for my Intel i7 CPU?
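One possible workaround, sketched under the assumption that the image builds llama.cpp with CMake and that the LLAMA_NATIVE option is what injects -march=native (worth verifying against the llama.cpp revision you build):

```bash
# Hypothetical Docker build step: disable the native-arch flag and pin Zen 2
cmake .. -DLLAMA_NATIVE=OFF \
         -DCMAKE_C_FLAGS="-march=znver2" -DCMAKE_CXX_FLAGS="-march=znver2"
make -j
```

With LLAMA_NATIVE off, CMake should no longer add -march=native, so the znver2 flags survive and the binary targets the EPYC host rather than being tuned for the build machine.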
---

This issue is stale because it has been open for 30 days with no activity.

This issue was closed because it has been inactive for 14 days since being marked as stale.