-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OpenBLAS 0.3.27 (same applies to 0.3.26) compiled with Intel oneAPI (latest) fails the testing - dblas3 and zblas1 tests are problematic #4739
Comments
This is probably related to (or even a duplicate of) #4713 - newer oneAPI versions default to fp-model=fast which apparently makes some unsafe assumptions when optimizing. |
I managed to resolve the problem. After examining the content of the cmake folder, I realized that OpenBLAS keeps a compiler-explicit configuration model and does not examine the compiler to assume the correct set of capabilities and flags. Instead, it expects the user to pass the type of the compiler to the cmake, which in turn expands the corresponding macros. Therefore, the correct way for configuring the compilation process should look similar to:
Here one should also add |
I'm not entirely sure about that, C_COMPILER/F_COMPILER should normally be autodetected, and only the CMAKE_C_COMPILER and CMAKE_Fortran_COMPILER flags need to be given. Maybe there is something missing in the autodetection scripts. |
By having the macros definitions gone over and by examining the generated Makefiles, I got that
|
#4718 should be setting |
Passing
Given bellow is the reason zblas1 fails:
|
There is an issue with the testing of (at least) the last two release of OpenBLAS whenever Intel oneAPI compilers on Linux (RHEL 9.2 and RHEL 8.4) are employed for compiling the code. Shortly, the tests dblas3 and zblast1 fails:
Bellow are the command lines executed to configure and build the source code, and then run the tests:
Note that the error is not related to the adopted CPU target. We were able to reproduce the error on AMD EPYC CPU that corresponds to ZEN target. Changing the compilers to GNU (gcc + gfortran) makes the errors gone, but the produced OpenBLAS libraries perform slower. We are not sure if that is a runtime issue. On the other side, setting ulimit -s unlimited does not solve the issue.
Attached is the LastTest.log file, which may shed a light upon the issue.
LastTest.log
Anybody experienced suimilar issue?
The text was updated successfully, but these errors were encountered: