-
Notifications
You must be signed in to change notification settings - Fork 1.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about version 0.3.5 build with gcc 9.2.0 #2579
Comments
You did not mention the platform, but there were several unsafe assumptions in particular in x86_64 assembly code that led to miscompilation with recent, more aggressively optimizing compilers. Is there any specific reason why you prefer staying with 0.3.5 from over a year ago rather than updating to the current version ? (IIRC the assembly bugs were fixed in 0.3.6, more |
Hi Martin, |
You could compile with DYNAMIC_ARCH=1 and TARGET set to the hardware of the weakest node, that way the AVX512 kernels would get used on supporting nodes while others still work with whatever is appropriate for them. One problem could be that it might make results less deterministic if they experience different rounding effects from FMA depending on which collection of nodes they get scheduled on. |
Thank you for so quick feedback! Let's close this ticket then. Bon weekend! |
On Centos7 with gcc 9.2.0 built from sources, there are two tests from OpenBlas ctests fail while everything runs well using gcc 8.3.0
Also, we don't see anything strange in configure and build logs.
Is it something known and expected?
3/25 Test #3: sblas2 ...........................***Failed 0.41 sec Start 4: sblas3 4/25 Test #4: sblas3 ........................... Passed 0.44 sec Start 5: dblas1 5/25 Test #5: dblas1 ........................... Passed 0.38 sec Start 6: dblas2 6/25 Test #6: dblas2 ...........................***Failed 0.41 sec Start 7: dblas3
Kind Regards,
Dmitri
The text was updated successfully, but these errors were encountered: