Activating environment on remote workers on a cluster fails on v1.7 (works on 1.6) #42405
Comments
Should be fixed on #42255.
Is it not fixed on master?
If Pkg is bumped.
Are you sure that master does not already include the fix? According to #42407, Julia master is only behind Pkg master by one commit, and that commit doesn't seem to fix this issue? Was the fix in question merged to the master branch of Pkg?
The OP says that the error is happening on Julia nightly, which is why I'm concerned.
Oh yeah, that was only reverted for 1.6 / 1.7: JuliaLang/Pkg.jl#2731.
Potential fix with tests in JuliaLang/Pkg.jl#2732
I can confirm that this is now fixed on
There's a proper robust fix in JuliaLang/Pkg.jl#2793 upcoming for 1.8.
This works on Julia v1.6.3 but fails on v1.7.0-rc1 and nightly on a Slurm cluster (using ClusterManagers v0.4.2).
The main Julia script, named slurmtrial.jl:
The jobscript that I use to submit this (change the Julia path and the output file names to run the same code on different Julia versions):
I am using 2 nodes with 28 cores each, so a total of 56 workers. The error sometimes doesn't happen if I only use a few cores on one node (e.g. 2 cores).
output on v1.6 (this is what is expected):
output on v1.7 & v1.8:
Note that the number of exceptions raised here is 40 and not 56 (this number is variable).
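For readers who want to try the general pattern named in the title, here is a minimal sketch of activating an environment on every worker. It is not the script from this report: local workers from Distributed's addprocs stand in for Slurm-launched ones (on a cluster, ClusterManagers' addprocs_slurm would presumably take that place), and project_dir is a hypothetical temporary directory rather than a real project.

```julia
using Distributed

# Assumption for this sketch: two local workers instead of Slurm-launched ones.
addprocs(2)

# Hypothetical project directory; a real run would point at an existing project.
project_dir = mktempdir()

# Load Pkg on every process, then activate the same environment everywhere.
@everywhere import Pkg
@everywhere Pkg.activate($project_dir)

# Ask each worker which project file it now considers active.
active = [remotecall_fetch(Base.active_project, w) for w in workers()]
foreach(println, active)
```

Each worker should report a Project.toml path inside project_dir. With only two local workers this may not reproduce the failure, which is consistent with the observation above that the error sometimes does not occur with a few cores on one node.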