-
-
Notifications
You must be signed in to change notification settings - Fork 73
Adding processes and using eager API produces warnings about workers dying #536
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Comments
Could you try on master? I believe this was fixed in #532. |
Thank you. I tried master, the error is gone but the warnings are still there |
Yeah I think the warnings will have to stay, unless we bring back |
If those warnings are happening during a clean Julia shutdown, then we need to improve our fault tolerance logic to properly detect a clean shutdown and thus not emit these warnings, since they're quite scary to see. @m-fila can you confirm that these occur during a Julia exit? |
Yes, I confirm |
Ok, then re-opening this issue since we need to properly silence these warnings. |
Yes, they are gone with #537. Thanks! The warnings still appear tho if the workers are removed |
Yeah, that's a separate issue, because in this case Dagger has no idea that it was intentional for the workers to exit (Distributed.jl doesn't communicate this distinction to Dagger). You would need to call |
…el/Dagger.jl#536 Co-authored-by: Mateusz Jakub Fila <[email protected]>
…el/Dagger.jl#536 Co-authored-by: Mateusz Jakub Fila <[email protected]>
…11) * Added examples, docs + some fixes * Added examples results, updated readme * Added examples for the render_logs() function * Fixes to the main program; added example * Small fixes * Silence warnings about the workers dying and rescheduling JuliaParallel/Dagger.jl#536 Co-authored-by: Mateusz Jakub Fila <[email protected]> * wait for notify tasks --------- Co-authored-by: SmalRat <[email protected]>
Adding extra processes and scheduling with eager API seems to be producing error and warnings about reschduling do to workers dying. For example, snippet taken from README:
Gives the following error:
The error sometimes is omitted but warnings about workers dying are present.
If lazy API is used then there are no warnings or errors
The warnings seems to be harmless since they appear only while finishing the job
versioninfo:
Dagger: 0.18.11
I couldn't find any duplicates
The text was updated successfully, but these errors were encountered: