Distribute the scheduler #165

jpsamaroo · 2020-11-17T17:26:56Z

We're already pushing some extra work onto the worker nodes (mainly argument fetching and processor selection/load balancing), and it would be beneficial for large DAGs to move more work onto each worker. The main blocker is providing a way to split the DAG into multiple domains, where each domain is handled by a given thread on a given worker. With efficient Thunk serialization, we can then send a subgraph to each worker and let them process their own DAG without conflicts. We'll need to add a mechanism by which thunks automatically wait on their input thunks to complete before they attempt to download the output data; if possible, we can also have workers broadcast and shard their Chunks onto dependent workers as soon as the data is made available.
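As a rough illustration of the idea above (not Dagger's implementation), the sketch below splits a small DAG into per-worker domains and has each thunk wait on its inputs before running. Python threads stand in for workers, and every name here (`partition`, `run_distributed`, the round-robin assignment) is a hypothetical choice made for the example. A real scheduler would pick domains based on data locality and load.

```python
import threading
from concurrent.futures import Future
from graphlib import TopologicalSorter  # Python 3.9+


def partition(order, n_workers):
    """Assign each thunk to a worker domain (hypothetical round-robin policy)."""
    return {name: i % n_workers for i, name in enumerate(order)}


def run_distributed(dag, funcs, n_workers=2):
    """Run `dag` (thunk name -> list of input thunk names) across worker threads.

    Every thunk must appear as a key in `dag`. Each worker only executes the
    thunks in its own domain; inputs owned by other workers are awaited
    through a shared Future per thunk.
    """
    order = list(TopologicalSorter(dag).static_order())
    owner = partition(order, n_workers)
    results = {name: Future() for name in dag}

    def worker(wid):
        # Walk the worker's own subgraph in topological order, so the
        # earliest unfinished thunk overall always has its inputs ready.
        for name in order:
            if owner[name] != wid:
                continue
            try:
                # Block until every input thunk has published its output.
                args = [results[dep].result() for dep in dag[name]]
                results[name].set_result(funcs[name](*args))
            except Exception as exc:
                # Propagate failures so dependents do not wait forever.
                results[name].set_exception(exc)

    threads = [threading.Thread(target=worker, args=(w,)) for w in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return {name: fut.result() for name, fut in results.items()}


# Example: c depends on a and b, d depends on c.
dag = {"a": [], "b": [], "c": ["a", "b"], "d": ["c"]}
funcs = {
    "a": lambda: 2,
    "b": lambda: 3,
    "c": lambda x, y: x + y,
    "d": lambda z: z * 10,
}
out = run_distributed(dag, funcs, n_workers=2)
```

Because each worker walks its domain in one shared topological order, a thunk never waits on an input that is stuck behind it. That is why this sketch avoids deadlock without a central coordinator.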

jpsamaroo added the scheduler, performance, and processors labels Nov 17, 2020

jpsamaroo mentioned this issue Jan 15, 2021

Remove scheduler plugin machinery, make Sch programmable #166

Open

jpsamaroo changed the title ~~Multithread and distribute the scheduler~~ Distribute the scheduler Dec 5, 2021

jpsamaroo mentioned this issue Dec 5, 2021

Distribute the scheduler! #311

Closed

6 tasks

jpsamaroo linked a pull request Dec 5, 2021 that will close this issue

Distribute the scheduler! #311

Closed

6 tasks

jpsamaroo mentioned this issue Dec 14, 2021

Don't clobber the scheduler #310

Merged

6 tasks
