We have PBS cluster, but we also have several specialized nodes which are not part of a cluster and we'd like to allow Dask tasks to be ran on either the cluster's compute nodes, or the specialized nodes transparently based on resources.
For example, we have some routine tasks that don't really require anything in order to run, they just need to be executed periodically. We also have computationally expensive tasks that COULD run anywhere, but should really be ran through the cluster. Finally, we have tasks that require specialized hardware (such as GPUs), which is only attached to certain machines.
This looks like it would require a new type of scheduler, but before we start down that rabbit hole, is there something we're missing that makes this work already?
We have PBS cluster, but we also have several specialized nodes which are not part of a cluster and we'd like to allow Dask tasks to be ran on either the cluster's compute nodes, or the specialized nodes transparently based on resources.
For example, we have some routine tasks that don't really require anything in order to run, they just need to be executed periodically. We also have computationally expensive tasks that COULD run anywhere, but should really be ran through the cluster. Finally, we have tasks that require specialized hardware (such as GPUs), which is only attached to certain machines.
This looks like it would require a new type of scheduler, but before we start down that rabbit hole, is there something we're missing that makes this work already?