executorlib.task_scheduler.base.TaskSchedulerBase

executorlib.task_scheduler.base.TaskSchedulerBase#

class executorlib.task_scheduler.base.TaskSchedulerBase(max_cores: int | None = None, validator: ~typing.Callable = <function validate_resource_dict>)[source]#

Bases: Executor

Base class for the executor.

Parameters:: max_cores (int) – defines the number cores which can be used in parallel

__init__(max_cores: int | None = None, validator: ~typing.Callable = <function validate_resource_dict>)[source]#

Initialize the TaskSchedulerBase.

Parameters:

max_cores (int, optional) – Maximum number of cores available to the scheduler. Tasks requesting more cores than this will be rejected. Defaults to None (unlimited).
validator (Callable) – Function used to validate per-task resource dicts before submission. Defaults to the no-op validate_resource_dict.

Methods

`__init__`([max_cores, validator])	Initialize the TaskSchedulerBase.
`batched`(iterable, n)	Batch futures from the iterable into tuples of length n.
`map`(fn, *iterables[, timeout, chunksize])	Returns an iterator equivalent to map(fn, iter).
`shutdown`([wait, cancel_futures])	Clean-up the resources associated with the Executor.
`submit`(fn, /, *args[, resource_dict])	Submits a callable to be executed with the given arguments.

Attributes

`future_queue`	Get the future queue.
`info`	Get the information about the executor.
`max_workers`	Return the configured number of parallel workers, or None if unconstrained.

batched(iterable: list[Future], n: int) → list[Future][source]#

Batch futures from the iterable into tuples of length n. The last batch may be shorter than n.

Parameters:

iterable (list) – list of future objects to batch based on which future objects finish first
n (int) – badge size

Returns:

list of future objects one for each batch

Return type:

list[Future]

property future_queue: Queue | None#

Get the future queue.

Returns:: The future queue.
Return type:: queue.Queue

property info: dict | None#

Get the information about the executor.

Returns:: Information about the executor.
Return type:: Optional[dict]

map(fn: Callable, *iterables, timeout: float | None = None, chunksize: int = 1)[source]#

Returns an iterator equivalent to map(fn, iter).

Parameters:

fn – A callable that will take as many arguments as there are passed iterables.
timeout – The maximum number of seconds to wait. If None, then there is no limit on the wait time.
chunksize – The size of the chunks the iterable will be broken into before being passed to a child process. This argument is only used by ProcessPoolExecutor; it is ignored by ThreadPoolExecutor.

Returns:

map(func, *iterables) but the calls may be evaluated out-of-order.

Return type:

An iterator equivalent to

Raises:

TimeoutError – If the entire result iterator could not be generated before the given timeout.
Exception – If fn(*args) raises for any values.

property max_workers: int | None#

Return the configured number of parallel workers, or None if unconstrained.

Returns:: The max_workers value stored in process kwargs, or None.
Return type:: Optional[int]

shutdown(wait: bool = True, *, cancel_futures: bool = False)[source]#

Clean-up the resources associated with the Executor.

It is safe to call this method several times. Otherwise, no other methods can be called after this one.

Parameters:

wait (bool) – If True then shutdown will not return until all running futures have finished executing and the resources used by the parallel_executors have been reclaimed.
cancel_futures (bool) – If True then shutdown will cancel all pending futures. Futures that are completed or running will not be cancelled.

submit(fn: Callable, /, *args, resource_dict: dict | None = None, **kwargs) → Future[source]#

Submits a callable to be executed with the given arguments.

Schedules the callable to be executed as fn(*args, **kwargs) and returns a Future instance representing the execution of the callable.

Parameters:

fn (callable) – function to submit for execution
args – arguments for the submitted function
kwargs – keyword arguments for the submitted function
resource_dict (dict) –
A dictionary of resources required by the task. With the following keys: - cores (int): number of MPI cores to be used for each function call - threads_per_core (int): number of OpenMP threads to be used for each function call - gpus_per_core (int): number of GPUs per worker - defaults to 0 - cwd (str/None): current working directory where the parallel python task is executed - openmpi_oversubscribe (bool): adds the –oversubscribe command line flag (OpenMPI and

SLURM only) - default False
- slurm_cmd_args (list): Additional command line arguments for the srun call (SLURM only)
- error_log_file (str): Name of the error log file to use for storing exceptions raised
  by the Python functions submitted to the Executor.

Returns:

A Future representing the given call.

Return type:

Future

executorlib.task_scheduler.base.TaskSchedulerBase

Contents

executorlib.task_scheduler.base.TaskSchedulerBase#