HPCExecutor configurations #572

@zmaalick

Hi,

We are trying to run the HPCExecutor.

Here is our configuration:

[executor]
executor = "climate_ref.executor.HPCExecutor"

[executor.config]
address = "eth0"
address_probe_timeout = 30
scheduler = "slurm"
account = "normal"              # our account name
username = "my_username"
partition = "cpu"               # or "debug" for testing
#qos = "debug"                  # quality of service
req_nodes = 1                   # number of nodes
walltime = "00:30:00"           # time limit per job
cores_per_worker = 1            # cores per worker
max_workers_per_node = 4        # workers per node
scheduler_options = ""
worker_init = "export PARSL_HTEX_ADVERTISED_ADDRESS=10.138.80.53; export PARSL_HTEX_ADDRESS=10.138.80.53;"

However, we are getting a connection error:

0:     raise ConnectionError(f"No viable ZMQ url from: {addys}")
0: ConnectionError: No viable ZMQ url from: tcp://127.0.0.1:54201, tcp://10.138.80.53:54201, tcp://151.170.240.197:54201
srun: error: sla-cpu-r-03: task 0: Exited with exit code 1

In the settings above we tried a workaround by adding scheduler_options, but it did not work. Is there anything else we need in the configuration?
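
To help narrow this down, here is a rough sketch of a check that could be run on the compute node (for example inside an srun session) to see whether any of the addresses from the error accept a plain TCP connection. It uses only the Python standard library; can_connect and report are hypothetical helper names, not part of climate_ref or Parsl. It also assumes the port from the error message is only open while the executor is still running, so the URLs need to be taken from a live run.

```python
import socket
from urllib.parse import urlparse


def can_connect(url: str, timeout: float = 3.0) -> bool:
    """Return True if a plain TCP connection to a tcp://host:port URL succeeds."""
    parsed = urlparse(url)
    try:
        # create_connection raises OSError on refusal, timeout, or no route.
        with socket.create_connection((parsed.hostname, parsed.port), timeout=timeout):
            return True
    except OSError:
        return False


def report(urls: list[str], timeout: float = 3.0) -> dict[str, bool]:
    """Map each candidate URL to whether it was reachable from this host."""
    return {url: can_connect(url, timeout=timeout) for url in urls}
```

Calling report(["tcp://127.0.0.1:54201", "tcp://10.138.80.53:54201", "tcp://151.170.240.197:54201"]) on the compute node would show which, if any, of the three addresses from the error is reachable there. If none is, the problem is more likely routing or a firewall between the compute node and the submitting host than the executor settings themselves.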
