Open
Description
Hi,
We are trying to run HPCExecutor.
Here is our configuration:
[executor]
executor = "climate_ref.executor.HPCExecutor"
[executor.config]
address = "eth0"
address_probe_timeout = 30
scheduler = "slurm"
account = "normal" # our account name
username = "my_username"
partition = "cpu" # or "debug" for testing
#qos = "debug" # quality of service
req_nodes = 1 # number of nodes
walltime = "00:30:00" # time limit per job
cores_per_worker = 1 # cores per worker
max_workers_per_node = 4 # workers per node
scheduler_options = ""
worker_init = "export PARSL_HTEX_ADVERTISED_ADDRESS=10.138.80.53; export PARSL_HTEX_ADDRESS=10.138.80.53;"
but we are getting a connection error:
0: raise ConnectionError(f"No viable ZMQ url from: {addys}")
0: ConnectionError: No viable ZMQ url from: tcp://127.0.0.1:54201, tcp://10.138.80.53:54201, tcp://151.170.240.197:54201
srun: error: sla-cpu-r-03: task 0: Exited with exit code 1
In the settings above we tried a workaround by adding scheduler_options, but it didn't work. Is there anything else we need in the configuration?
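To help narrow this down, here is a minimal sketch of a reachability check that could be run on a compute node (for example inside an srun session) while the job is still starting up. It assumes the error means the worker could not open a connection to any of the listed addresses. The helpers parse_tcp_url, probe, and report are hypothetical names for illustration only; they are not part of climate_ref or Parsl. The port in the traceback changes on every run, so the URLs have to be taken from the current run.

```python
import socket
from urllib.parse import urlparse


def parse_tcp_url(url):
    """Split a 'tcp://host:port' string into a (host, port) tuple."""
    parsed = urlparse(url)
    if parsed.scheme != "tcp" or parsed.hostname is None or parsed.port is None:
        raise ValueError(f"not a tcp://host:port url: {url!r}")
    return parsed.hostname, parsed.port


def probe(url, timeout=3.0):
    """Return True if a plain TCP connection to url succeeds within timeout."""
    host, port = parse_tcp_url(url)
    try:
        # Only checks that something accepts TCP connections on host:port;
        # it does not speak ZMQ, so success is necessary but not sufficient.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def report(urls, timeout=3.0):
    """Map each candidate url to True (reachable) or False (unreachable)."""
    return {url: probe(url, timeout=timeout) for url in urls}
```

Calling report() with the three tcp:// URLs from the error message, from the node where the worker failed, would show whether any of them accepts a connection at all. If none does, the problem is more likely routing or a firewall between the compute node and the submitting host than the executor settings themselves.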