On the grid it is important to know in advance the amount of cores used in each job. In particular we found it is significantly more efficient to submit jobs which use just one core (due to memory assignment).
In the current legacy framework there is a "sequential" mode which makes sure that CORSIKA and sim_telarray do not run simultaneously, but rather wait for each other to "pause". The same happens with multiple instances of sim_telarray.
The same feature or even more control should be included in simtools. Not sure if this feature is already requested (I think it does not exist). Apologies if I missed it.