Hpc ugent modular #15
|
Hi @SanderBorgmans, I am sorry for the bug. I have now added at least a very basic test to check that it is working, but testing the functionality with the queuing system is always a bit difficult. I just have one more question: is there any reason why the SLURM adapter does not work for you? (I modified the tests to use slurm in the modular adapter; maybe you can take a look at the current master. It still needs your changes to use subprocess with shell=True, but it should not be necessary to have another adapter.) |
|
@jan-janssen I thought an extra wrapper was required because of the |
|
@SanderBorgmans I would like to minimise the duplicated code, that is why I am a bit reluctant to add a second parser to communicate with slurm. In the current master branch I added the line https://github.com/pyiron/pysqa/blob/master/pysqa/modular.py#L10 to use slurm in the modular adapter and I added the option |
|
@jan-janssen This is fine for me, I will then just overwrite the slurm wrapper by my own wrapper and everything should be ok. |
|
@SanderBorgmans I still have trouble understanding why we need two different adapters for slurm. But if it is necessary, that is fine for me. Maybe you could at least derive it from the |
|
@jan-janssen That's a good idea. The different adapter is necessary because of https://github.com/pyiron/pysqa/blob/master/pysqa/wrapper/slurm.py#L39 and https://github.com/pyiron/pysqa/blob/master/pysqa/wrapper/slurm.py#L43 . When submitting a job with sbatch, the returned output looks like jobid;cluster, so that an extra split is necessary. Furthermore, the squeue command prints an extra line "CLUSTER: clustername", which is not dropped when specifying I also got errors from a possibly empty |
Ok, then it makes sense to have a different wrapper.
I have to admit our cluster still runs SGE so @dnoeger implemented the slurm adapter for their cluster. But it definitely makes sense to check the length of the output. Can you modify the pull request accordingly, then we can merge it? |
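For readers following along, the parsing differences described above can be sketched roughly as follows. This is only an illustration, not the code in pysqa: the function names `parse_sbatch_output` and `parse_squeue_output` are hypothetical, and the `|` column delimiter for the queue listing is an assumption. It shows the extra split on `;`, dropping the "CLUSTER: clustername" line, and checking for empty output before indexing into it.

```python
from typing import List, Optional, Tuple


def parse_sbatch_output(output: str) -> Optional[Tuple[int, Optional[str]]]:
    """Parse submit output of the form 'jobid;cluster' (hypothetical helper).

    Returns None when the output is empty, so callers can handle a failed
    or silent submission instead of hitting an IndexError or ValueError.
    """
    text = output.strip()
    if not text:
        return None
    fields = text.split(";")  # the extra split needed for 'jobid;cluster'
    job_id = int(fields[0])
    cluster = fields[1] if len(fields) > 1 and fields[1] else None
    return job_id, cluster


def parse_squeue_output(output: str, delimiter: str = "|") -> List[List[str]]:
    """Split a queue listing into rows of columns (hypothetical helper).

    The delimiter is an assumption; it must match whatever format string
    the queue command was called with. Empty lines and the extra
    'CLUSTER: clustername' line are skipped.
    """
    rows = []
    for line in output.splitlines():
        line = line.strip()
        if not line or line.startswith("CLUSTER:"):
            continue
        rows.append(line.split(delimiter))
    return rows
```

With an empty queue, `parse_squeue_output` returns an empty list, which is the length check mentioned above.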
|
Does this work for you? #19 |
|
@jan-janssen Perfect, I was just about to push, but I had some git issues. |
|
Nice, then I guess we can close this pull request and I am going to push a new release to |
@jan-janssen, thank you for the modular implementation. I had to fiddle around with the queues.yaml file before I got everything working. But if I am correct, an extra cluster property is added to the queues dict and an extra list object with all the clusters should be added. I fixed some minor errors and everything should work now. The commands are now parsed as strings instead of lists, since the swap command requires the shell flag, which meant that the format flag of the queue command had to be adapted.
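As a rough illustration of why the commands become strings: with `shell=True` on POSIX systems, `subprocess` hands a string to the shell as one command line, so chained commands separated by `;` work, whereas with a list only the first item is treated as the command. The helper name `run_shell_command` below is hypothetical and not part of pysqa; the `echo` commands stand in for a swap command followed by a queue command.

```python
import subprocess


def run_shell_command(command: str) -> str:
    """Run a full command line through the shell and return its stdout.

    Hypothetical helper. The command is passed as a single string because
    shell=True lets the shell interpret separators such as ';'.
    """
    return subprocess.check_output(
        command,
        shell=True,
        universal_newlines=True,  # decode bytes to str
    )


# Two chained commands, standing in for "swap cluster; query queue".
output = run_shell_command("echo first; echo second")
```

Since the whole string goes to the shell, any format string for the queue command has to be quoted inside that string so the shell does not interpret its special characters.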