StarCluster - Mailing List Archive

Re: master node not in qstat

From: David Erickson <no email>
Date: Mon, 19 Mar 2012 12:08:27 -0700

On 3/9/2012 3:04 PM, Justin Riley wrote:
> -----BEGIN PGP SIGNED MESSAGE-----
> Hash: SHA1
>
> Hi Adam,
>
>> Setup a cluster today (0.93.2) and suddenly noticed that the
>> 'master' node was not being reported in a "qstat -f" command and
>> was not accepting run jobs from the queue . . . i.e., with 12 nodes
>> x 8 cpus each (96), when 96 jobs are submitted, only 88 run (nodes
>> 1-11) while 8 remain in the queue waiting.
>>
>> I tried restarting the cluster using the 'sge' plugin to manually
>> ensure that master_is_exec_host was set to 'True'. But the result
>> was the same: 88 running - 8 waiting.
> Thanks for testing and reporting. I can confirm this bug and have
> created an issue on github[1]. I should have a hotfix release 0.93.3
> out tonight specifically to fix this.

Hi, I just wanted to follow up on this. I ran an easy_install update today
and still got 0.93.2. Is there any ETA on 0.93.3 with this fix?

Thanks,
David
Received on Mon Mar 19 2012 - 15:09:35 EDT
