StarCluster - Mailing List Archive

Spot instances not removed from the queue when terminated

From: Mircea Cimpoi <no email>
Date: Tue, 24 Nov 2015 12:01:19 +0000

Hi All,

We’ve started using StarCluster to run experiments on AWS and have
encountered some problems with the load balancer when using spot instances.

Specifically, after the spot instances are terminated, the nodes are not
removed from the execution host list (qconf -sel) and still appear in qhost
output. The jobs that were on those nodes are also still reported as running,
which seems to stop the load balancer from adding new nodes as we would
expect.
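
To illustrate the symptom, here is a minimal sketch of how the stale nodes could be listed from qhost output. It assumes that qhost prints "-" in the LOAD column for hosts whose execution daemon no longer reports in. The function name stale_hosts and the sample text are hypothetical, made up for illustration and not taken from a real cluster.

```python
from typing import List

# Illustrative qhost-style output (made up, not captured from a real cluster).
SAMPLE_QHOST = """\
HOSTNAME                ARCH         NCPU  LOAD  MEMTOT  MEMUSE  SWAPTO  SWAPUS
-------------------------------------------------------------------------------
global                  -               -     -       -       -       -       -
master                  linux-x64       8  0.01   14.7G  500.0M     0.0     0.0
node001                 linux-x64       8     -   14.7G       -     0.0       -
node002                 linux-x64       8  0.50   14.7G    1.2G     0.0     0.0
"""


def stale_hosts(qhost_output: str) -> List[str]:
    """Return host names whose LOAD column is "-" (assumed to mean unreachable)."""
    stale = []
    for line in qhost_output.splitlines():
        fields = line.split()
        # The separator line and blank lines have fewer than 4 fields.
        if len(fields) < 4:
            continue
        name, load = fields[0], fields[3]
        # Skip the column header and the "global" pseudo-host.
        if name in ("HOSTNAME", "global"):
            continue
        if load == "-":
            stale.append(name)
    return stale


if __name__ == "__main__":
    for host in stale_hosts(SAMPLE_QHOST):
        print(host)
```

On a live cluster the input would come from running qhost on the master. Hosts flagged this way would be candidates for manual cleanup (for example with qconf -de once their jobs are deleted), though whether that is the right fix here is exactly what we are unsure about.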

We are using g2.2xlarge instances, and requesting 8 slots per submitted job.

Has anyone experienced similar issues?

Mircea
Received on Tue Nov 24 2015 - 07:01:24 EST