StarCluster - Mailing List Archive

load balancer stopped working?

From: David Koppstein <no email>
Date: Mon, 08 Jun 2015 15:10:37 +0000

Hi,

I noticed that my load balancer stopped working -- specifically, it has
stopped deleting unnecessary nodes. It had been running fine for about three
weeks.

I have a small t2.micro instance load balancing a cluster of m3.xlarge nodes.
The cluster is running Ubuntu 14.04 using the shared 14.04 AMI ami-38b99850.

The loadbalancer process is still running (started with nohup CMD &, where
CMD is the loadbalancer command below):

```
ubuntu@ip-10-0-0-20:~$ ps -ef | grep load
ubuntu 11784 11730 0 15:04 pts/1 00:00:00 grep --color=auto load
ubuntu 19493 1 0 Apr26 ? 01:25:03 /opt/venv/python2_venv/bin/python /opt/venv/python2_venv/bin/starcluster -c /home/ubuntu/.starcluster/config loadbalance -n 1 -m 20 -w 300 dragon-1.3.0
```
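The same check can be scripted. What follows is only a rough sketch, not part of StarCluster: `find_loadbalance` is a hypothetical helper name, it assumes the standard eight-column `ps -ef` layout, and it only reads the short `-n`, `-m` and `-w` options used in the command above. It skips the `grep` line by requiring the literal `loadbalance` subcommand.

```python
import shlex


# Hypothetical helper, not part of StarCluster: scan `ps -ef` text for
# running `starcluster ... loadbalance` processes.
def find_loadbalance(ps_output):
    found = []
    for line in ps_output.splitlines():
        # Assumed ps -ef columns: UID PID PPID C STIME TTY TIME CMD
        fields = line.split(None, 7)
        if len(fields) < 8:
            continue
        cmd = shlex.split(fields[7])
        # Require the literal subcommand, so "grep ... load" is not matched.
        if "loadbalance" not in cmd:
            continue
        if not any(arg.endswith("starcluster") for arg in cmd):
            continue
        # Pair each argument with the one after it to read flag values.
        args = cmd[cmd.index("loadbalance") + 1:]
        opts = {}
        for flag, value in zip(args, args[1:]):
            if flag in ("-n", "-m", "-w"):
                opts[flag] = int(value)
        found.append((int(fields[1]), opts))
    return found


# Sample input copied from the ps output above, with the wrapped line rejoined.
sample = (
    "ubuntu 11784 11730 0 15:04 pts/1 00:00:00 grep --color=auto load\n"
    "ubuntu 19493 1 0 Apr26 ? 01:25:03 /opt/venv/python2_venv/bin/python "
    "/opt/venv/python2_venv/bin/starcluster -c /home/ubuntu/.starcluster/config "
    "loadbalance -n 1 -m 20 -w 300 dragon-1.3.0\n"
)
print(find_loadbalance(sample))
```

Note that this only confirms the process is alive and shows which options it was started with; it says nothing about whether the loadbalancer is still polling the queue.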

The queue has been empty for several days.

```
dkoppstein@master:/dkoppstein/150521SG_v1.9_round2$ qstat -u "*"
dkoppstein@master:/dkoppstein/150521SG_v1.9_round2$
```

However, about 8 nodes have been running over the weekend and are not being
terminated despite -n 1. If anyone has any guesses as to why the loadbalancer
might stop working, please let me know so I can prevent this from happening
in the future.

Thanks,
David
Received on Mon Jun 08 2015 - 11:10:51 EDT
