StarCluster - Mailing List Archive

Fixed 'ZeroDivisionError: float division' error when launching a cluster

From: Stuart Young <no email>
Date: Wed, 09 Feb 2011 18:28:02 -0500

Hi All,

Fyi, I came across an intermittent error message when launching a
cluster with the latest versions of the trunk and the load balancer branch:

   File "build/bdist.linux-x86_64/egg/starcluster/progressbar.py", line
261, in percentage
     return self.currval * 100.0 / self.maxval
ZeroDivisionError: float division

I fixed it by adding a pause in cluster.py BEFORE looking for running
instances:

LINE 1039, cluster.py:

         # ADDED SLEEP TO AVOID CRASH JUST AFTER FIRST CHECK OF RUNNING
INSTANCES
         # WHEN INSTANCES ARE NOT YET AVAILABLE
         # >>> Waiting for all nodes to be in a 'running' state...
         time.sleep(interval)


FULL ERROR OUTPUT:

starcluster start smallcluster
StarCluster - (http://web.mit.edu/starcluster) (v. 0.9999)
Software Tools for Academics and Researchers (STAR)
Please submit bug reports to starcluster_at_mit.edu

>>> Using default cluster template: smallcluster
>>> AutoMount.__init__ Running AutoMount plugin.
>>> Validating cluster template settings...
>>> Cluster template settings are valid
>>> Starting cluster...
>>> Launching a 2-node cluster...
>>> Launching master (ami: ami-a5c42dcc, type: m1.large)
>>> Launching node001 (ami: ami-a5c42dcc, type: m1.large)
>>> Creating security group _at_sc-smallcluster...
Reservation:r-0664ab6b
>>> Waiting for cluster to come up... (updating every 30s)
>>> Waiting for all nodes to be in a 'running' state...
Traceback (most recent call last):
   File "build/bdist.linux-x86_64/egg/starcluster/cli.py", line 152, in main
     sc.execute(args)
   File "build/bdist.linux-x86_64/egg/starcluster/commands/start.py",
line 195, in execute
     scluster.start(create=create, create_only=create_only, validate=False)
   File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
1191, in start
     return self._start(create, create_only)
   File "build/bdist.linux-x86_64/egg/starcluster/utils.py", line 69, in
wrap_f
     res = func(*arg, **kargs)
   File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
1209, in _start
     self._setup_cluster()
   File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
1223, in _setup_cluster
     self.wait_for_cluster()
   File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
1041, in wait_for_cluster
     pbar.update(0)
   File "build/bdist.linux-x86_64/egg/starcluster/progressbar.py", line
312, in update
     self.prev_percentage = self.percentage()
   File "build/bdist.linux-x86_64/egg/starcluster/progressbar.py", line
261, in percentage
     return self.currval * 100.0 / self.maxval
ZeroDivisionError: float division

cli.py:173 - ERROR - Oops! Looks like you've found a bug in StarCluster
cli.py:174 - ERROR - Debug file written to: /tmp/starcluster-debug-root.log
cli.py:175 - ERROR - Look for lines starting with PID: 3104
cli.py:177 - ERROR - Please submit this file, minus any private information,
cli.py:178 - ERROR - to starcluster_at_mit.edu


Cheers,

Stuart
Received on Wed Feb 09 2011 - 18:29:05 EST
This archive was generated by hypermail 2.3.0.

Search:

Sort all by:

Date

Month

Thread

Author

Subject