StarCluster - Mailing List Archive

Re: Fixed 'ZeroDivisionError: float division' error when launching a cluster

From: Justin Riley <no email>
Date: Thu, 10 Mar 2011 15:06:30 -0500

-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1

Hi Stuart,

Thanks for reporting, this should be fixed in the latest github code.
Please let me know if you run into similar issues in the future.

~Justin

On 02/09/2011 06:28 PM, Stuart Young wrote:
> Hi All,
>
> Fyi, I came across an intermittent error message when launching a
> cluster with the latest versions of the trunk and the load balancer branch:
>
> File "build/bdist.linux-x86_64/egg/starcluster/progressbar.py", line
> 261, in percentage
> return self.currval * 100.0 / self.maxval
> ZeroDivisionError: float division
>
> I fixed it by adding a pause in cluster.py BEFORE looking for running
> instances:
>
> LINE 1039, cluster.py:
>
> # ADDED SLEEP TO AVOID CRASH JUST AFTER FIRST CHECK OF RUNNING
> INSTANCES
> # WHEN INSTANCES ARE NOT YET AVAILABLE
> # >>> Waiting for all nodes to be in a 'running' state...
> time.sleep(interval)
>
>
> FULL ERROR OUTPUT:
>
> starcluster start smallcluster
> StarCluster - (http://web.mit.edu/starcluster) (v. 0.9999)
> Software Tools for Academics and Researchers (STAR)
> Please submit bug reports to starcluster_at_mit.edu
>
> >>> Using default cluster template: smallcluster
> >>> AutoMount.__init__ Running AutoMount plugin.
> >>> Validating cluster template settings...
> >>> Cluster template settings are valid
> >>> Starting cluster...
> >>> Launching a 2-node cluster...
> >>> Launching master (ami: ami-a5c42dcc, type: m1.large)
> >>> Launching node001 (ami: ami-a5c42dcc, type: m1.large)
> >>> Creating security group _at_sc-smallcluster...
> Reservation:r-0664ab6b
> >>> Waiting for cluster to come up... (updating every 30s)
> >>> Waiting for all nodes to be in a 'running' state...
> Traceback (most recent call last):
> File "build/bdist.linux-x86_64/egg/starcluster/cli.py", line 152, in main
> sc.execute(args)
> File "build/bdist.linux-x86_64/egg/starcluster/commands/start.py",
> line 195, in execute
> scluster.start(create=create, create_only=create_only, validate=False)
> File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
> 1191, in start
> return self._start(create, create_only)
> File "build/bdist.linux-x86_64/egg/starcluster/utils.py", line 69, in
> wrap_f
> res = func(*arg, **kargs)
> File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
> 1209, in _start
> self._setup_cluster()
> File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
> 1223, in _setup_cluster
> self.wait_for_cluster()
> File "build/bdist.linux-x86_64/egg/starcluster/cluster.py", line
> 1041, in wait_for_cluster
> pbar.update(0)
> File "build/bdist.linux-x86_64/egg/starcluster/progressbar.py", line
> 312, in update
> self.prev_percentage = self.percentage()
> File "build/bdist.linux-x86_64/egg/starcluster/progressbar.py", line
> 261, in percentage
> return self.currval * 100.0 / self.maxval
> ZeroDivisionError: float division
>
> cli.py:173 - ERROR - Oops! Looks like you've found a bug in StarCluster
> cli.py:174 - ERROR - Debug file written to: /tmp/starcluster-debug-root.log
> cli.py:175 - ERROR - Look for lines starting with PID: 3104
> cli.py:177 - ERROR - Please submit this file, minus any private information,
> cli.py:178 - ERROR - to starcluster_at_mit.edu
>
>
> Cheers,
>
> Stuart
> _______________________________________________
> StarCluster mailing list
> StarCluster_at_mit.edu
> http://mailman.mit.edu/mailman/listinfo/starcluster

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.17 (GNU/Linux)
Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/

iEYEARECAAYFAk15L0YACgkQ4llAkMfDcrnieACfQ7qOSHBX5oH2Kkr5lta/AaLt
mloAnAuaqPuLiVzaArrR0FPqc0QQGMqe
=BpCR
-----END PGP SIGNATURE-----
Received on Thu Mar 10 2011 - 15:06:46 EST
This archive was generated by hypermail 2.3.0.

Search:

Sort all by:

Date

Month

Thread

Author

Subject