StarCluster - Mailing List Archive

problem adding nodes. Node003 doesnt exist so I crash

From: Ramon Ramirez-Linan <no email>
Date: Fri, 14 Nov 2014 10:41:02 -0500

I was trying to add 97 nodes to y 3 nodes cluster.

I try last evening without success because AWS didnt have enought EC2 of
this type on my region (c3.4xlarge US-EAST)

I tried this morning and it looks like it was adding them, but for some
reason some nodes did not get added and StarCluster crashed since it
couldnt find those nodes to configure them

This is the error

ec2-user_at_ip-172-31-1-249 ~]$ time starcluster addnode -n 97 aes-300
/usr/lib64/python2.6/site-packages/Crypto/Util/number.py:57:
PowmInsecureWarning: Not using mpz_powm_sec. You should rebuild using
libgmp >= 5 to avoid timing attack vulnerability.
  _warn("Not using mpz_powm_sec. You should rebuild using libgmp >= 5 to
avoid timing attack vulnerability.", PowmInsecureWarning)
StarCluster - (http://star.mit.edu/cluster) (v. 0.95.5)
Software Tools for Academics and Researchers (STAR)
Please submit bug reports to starcluster_at_mit.edu

>>> Launching node(s): node003, node004, node005, node006, node007,
node008, node009, node010, node011, node012, node013, node014, node015,
node016, node017, node018, node019, node020, node021, node022, node023,
node024, node025, node026, node027, node028, node029, node030, node031,
node032, node033, node034, node035, node036, node037, node038, node039,
node040, node041, node042, node043, node044, node045, node046, node047,
node048, node049, node050, node051, node052, node053, node054, node055,
node056, node057, node058, node059, node060, node061, node062, node063,
node064, node065, node066, node067, node068, node069, node070, node071,
node072, node073, node074, node075, node076, node077, node078, node079,
node080, node081, node082, node083, node084, node085, node086, node087,
node088, node089, node090, node091, node092, node093, node094, node095,
node096, node097, node098, node099
Reservation:r-2fe2d905
>>> Waiting for instances to propagate...
97/97 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
100%
>>> Waiting for node(s) to come up... (updating every 30s)
>>> Waiting for all nodes to be in a 'running' state...
38/38 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
100%
>>> Waiting for SSH to come up on all nodes...
38/38 ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
100%
>>> Waiting for cluster to come up took 1.882 mins
!!! ERROR - node 'node003' does not exist

real 2m4.162s
user 0m4.868s
sys 0m0.332s
Received on Fri Nov 14 2014 - 10:41:04 EST
This archive was generated by hypermail 2.3.0.

Search:

Sort all by:

Date

Month

Thread

Author

Subject