StarCluster - Mailing List Archive

Bug.. need urgent workaround

From: Sergio Mafra <no email>
Date: Wed, 10 Jul 2013 09:58:56 -0300

Dear Friends,

I´m trying to setup a new cluster and StarCluster hangs on this point:

It´s a fresh creation of a 2 cr1.8xlarge instances using spot instances.
I´m using the latest version of SC 0.999

----
ERROR - Error occured while running plugin
'starcluster.clustersetup.DefaultClusterSetup':
!!! ERROR - error occurred in job (id=master): remote command 'source
/etc/profile && groupadd -o -g 1000 sgeadmin' failed with status 9:
groupadd: group sgeadmin exists
Traceback (most recent call last):
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/threadpool.py",
line 31, in run
    job.run()
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/threadpool.py",
line 58, in run
    r = self.method(*self.args, **self.kwargs)
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/clustersetup.py",
line 193, in _add_user_to_node
    node.add_user(self._user, uid, gid, self._user_shell)
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/node.py",
line 422, in add_user
    self.ssh.execute('groupadd -o -g %s %s' % (gid, name))
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/sshutils/__init__.py",
line 538, in execute
    msg, command, exit_status, out_str)
RemoteCommandFailed: remote command 'source /etc/profile && groupadd -o -g
1000 sgeadmin' failed with status 9:
groupadd: group sgeadmin exists
error occurred in job (id=node001): remote command 'source /etc/profile &&
groupadd -o -g 1000 sgeadmin' failed with status 9:
groupadd: group sgeadmin exists
Traceback (most recent call last):
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/threadpool.py",
line 31, in run
    job.run()
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/threadpool.py",
line 58, in run
    r = self.method(*self.args, **self.kwargs)
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/clustersetup.py",
line 193, in _add_user_to_node
    node.add_user(self._user, uid, gid, self._user_shell)
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/node.py",
line 422, in add_user
    self.ssh.execute('groupadd -o -g %s %s' % (gid, name))
  File
"/usr/local/lib/python2.7/dist-packages/StarCluster-0.9999-py2.7.egg/starcluster/sshutils/__init__.py",
line 538, in execute
    msg, command, exit_status, out_str)
RemoteCommandFailed: remote command 'source /etc/profile && groupadd -o -g
1000 sgeadmin' failed with status 9:
groupadd: group sgeadmin exists
!!! ERROR - Oops! Looks like you've found a bug in StarCluster
!!! ERROR - Crash report written to:
/home/ubuntu/.starcluster/logs/crash-report-8819.txt
!!! ERROR - Please remove any sensitive data from the crash report
!!! ERROR - and submit it to starcluster_at_mit.edu
All the best,
Sergio
Received on Wed Jul 10 2013 - 08:58:59 EDT
This archive was generated by hypermail 2.3.0.

Search:

Sort all by:

Date

Month

Thread

Author

Subject