load balanced nodes accepting jobs before ready
When loadbalancer adds nodes, it’s making them available to the scheduler before they’re fully provisioned. I’m using pkginstaller plugin to install required libraries across the cluster, but if a job hits a newly added node before pkginstaller has finished, those jobs then fail because the library was not yet installed.
So, I need to either
1. force loadbalancer to wait until all provisioning is complete before readying the node for job scheduling
2. Bypass pkginstaller altogether by making the master node share its libraries with the rest of the cluster over nfs
Any suggestions?
--
Andrew Stewart
Office of Research Information Services (ORIS),
Office of the Chief Information Officer (OCIO),
Smithsonian Institution
202-505-3633
Received on Sat Apr 12 2014 - 20:28:18 EDT
This archive was generated by
hypermail 2.3.0.