Hi Christopher,
Do you have a lot of I/O? For example writing and reading many files to the
same NFS location?
This may explain things.
Jacob
On Jul 30, 2015 2:34 AM, "Christopher Clearfield" <
chris.clearfield_at_system-logic.com> wrote:
> Hi All,
> I'm running a set of about 60K relatively short jobs that take 30 minutes
> to run. This is through ipython parallel.
>
> Yet my CPU utilization levels are relatively small:
>
> queuename qtype resv/used/tot. load_avg arch states
> ---------------------------------------------------------------------------------
> all.q_at_master BIP 0/0/2 0.98 linux-x64
> ---------------------------------------------------------------------------------
> all.q_at_node001 BIP 0/0/8 8.01 linux-x64
> ---------------------------------------------------------------------------------
> all.q_at_node002 BIP 0/0/8 8.07 linux-x64
> ---------------------------------------------------------------------------------
> all.q_at_node003 BIP 0/0/8 7.96 linux-x64
>
> (I disabled the ipython engines on master because I was having heartbeat
> timeout issues with the worker engines on my nodes, which explains why that
> is so low).
>
> But ~8% utilization on the nodes. Is that expected?
>
> Thanks,
> Chris
>
>
> _______________________________________________
> StarCluster mailing list
> StarCluster_at_mit.edu
> http://mailman.mit.edu/mailman/listinfo/starcluster
>
>
Received on Thu Jul 30 2015 - 13:52:51 EDT
This archive was generated by
hypermail 2.3.0.