I find dstat very useful when trying to isolate slowdowns. It's like
an enhanced version of top with many more useful stats.
http://linux.die.net/man/1/dstat
- Combines vmstat, iostat, ifstat, netstat information and more
- Shows stats in exactly the same timeframe
- Enable/order counters as they make most sense during analysis/troubleshooting
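
To make the kind of CPU breakdown such tools report more concrete, here is a rough Python sketch that turns two snapshots of /proc/stat (Linux only) into percentages of elapsed CPU time. This is not how dstat itself is implemented. The function names (read_cpu_counters, cpu_percentages, sample) are made up for illustration, and the field order is an assumption taken from the proc(5) man page.

```python
import time

# Assumed order of the aggregate "cpu" counters in /proc/stat, per proc(5).
FIELDS = ("user", "nice", "system", "idle", "iowait", "irq", "softirq", "steal")


def read_cpu_counters(path="/proc/stat"):
    """Return the cumulative CPU counters from the first line of /proc/stat."""
    with open(path) as f:
        parts = f.readline().split()
    # parts[0] is the label "cpu"; the values after it are cumulative ticks.
    values = [int(p) for p in parts[1:1 + len(FIELDS)]]
    return dict(zip(FIELDS, values))


def cpu_percentages(before, after):
    """Convert two counter snapshots into percentages of elapsed CPU time."""
    deltas = {k: after.get(k, 0) - before.get(k, 0) for k in FIELDS}
    total = sum(deltas.values())
    if total <= 0:
        # No ticks elapsed between the snapshots; report zeros.
        return {k: 0.0 for k in FIELDS}
    return {k: 100.0 * v / total for k, v in deltas.items()}


def sample(interval=5.0):
    """Take two snapshots `interval` seconds apart and return the percentages."""
    first = read_cpu_counters()
    time.sleep(interval)
    second = read_cpu_counters()
    return cpu_percentages(first, second)
```

Calling sample(5) on a worker node while jobs are running gives one reading. A large iowait share next to a small user share would suggest the engines are waiting on disk or NFS rather than computing.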
On Jul 30, 2015 1:53 PM, "Jacob Barhak" <jacob.barhak_at_gmail.com> wrote:
> Hi Christopher,
>
> Do you have a lot of I/O? For example writing and reading many files to
> the same NFS location?
>
> This may explain things.
>
> Jacob
> On Jul 30, 2015 2:34 AM, "Christopher Clearfield" <
> chris.clearfield_at_system-logic.com> wrote:
>
>> Hi All,
>> I'm running a set of about 60K relatively short jobs that take 30 minutes
>> to run. This is through ipython parallel.
>>
>> Yet my CPU utilization levels are relatively small:
>>
>> queuename        qtype  resv/used/tot.  load_avg  arch       states
>> ---------------------------------------------------------------------------------
>> all.q_at_master  BIP    0/0/2           0.98      linux-x64
>> ---------------------------------------------------------------------------------
>> all.q_at_node001 BIP    0/0/8           8.01      linux-x64
>> ---------------------------------------------------------------------------------
>> all.q_at_node002 BIP    0/0/8           8.07      linux-x64
>> ---------------------------------------------------------------------------------
>> all.q_at_node003 BIP    0/0/8           7.96      linux-x64
>>
>> (I disabled the ipython engines on master because I was having heartbeat
>> timeout issues with the worker engines on my nodes, which explains why that
>> is so low).
>>
>> But ~8% utilization on the nodes. Is that expected?
>>
>> Thanks,
>> Chris
>>
>>
>> _______________________________________________
>> StarCluster mailing list
>> StarCluster_at_mit.edu
>> http://mailman.mit.edu/mailman/listinfo/starcluster
>>
>>
Received on Thu Jul 30 2015 - 17:26:06 EDT