Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

...

If your job does not use the system openmpi, or does not use MPI, then any desired core binding will need to be set up with whatever mechanism the software uses. Otherwise, there will be no core binding. Again, that may not be a major issue. If your job does not work well with HT then run on a number of cores equal to half of the number of slots requested and the OS scheduler will minimize contention. 

new SGE utilities

While SoGE is very similar to previous versions of SGE there are some new utilities that people may find of interest. There are manual pages for each of these.

...

In addition to the above, the HPC systems have some nodes that are not part of any investor queue. These are in the /wiki/spaces/hpcdocs/pages/76513448 and are used for node rentals and future purchases. The number of nodes for this purpose varies.

Resource requests

The Argon cluster is a heterogeneous cluster, meaning that it consists of different node types with varying amounts and types of resources. There are many resources that SGE keeps track of and most of them can be used in job submissions. However, the resource designations for machines based on CPU type, memory amount, and GPU are more likely to be used in practice. For the most part, machines with different amounts of memory and GPU capability are segregated by queuesThe investor queues will have a more limited variability in machine types. However, the all.q queue contains all machines and when running jobs in that queue it may be desirable to request specific machine types. The following table lists these outthe options for the more common resource requests. They would be selected with the '-l resource' flag to qsub. These are all Booleans.

Full Resource NameShortcut Resource NameNotes
std_mem
deprecated
sm
deprecated
use mem_128G
mid_mem
deprecated
mm
deprecated
use mem_256G
high_mem
deprecated

hm
deprecated

use mem_512G
mem_96G96G
mem_128G128G
mem_192G192G
mem_256G256G
mem_512G512G
cpu_archcpu_arch
  • broadwell
  • skylake_avx512
datacenterdc
  • ITF
  • LC
fabricfabric
  • infinipathnone
  • omnipath
gpugpu
gpu_k80k80
gpu_p100p100
gpu_p40

p40


gpu_titanvtitanv
gpu_1080ti1080ti

...