...
If your job does not use the system openmpi, or does not use MPI at all, then any desired core binding must be set up with whatever mechanism the software provides. Otherwise, there will be no core binding. Again, that may not be a major issue. If your job does not work well with HT, run it on a number of cores equal to half the number of slots requested, and the OS scheduler will minimize contention.
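
As a rough illustration of the half-the-slots approach, the job script fragment below derives a core count from `NSLOTS`, the variable SGE sets to the number of slots granted to a job. It is only a sketch: the fallback value of 16, the use of `OMP_NUM_THREADS` (which applies to OpenMP programs only), and the executable name `my_threaded_program` are assumptions for the example, not part of the cluster configuration.

```shell
#!/bin/bash
# NSLOTS is set by SGE inside a running job; the fallback of 16 is an
# assumption so the sketch can also be run outside the scheduler.
NSLOTS="${NSLOTS:-16}"

# Use half of the requested slots, but never fewer than one core.
CORES=$(( NSLOTS / 2 ))
if [ "$CORES" -lt 1 ]; then
    CORES=1
fi

# OMP_NUM_THREADS is the standard OpenMP thread-count variable; other
# software may need its own option instead.
export OMP_NUM_THREADS="$CORES"
echo "slots requested: $NSLOTS, cores used: $CORES"

# ./my_threaded_program   # hypothetical executable, left commented out
```

Software that is not OpenMP based would need its own thread or process count option set from the same value.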
New SGE utilities
While SoGE is very similar to previous versions of SGE, there are some new utilities that people may find of interest. There are manual pages for each of these.
...
In addition to the above, the HPC systems have some nodes that are not part of any investor queue. These are in the /wiki/spaces/hpcdocs/pages/76513448 and are used for node rentals and future purchases. The number of nodes for this purpose varies.
Resource requests
The Argon cluster is a heterogeneous cluster, meaning that it consists of different node types with varying amounts and types of resources. SGE keeps track of many resources, and most of them can be used in job submissions. However, the resource designations for machines based on CPU type, memory amount, and GPU are the ones most likely to be used in practice. For the most part, machines with different amounts of memory and GPU capability are segregated by queues. The investor queues will have more limited variability in machine types. However, the all.q queue contains all machines, and when running jobs in that queue it may be desirable to request specific machine types. The following table lists the options for the more common resource requests. They are selected with the '-l resource' flag to qsub. These are all Booleans.
Full Resource Name | Shortcut Resource Name | Notes |
---|---|---|
std_mem (deprecated) | sm (deprecated) | use mem_128G |
mid_mem (deprecated) | mm (deprecated) | use mem_256G |
high_mem (deprecated) | hm (deprecated) | use mem_512G |
mem_96G | 96G | |
mem_128G | 128G | |
mem_192G | 192G | |
mem_256G | 256G | |
mem_512G | 512G | |
cpu_arch | cpu_arch | |
datacenter | dc | |
fabric | fabric | |
gpu | gpu | |
gpu_k80 | k80 | |
gpu_p100 | p100 | |
gpu_p40 | p40 | |
gpu_titanv | titanv | |
gpu_1080ti | 1080ti | |
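
To show how these requests might be combined on a command line, the sketch below assembles a `qsub` invocation for all.q and prints it instead of submitting it. The job script name `myjob.sh` is hypothetical, and the pairing of `mem_256G` with `gpu_p100` is chosen only for illustration; the table does not say whether any node satisfies both.

```shell
#!/bin/bash
# Sketch: build a qsub command for all.q that requests specific machine types.
# Resource names are taken from the table; "myjob.sh" is a hypothetical script.
QUEUE="all.q"
RESOURCES="mem_256G gpu_p100"

CMD="qsub -q $QUEUE"
for r in $RESOURCES; do
    # A Boolean resource named without a value is treated as a request for true.
    CMD="$CMD -l $r"
done
CMD="$CMD myjob.sh"

# Print rather than submit, so the command can be inspected first.
echo "$CMD"
```

The shortcut names from the second column could be substituted for the full names in `RESOURCES`.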
...