Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

Table of Contents

...

If your job does not use the system openmpi, or does not use MPI, then any desired core binding will need to be set up with whatever mechanism the software uses. Otherwise, there will be no core binding. Again, that may not be a major issue. If your job does not work well with HT then run on a number of cores equal to half of the number of slots requested and the OS scheduler will minimize contention. 

new SGE utilities

While SoGE is very similar to previous versions of SGE there are some new utilities that people may find of interest. There are manual pages for each of these.

...


Note that requesting one of the *-GPU queues will automatically set ngpus=1 if that resource is not otherwise set. However, you will have to know what types of GPUs are in those queues if you need a specific type. Investor queues that have a mix of GPU and non-GPU nodes, ie., without the -GPU suffix will need to make a request for a GPU explicit. Since ngpus is a consumable resource, once the resource, the GPU device, is in use, then it is not available for other jobs on that node until it is freed up. If you wish to run non-GPU jobs on the node in tandem with a GPU job then specify ngpus=0 for false for the non-GPU job(s). 

...


In addition to the ngpus resource there some other non-Boolean resources for GPU nodes that could be useful to you. With the exception of requesting free memory on a GPU device these are informational.

ResorceResourceDescriptionRequestable
gpu.ncuda

number of CUDA GPUs on the host

NO
gpu.nopencl

number of OpenCL GPUs on the host

NO
gpu.ndev

total number of GPUs on the host

NO
gpu.cuda.N.mem_free

free memory on CUDA GPU N

YES
gpu.cuda.N.procs

number of processes on CUDA GPU N

NO
gpu.cuda.N.clock

maximum clock speed of CUDA GPU N (in MHz)

NO

gpu.cuda.N.util

compute utilization of CUDA GPU N (in %)

NO
gpu.opencl.N.clock

maximum clock speed of OpenCL GPU N (in MHz)

NO
gpu.opencl.N.mem

global memory of OpenCL GPU N

NO
gpu.names

semi-colon-separated list of GPU model names

NO

...