...
Compute clusters can be comprised of anywhere from several to thousands of compute nodes, a few "login" or "staging" nodes, and one or more head nodes, which coordinate the scheduling, distribution and data sharing of jobs.
What Is A Scheduler?
In order to fairly distribute a cluster's resources amongst a group of users, a software tool known as a scheduler is used. Cluster resources are divided into environments and resources. For example, users may choose between different available parallel environments (known as a PE) or special resources such as a co-processor, GPU, or extra memory. PEs add in extra functionality for parallel processing which manages inter-node process communication. Different types of parallel jobs may require a certain configuration of parallel environment, such as with the use of OpenMPI.
Schedulers generally work with a queueing system (Queues can be organized in various ways, but generally are organized by resource or by user access), which distributes jobs to the cluster as resources become available. Schedulers Schedulers monitor user job submissions, then check available resources to determine which jobs can be run at a given time.
Other factors are also taken into consideration, depending on how the scheduler may be configured. For example, schedulers can prioritize submissions based on how frequently a particular user may be using the system, and will put a slightly higher priority on jobs from users who do not use the system as frequently. This ensures that the system will not be dominated by only a few very active users.
University of Iowa Cluster Systems
The University of Iowa currently has a shared HPC cluster available for campus researchers to use. The shared system is run primarily by ITS-Research Services. The cluster is capable of running both High Performance jobs and High Throughput jobs. The system is comprised of several hundred compute nodes with several thousands of processor cores.
What are the differences between High Throughput Computing and High Performance Computing?
High Performance computing enables a user to solve a single, large problem by harnessing a large number of processors and memory across multiple compute nodes. These types of problems are typically broken down into pieces and processed in parallel, with different compute nodes working on a different part of the problem. Each node communicates with the other nodes working on the problem via a high-speed interconnect -- in our case, we use OmniPath. Parallel processing typically requires code modification in order to utilize a library such as MPI which in turn facilitates parallel communication between the nodes working to solve the problem. Examples of problems that use High Performance Computing are Computational Fluid Dynamics and Molecular Dynamics. More information on using MPI on our HPC systems is available here: MPI Implementation.
High Throughput computing allows a user to use multiple compute nodes in a coordinated fashion to solve a high number of individual problems. The jobs that make up this sort of computation typically do not communicate with each other. This provides the ability to analyze many data sets simultaneously, and also allows the user to efficiently perform a parameter sweep, which refers to running the same program multiple times, but with varying inputs.
...
I just received my HPC account, what are my next steps?
Getting started with Linux
Our HPC cluster runs CentOS Linux, the version being current at the time of system deployment. In order to make use of the cluster, users will need a basic understanding of how to interact with a Linux system at the command line. At a minimum, you will need to know how to move around the system, copy and edit files. There are many resources on the Internet devoted to helping you learn your way around a Linux system. One of the best resources available is a book called The Linux Command Line, which is available as a free PDF download. For a quicker overview of basic Linux commands, here is a good Linux Cheat Sheet: the-linux-command-line.pdf.
Mapping your work to one of the clusters
If your compute problem is not tractable on a desktop or lab workstation, uses a large amount of memory, requires a rapid turnaround of results, would benefit from being scheduled, then an HPC cluster may be a good fit for you. The next steps are to determine if your computation runs on Linux, can be run in batch mode (non-interactively) and whether it is a high performance (parallel) or high throughput (serial) job. Determining the answers to these questions will help decide how to go about requesting and utilizing HPC resources. Some additional questions to consider are:
Will you need to recompile your code to run on our cluster?
If you are bringing code over from another system, you may need to recompile it to work on our systems, especially if you are using MPI (of which we offer a few different varieties). We have some additional notes on compiling here: Compiling SoftwareWhat software will your job need, and is it available centrally, or could it be installed in your home directory?
Our list of installed software is here: Software Installations. If you don't see a package you need, please let us know, and if it is broadly applicable to a number of users, we my install it centrally, or we will help install it into your home directory.Can you estimate how much memory your job will need?
Knowing approximately how many processes you will need or how much memory to request will help ensure you request enough resources to get your job to complete. One way to discover this is to run a small version of the job to see how much memory it uses and then calculate how much it would use if you were to double or triple it in size. We also offer a small development queue on the HPC cluster that you may submit small jobs to to see how things go, and then tweak your resource requests accordingly.
Getting your data into the cluster
If your data is not large, the quickest way to get your data onto one of the clusters is to use scp, rsync or sftp from the command line or via an application such as Fetch (Mac based) or IPSwitch (windows based). If you have larger data sets (larger meaning several Gigabytes or more), then you can utilize our Globus Online connection.
Storage Options
HPC accounts have a 1TB quota, but there are times when more storage, or a group share might be required for your work. ITS Research Services has made several options available in an attempt to meet these needs.
...
Once your data is uploaded to the cluster, you are ready to work on getting a job submitted to the cluster. If you are going to use one of our centrally installed software packages, you'll need to load the module for it into your environment. More in-depth information on this is on our Environment Modules page. Basically, you'd use the command
No Formatcode |
---|
module avail |
To list the modules available to choose from. Then you'd use
No Formatcode |
---|
module load <module-name> |
To load that module into your environment. Note that some modules are not compatible, and will not load together. The environment module system will automatically load the correct dependent modules. You may also use the
No Formatcode |
---|
module show <module-name> |
...
Our cluster uses the SGE scheduler to match job submissions with available resources. There is extensive documentation on using SGE and all the options available. We offer pages on Basic Job Submission and Advanced Job Submission for the cluster. Launching jobs is done via qsub with options on the command line or via special commands in the job script which are then passed to the scheduler for controlling your job. A qsub script can be very simple, consisting of a few commands, or very complex, depending on your needs.
Note that if you use qsub options in the job script, then any additional options you pass to qsub on the command line when you launch the script will override those same settings inside the script. For example, if your script specifies the UI queue with #$ -q UI
, but you would like to do a submission to the development queue for prototyping, you can override the UI queue on the command line with qsub:
No Formatcode |
---|
qsub -q UI-DEVELOP <myscript.sh> |
...
Once you have launched your job using qsub, you will want to be able to monitor its progress on the cluster. There are a couple of simple ways to do this. First, you need to find out what the "jobid" is of your job. A jobid is a unique number assigned by SGE to each job. To get this information, you can use the qstat command like so:
No Formatcode |
---|
qstat -u <username> |
which will produce the following output:
No Formatcode | ||
---|---|---|
| ||
qstat -u aarenas job-ID prior name user state submit/start at queue slots ja-task-ID ----------------------------------------------------------------------------------------------------------------- 44348 1.00692 BC80_9 aarenas r 01/10/2019 11:21:50 IWA@argon-lc-h21-16.hpc 56 |
...
Code Block | ||
---|---|---|
| ||
$ qstat -j 8853637 ============================================================== job_number: 8853637 exec_file: job_scripts/8853637 submission_time: Tue Jan 6 09:57:10 2015 owner: naomi uid: 1205679 group: its-rs-neon gid: 899998927 sge_o_home: /Users/naomi sge_o_log_name: naomi sge_o_path: <path information here> sge_o_shell: /bin/bash sge_o_workdir: /Users/naomi/jobs/espresso sge_o_host: neon-login-0-1 account: sge cwd: /Users/naomi/jobs/espresso merge: y mail_options: abes mail_list: naomi-hospodarsky@uiowa.edu notify: FALSE job_name: QE-CO2-Test-time jobshare: 0 hard_queue_list: sandbox shell_list: NONE:/bin/bash env_list: <environment information here> script_file: espresso-test.sh parallel environment: 16cpn range: 16 usage 1: cpu=00:01:57, mem=34.41240 GBs, io=0.18064, vmem=N/A, maxvmem=5.972G scheduling info: (Collecting of scheduler job information is turned off) |
The above information gives an overview of how your job looks to the scheduler. You can see job submission & start times, queue requests, slot requests, and the environment loaded at the time of job submission. One of the most useful lines in this output, however, is the "usage" line. This line will show you peak resource usage of your job. Pay special attention to "maxvmem" as this is the peak memory used by your job up to that point; you can use this information to help determine if you have requested enough resources for your job to operate with.
Two additional commands which may be useful are "qdel" for deleting jobs:
Code Block | ||
---|---|---|
| ||
$ qdel -j <Jobid> # deletes jobs by jobid $ qdel -u <username> # deletes all jobs owned by user |
and qacct for gathering info about a completed job:
Code Block | ||
---|---|---|
| ||
$ qacct -o <username> -j -d <days to report> # shows accounting records for each of the user's job for the last x number of days $ qacct -j <jobid> # shows accounting record for a specific job |
The accounting logs are rolled over and archived to prevent them from getting too large. They can be found in the /opt/ohpc/admin/sge
directory. They are compressed but can be decompressed on the fly and fed into qacct.
No Formatcode |
---|
zcat /opt/ohpc/admin/sge/accounting-20201201.gz | qacct -f - |
Conclusion
This was a high-level introduction to HPC computing, and there are many topics not covered by this wiki page. Our other wiki pages offer more detail on various aspects of the system and how to use it. We We also offer consulting services, so if you have questions about using our resources, HPC in general, or would simply like additional assistance getting started, please do not hesitate to contact our staff: research-computing@uiowa.edu, and one of us will be happy to help you.