Once you have created your XSEDE account, set up DUO authentication, and verified that you can connect to the SSO (single sign-on) hub (https://portal.xsede.org/single-sign-on-hub), follow the instructions to start the module at
https://portal.xsede.org/psc-bridges
together with the augmented instructions that follow. These are the steps your students will most likely follow as well. Stay tuned for official instructions. Let me know if you do not get an account set up and
active by Monday, and we will set up another time.
From XSEDE's Single Sign On (SSO) login hub, execute the following instructions to generate your personal virtual three-node Hadoop cluster.
Starting a Hadoop session with a persistent HDFS takes three simple steps (1 to 3); steps 4 and 5 then load and run Hive:
1. Start an interactive job with at least 3 nodes:
interact -N 3
2. Load the hadoop module when the job starts:
module load hadoop
...
IMPORTANT: do NOT run start-hadoop.sh, even though the module suggests it.
3. Instead, run this:
/opt/packages/hadoop-testing/persist-start.sh
4. Finally, load up the Hive module:
source /opt/packages/hive/hive.sh
5. Run Hive:
hive
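
As a convenience, steps 2 to 5 above can be collected into a small shell function. This is only a sketch, not part of the official instructions: the names run_step, start_hadoop_hive, and DRY_RUN are made up for this example, and it assumes bash (the steps above already use "source"). Step 1 (interact -N 3) is left out on purpose, because it opens a new shell on the compute node; define the function inside that shell once the job has started.

```shell
# Sketch only: run_step, start_hadoop_hive and DRY_RUN are hypothetical
# names introduced for this example. Define these functions in the shell
# of the interactive job (after "interact -N 3") so "module" is available.

run_step() {
  if [ "${DRY_RUN:-1}" = "1" ]; then
    # Preview mode (the default): print the command instead of running it.
    printf '+ %s\n' "$*"
  else
    # Run the command in the current shell.
    # Caveat: a file sourced here sees run_step's arguments as its
    # positional parameters; hive.sh is assumed not to read them.
    "$@"
  fi
}

start_hadoop_hive() {
  # Steps 2 to 5 from the list above; stop at the first failure.
  # start-hadoop.sh is deliberately NOT called.
  run_step module load hadoop &&
  run_step /opt/packages/hadoop-testing/persist-start.sh &&
  run_step source /opt/packages/hive/hive.sh &&
  run_step hive
}
```

Calling start_hadoop_hive on its own only prints the four commands, so you can check them first. Calling it as "DRY_RUN=0 start_hadoop_hive" actually runs them and ends at the Hive prompt.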
For more information on Bridges, see https://portal.xsede.org/psc-bridges.