Introduction to Interactive Data Analytics for New Users
On this page:
What is the Interactive Data Analytics Service?
The Interactive Data Analytics Service (IDAS) is provided by Information Technology Services - Research Services (ITS-RS). IDAS supports large-scale and collaborative data analytics using interactive tools such as RStudio for R and JupyterLab for Python, R, and Julia. Users will benefit from high-performance computing resources while performing their interactive data analysis tasks.
New users might find our FAQs helpful.
If you are deciding whether to use IDAS or Argon for your research project, please see Comparing IDAS and Argon.
Find more support articles by expanding the heading “Interactive Data Analytics Service Documentation” in the sidebar to the left of this page.
What are Python, R, and Julia?
Python, R, and Julia are among the most popular programming languages for data analysis. Python and Julia are general-purpose languages, and R is a language for statistical computing and graphics. Python, R, and Julia are open-source projects and free for everyone to use, although some companies provide commercial support and/or extensions for their customers. While Python, R, and Julia share many important features as high-level, open-source programming languages, they each have their own strengths and weaknesses from a data-science perspective.
What is interactive data analytics?
Interactiveness is one of the key features of data analytics. Due to the nature of data analysis, it is challenging to write perfect code on the first try. You write and run some code, see the output, and then based on that output write and run another code, see the output, so on and so forth, until you get the final, satisfactory outcome of your data analysis. That process is called interactive data analytics.
What are Jupyter Notebook, JupyterLab, and JupyterHub?
Python support iPython, which stands for interactive Python, for interactive scripting in a shell environment. The Jupyter Notebook is an open-source web application built on top of iPython. It allows you to create and share documents that contain live code, equations, visualizations and narrative text. Uses of the Jupyter Notebook include data cleaning and transformation, numerical simulation, statistical modeling, data visualization, machine learning, and much more. The Jupyter Notebook supports not only Python but also many of the other commonly-used programming languages such as R, Java, and Julia.
Project Jupyter, the community that builds Jupyter Notebook, released JupyterLab as its latest web-based user interface. Project Jupyter has stated that JupyterLab would eventually replace the original Jupyter Notebook. IDAS currently uses JupyterLab as the default interface (instead of Jupyter Notebook).
JupyterHub is a server version of the Jupyter Notebook for multiple users. It can be used in a class of students, a corporate data science group or scientific research group. It is a multi-user Hub that spawns, manages, and proxies multiple instances of the single-user Jupyter notebook server.
What is RStudio?
RStudio is a free and open-source integrated development environment (IDE) for R. RStudio Desktop is a standalone desktop application that works with the version of R you have installed on your local computer. RStudio Server is a Linux server application for multiple users that provides a web-based interface to the version of R running on the server.
I want to use IDAS. How do I start?
We are truly glad that you want to give IDAS a try.
To start, you will need to request an IDAS account.
After your account is approved, please follow the instructions here to access IDAS for research use or the instructions here to access IDAS for class use.
Find more information about common tasks in IDAS.
If you are curious about our updates, visit our changelog.
Whom do I contact if I have questions?
Please reach out to Cody B Johnson and Giang Rudderham at research-computing@uiowa.edu.