Education, Training, and Outreach

SciNet is the high performance computing centre at the University of Toronto, established in 2009 as a consortium of the University of Toronto and its affiliated research hospitals. Its mission is to provide computational resources, specialized support and training to any Canadian academic researcher.

The full power of high performance computing systems can best be exploited by people with specialized knowledge. The education and training of such people is absolutely critical, especially since the methodology in many disciplines has evolved to include a large computational component. SciNet has developed an education and training program for the wider scientific community aimed at helping students and users obtain the skills and knowledge required to get the most out of advanced research computing resources.

SciNet’s education and training program

SciNet’s education program started with the traditional “Intro to SciNet” sessions and yearly intensive parallel programming workshops. As our user base has grown to encompass fields relatively new to HPC, such as medical science, biology, forestry, and economics, the program has grown to include topics in data science such as introductory scientific computing in Python, R, machine learning, and work-flow design, while still including advanced research computing and high performance computing.

The skills that SciNet aims to transfer are rare and sought-after, they complement and enhance the skills students learn in regular curricula. Users and students can get a certificate in Scientific Computing, Data Science, or High Performance Computing once they have completed enough SciNet credit-hours. As a document that proves the holder has highly competitive skills, the certificates are in high demand. From the start of the program in 2013 until February 2017, a total of 80 SciNet certificates were issued.

scinet_stats_studenthours_growth_2

The growth of SciNet’s education program is illustrated by the following chart which counts the total number of attendance (number of attendees times duration in hours) of all education and training events given by SciNet.

This graph also highlights the growth in popularity of our data science courses.

SciNet courses tie into university graduate programs

By partnering with other departments in the University, an increasing number of our training courses have been taken for credit toward graduate degrees at the University of Toronto. Our current partners include the Departments of Physics, Astrophysics, Chemistry, and Ecology and Evolutionary Biology, as well as the Institute for Medical Sciences. Indicative of the success of our “partnered” courses, the full term physics graduate course “Scientific Computing for Physicists” had an enrollment of nearly 50 in the Winter of 2016, and attracted students from many different departments such as physics, astrophysics, engineering and math. The modular course Data Analysis with R, given in partnership with IMS and EEB, which started in the Fall of 2016, has over 100 registered students. In the 2017 Winter term, in addition to a repeat of the Physics graduate course, we are teaching parts of the 4th year Physics undergraduate Research Project course, and are delivering a new graduate course “Quantitative Applications for Data Analysis” in partnership with the Biological Sciences group at University of Toronto Scarborough.

The diversity of academic backgrounds of the students taking our courses can be seem in the following charts, broken down by faculty within the University of Toronto.

faculty_studenthours_distribution_scinet_teaching


SciNet courses in 2016

For-Credit Courses

Shorter Training Sessions
Scientific Computing for Physicists (40) PHY1610H

Intro to SciNet (80) 8x
Advanced Parallel Scientific Computing (5) PHY2109/AST3100

Intro to the Linux Shell (30) 3x

Data Analysis with R (105) MCS1010/PHY2109/AST3100/EEB

Research Data Management (14)

Research Computing with Python (54)

Intro to GPGPU Programming with CUDA (11)

Debugging (20)

Workshops (Full-day or longer)

Intro to SciNet and HPC (42)

Storage and I/O in Large Scale Scientific Projects (25)

Data Analysis with R (44)

Relational Database Basics (13)

Parallel R (45)

Programming Clusters with MPI (42)

Scientific Programming with Python (49)

Programming GPUs with CUDA (37)

Python for High Performance Computing (44)

Programming Shared Memory with OpenMP (36)

Scientific Visualization

New For-Credit Courses in 2017

Intro to Machine Learning

Quantitative Applications for Data Analysis EES1137H

Modules in the “Research Project Courses” PHY479Y

The numbers in parentheses reflect total registered attendees.
SciNet’s education site contains courses and course materials.

Collaborations in HPC education

Together with our partner consortia, SHARCNET and CAC, SciNet is involved in the annual Ontario Summer Schools in High Performance Computing. These schools provide attendees with opportunities to learn and share knowledge and experience in high performance and technical computing. Each of the three consortia organizes one week of summer school. In 2014, the number of unique attendees to the Toronto-based summer school was over 65. In addition, SciNet is involved in the tutorials in the annual Canadian HPC symposium, HPCS.
SciNet is also the local organizer of the 2015 International HPC summer school, to be held at the University of Toronto. This is a collaboration between Compute Canada (of which SciNet is a partner) and its US, European and Japanese counterparts. The demand from Canadian students was seven times larger than the number of available spots; further evidence for the demand for HPC education.

Assembling the "Goliath" Cluster

Some of the high school students here are cabling up what will be the “Goliath” cluster, a cluster of 3 old Pentium-4 desktops with 100Mb ethernet.

Community Outreach

SciNet’s education program extends beyond the university. Our outreach efforts include:

  1. Data centre tours.
  2. Bringing HPC experience to high schools (building mini-clusters, parallel programming,  …).
  3. The Teach the Teachers project, currently involving 12 teachers from 6 schools.
  4. The Big Data Challenge for High School Students (co-organizer and jury members).
  5. Participation in Science Rendezvous, an annual festival that takes science onto the  street.

Check our YouTube video presentation!