Lisa Strug and CANSSI Ontario: Building a data science community

November 3, 2020 by Chris Sasaki - A&S News

“With the volume of data exploding across all fields, the timing for CANSSI Ontario is fantastic,” says Lisa Strug.

In September 2019, Strug became the director of CANSSI Ontario, the Ontario Regional Centre of the Canadian Statistical Sciences Institute.

CANSSI is a national institute launched in 2012 with the mission to support and enhance research and training in data science. CANSSI Ontario was established as a multidisciplinary unit in July 2019 — with the Faculty of Arts & Science as its academic host — with the goal of building statistical and data science capabilities throughout the province, university and the Faculty.

“At CANSSI Ontario, we're developing programs to support multidisciplinary activity that bring researchers together to create new research relationships and solve problems in specific domains,” says Strug.

“We're creating a data science community and integrating new domains with data science methodology. Together, we’re learning about the role of data science in public policy, in Indigenous data sovereignty, in biomedical science, in the criminal justice system and more.”

In addition to being CANSSI Ontario’s director, Strug is a professor in the Faculty’s Departments of Statistical Sciences and Computer Science. She is also a senior scientist and associate director of the Centre for Applied Genomics at the Hospital for Sick Children.

Her research focuses on the development of statistical methods to identify genetic contributors to complex traits in diseases such as cystic fibrosis and epilepsy and her team is translating genetic discoveries into improved diagnostics and personalized therapies. Earlier this year, she and collaborators began a new research project to identify genetic variation in patients with COVID-19 that might explain why the disease affects people differently.

Arts & Science News spoke with Strug about CANSSI Ontario.

Why in the era of big data is CANSSI Ontario’s emphasis on data-driven research so important?

The sheer volume of data — in the humanities, social sciences, sciences — means that new tools need to be developed to handle and optimize the use of this data, or existing tools used in other domains need to be repurposed. We need algorithmic computational techniques and classical statistical techniques. And we need researchers from different disciplines — with their domain-specific knowledge who know the important problems that need solving — working together. Tools need to be developed that are tailored to the big global challenges of today and individuals need to be trained in how to use these tools.

CANSSI Ontario also promotes an interdisciplinary, multidisciplinary collaborative approach. How are you doing this?

One great success story is the Strategic Training for Advanced Genetic Epidemiology program, or STAGE, a successful multidisciplinary training program that has been in existence since 2009. STAGE trains graduate students, postdocs and early-career faculty who want to re-tool and train in the field where genetics, epidemiology and statistics meet. It offers training in the use and development of quantitative tools for genomic data. The program existed before CANSSI but over the last few years has been unfunded. Now, CANSSI Ontario has adopted it and it’s funded by the Faculty of Arts & Science and is a collaboration with the Dalla Lana School of Public Health.

STAGE provides trainees with multiple mentors from different disciplines — biomedical science, statistics, epidemiology, computer science — who come together as a team to help train individuals in a methods development and/or application project.

In fact, STAGE has been used as a model to implement a new multidisciplinary doctoral program that we've implemented in the Faculty of Arts & Science this past year. This is a pilot program with the Department of Statistical Sciences and integrates with any other unit across the University where a doctoral student in statistics is co-trained with a faculty member in another department or field using the STAGE model. And this program goes beyond statistical genetics and genetic epidemiology; our first cohort of students includes co-mentorship between astrophysics and statistics, as well as biomedical science and statistics. It's very exciting to see this new multidisciplinary program evolve, modeled after the successes of STAGE but now open to all domain areas.

What sort of funding support does CANSSI Ontario provide?

One of our mandates is to remove barriers to accessing large, publicly available datasets that could benefit the research of our affiliated faculty and trainees. Several data sources have costs associated with accessing or computing with them and in some cases those costs have become prohibitive. So we started a data-access grant program that removes that barrier by offering to cover those costs.

Funding support also comes in the form of awards. For example, in partnership with the Banting Research Foundation, we have the Banting-CANSSI Ontario Discovery Award in Data Science. It’s a one-year grant of up to $25,000 and it’s designed to encourage new faculty to be engaged in multidisciplinary, data-driven health and biomedical research.

And we’re very proud that in 2020, it was awarded to Dylan Kobsar from McMaster University to develop computational methodologies to analyze human movement which will in turn have applications in wearable sensors for managing osteoarthritis.

How do you feel about your first year as director of CANSSI Ontario?

I've really enjoyed the role. For the past 15 years, I’ve been focused on data science in genomics but now I have the opportunity to learn about the data-driven research in other disciplines across the Faculty and I’m learning a ton.

I'm enjoying speaking to different communities and understanding what their needs are, so we can then work to develop programs and activities to support them and facilitate the work they are doing to solve some of society’s most challenging problems.

The timing for CANSSI Ontario is right, and the future is very bright. We need this initiative to bring people together to share ideas and to share know-how.