Toronto Data Workshop

A weekly hour-long discussion where academia and industry come together to share best practices for the initial data-centric steps in any applied statistics project. Thursdays at 4:30-5:30pm (Toronto time) via Zoom. All welcome! Sign up here.


The Toronto Data Workshop (TDW) brings together academia and industry to consider, collate, share, and disseminate best practices in the initial data-centric steps of a data science project: collection; cleaning; storage; retrieval; dissemination; and protection. We meet weekly for an hour and aim to have a roughly even split of participants from academia and industry over the course of each term. For an invitation, please sign up here. Anyone is welcome to attend; you don’t need to be affiliated with the university.

The current organising committee is: Faria Khandaker, Kelly Lyons, and Rohan Alexander. The TDW is a joint initiative between the Faculty of Information and the Department of Statistical Sciences at the University of Toronto and we especially thank Dean Wendy Duff and Chair Radu Craiu for their support.

Winter 2021

Thanks to Paul Hodgetts for the Jays-inspired sticker.

Date Speaker Topic Recording
Thu 14 Jan, 4:30-5:30pm Andrew Miles, University of Toronto (jointly hosted with the UTM Collaborative Digital Research Space) Code, plots, and values.
Wed 20 Jan, 4:30-5:30pm Zia Babar, University of Toronto Derivative data security.
Thu 28 Jan, 4:30-5:30pm Irene Duah-Kessie, University of Toronto Bias and fairness in health.
Thu 4 Feb, 4:30-5:30pm Kathy Ge, Uber Experimentation and product design.
Thu 11 Feb, 4:30-5:30pm Garrick Aden-Buie, RStudio Using R Markdown.
Mon 15 Feb, Noon-1:00pm Emily Riederer, Capital One Observational causal inference.
Thu 18 Feb, 4:30-5:30pm Annie Collins, Haoluan Chen, Isaac Ehrlich, Mariam Walaa, Marija Pejcinovska, Mathew Wankiewicz, Michael Chong, Paul Hodgetts, Rohan Alexander, Samantha-Jo Caetano, Shirley Deng, and Yena Joo, University of Toronto DoSS toolkit launch.
Thu 25 Feb, 9:00am-5:30pm Various Toronto Workshop on Reproducibility See here.
Fri 26 Feb, 9:00am-5:30pm Various Toronto Workshop on Reproducibility See here.
Thu 4 Mar, 4:30-5:30pm Petros Pechlivanoglou, The Hospital for Sick Children (SickKids) Research Institute Simulation and retrospective data for health economic decision making.
Thu 11 Mar, 4:30-5:30pm Lucas Cherkewski, Canadian Digital Service Using publicly-available data.
Mon 15 Mar, 4:00-5:00pm Todd Feathers, Freelance reporter Algorithmic fairness in universities. (jointly hosted with Maryclare Griffin)
Thu 18 Mar, 4:30-5:30pm Sofia Ruiz Suarez, National University of Comahue Animal tracking data.
Thu 25 Mar, 4:30-5:30pm Alex Cookson, Muse The power of great datasets.
Thu 1 Apr, 4:30-5:30pm Vik Pant, Natural Resources Canada Integrating science & policy through DS & AI.
Thu 8 Apr, 4:30-5:30pm Faria Khandaker, University of Toronto Mining Process Models from Email Data.
Thu 15 Apr, 4:30-5:30pm Emily A. Sellars, Yale University Data issues in Mexican demographic history.
Thu 22 Apr, 4:30-5:30pm Aimee Schwab-McCoy, Creighton University, Ashley Juavinett, UC San Diego, Chris Papalia, St. Andrew’s College, Samantha-Jo Caetano, University of Toronto Panel on teaching data-focused topics.

Fall 2020

Thanks to Hidaya Ismail for the brilliant maple leaf and dinosaur hex stickers.

Date (Toronto time) Speaker Topic Recording
Thu, 3 Sep, 4-5pm Erik Drysdale (The Hospital for Sick Children) Using hospital data
Tue, 8 Sep, 3:30-4:30pm Sophie Bennett (Industry data scientist) UK A levels algorithm issues (jointly hosted with SRI)
Thu, 10 Sep, 4-5pm A Mahfouz, Diego Mamanche Castellanos, Hidaya Ismail, Ke-Li Chiu & Paul Hodgetts (University of Toronto) Various R packages and research developed by students
Thu, 17 Sep, 4-5pm Amber Simpson (Queen’s University) Cancer and AI
Thu, 24 Sep, 4-5pm Chelsea Parlett-Pelleriti (Chapman University) Talking to non-statisticians about statistics
Thu, 1 Oct, 4-5pm Florence Vallée-Dubois (Université de Montréal) Canadian demographics by riding (1991-2015)
Thu, 8 Oct, 4-5pm Yim Register (University of Washington Data Lab) Self-advocacy within machine learning systems
Thu, 22 Oct, 4-5pm Jeff Waldman, Leanne Trimble, Leslie Barnes, & Lisa Strug (University of Toronto) Panel on data-focused resources at U of T
Thu, 29 Oct, 4-5pm Fei Chiang (McMaster University) Data currency and applications
Thu, 5 Nov, 4-5pm Andrew Whitby (Industry data scientist) Censuses
Mon, 9 Nov, 4-5pm Tom Cardoso (Globe and Mail) Bias Behind Bars
Thu, 12 Nov, 4-5pm Kevin Armstrong (University of Toronto) Measuring poverty for NGOs
Thu, 19 Nov, 4-5pm Michael Chong (University of Toronto) High-throughput Bayesian modelling workflow
Thu, 26 Nov, 4-5pm Postponed
Thu, 3 Dec, 5-6pm Monica Alexander (University of Toronto) Using Facebook advertising data to estimate migration
Thu, 10 Dec, 4-5pm Shabrina Mardevi (United Nations Population Fund & University of Toronto) & Romesh Silva (United Nations Population Fund) Population data estimation
Thu, 17 Dec, 4-5pm Liza Bolton (University of Toronto), Maria Tackett (Duke University), Nathalie Moon (University of Toronto), Teon Brooks (Mozilla Firefox) Panel on teaching data-focused topics

Summer 2020

Winter 2020

Fall 2019

If you would like to receive invitations to the series, then you can subscribe here.