A weekly hour-long discussion where academia and industry come together to share best practices for the initial data-centric steps in any applied statistics project. Thursdays at 4:30-5:30pm (Toronto time) via Zoom. All welcome! Sign up here.
The Toronto Data Workshop (TDW) brings together academia and industry to consider, collate, share, and disseminate best practices in the initial data-centric steps of a data science project: collection; cleaning; storage; retrieval; dissemination; and protection. We meet weekly for an hour and aim to have a roughly even split of participants from academia and industry over the course of each term. For an invitation please sign up here. Anyone is welcome to attend - you don’t need to be affiliated with the university.
The current organising committee is: Faria Khandaker, Kelly Lyons, and Rohan Alexander. The TDW is a joint initiative between the Faculty of Information and the Department of Statistical Sciences at the University of Toronto and we especially thank Dean Wendy Duff and Chair Radu Craiu for their support.
Thanks to Paul Hodgetts for the Jays-inspired sticker.
Date | Speaker | Topic | Recording |
---|---|---|---|
Thu 14 Jan, 4:30-5:30pm | Andrew Miles, University of Toronto (jointly hosted with the UTM Collaborative Digital Research Space.) | Code, plots, and values | https://youtu.be/mdjOoKT-f7E |
Wed 20 Jan, 4:30-5:30pm | Zia Babar, University of Toronto | Derivative data security. | https://youtu.be/fdVZqvECXXQ |
Thu 28 Jan, 4:30-5:30pm | Irene Duah-Kessie, University of Toronto | Bias and fairness in health. | |
Thu 4 Feb, 4:30-5:30pm | Kathy Ge, Uber | Experimentation and product design. | |
Thu 11 Feb, 4:30-5:30pm | Garrick Aden-Buie, R Studio | TBA | |
Thu 18 Feb, 4:30-5:30pm | Annie Collins, Asel Kushkeyeva, Haoluan Chen, Isaac Ehrlich, Mariam Walaa, Marija Pejcinovska, Mathew Wankiewicz, Michael Chong, Paul Hodgetts, Rohan Alexander, Samantha-Jo Caetano, Shirley Deng, and Yena Joo, University of Toronto | DoSS toolkit launch. | |
Thu 25 Feb, 9:00-5:30pm | Danielle Smalls-Perkins, Google, Fiona Fidler, University of Melbourne, Garret Christensen, US FDIC, Jake Bowers, University of Illinois & The Policy Lab, Lauren Kennedy, Monash University, Mauricio Vargas, Catholic University of Chile & Nicolas Didier, Arizona State University, Mine Çetinkaya-Rundel, University of Edinburgh, Radu Craiu, University of Toronto, Riana Minocher, Max Planck Institute for Evolutionary Anthropology, Shiro Kuriwaki, Harvard University, Tiffany Timbers, University of British Columbia, Tom Barton, Royal Holloway, University of London, Tyler Girard, University of Western Ontario, Wendy Duff, University of Toronto, Wijdan Tariq, University of Toronto, Yanbo Tang, University of Toronto | Toronto Data Workshop on Reproducibility | |
Fri 26 Feb, 9:00-5:30pm | Amber Simpson, Queens University, Andrés Cruz, Pontificia Universidad Católica de Chile, Annie Collins, University of Toronto, Emily Riederer, Capital One, Eva Vivalt, University of Toronto, Florencia D’Andrea, National Institute of Agricultural Technology, Heidi Seibold, Helmholtz AI Cooperation Unit, John Blischak, Freelance scientific software developer, John McLevey, University of Waterloo, Julia Schulte-Cloos, LMU Munich, Monica Alexander, University of Toronto, Nancy Reid, University of Toronto, Nick Radcliffe, University of Edinburgh, Ryan Briggs, University of Guelph, Sharla Gelfand, Freelance R Developer, Shemra Rizzo, Genentech, Simeon Carstens, Tweag/IO | Toronto Data Workshop on Reproducibility | |
Thu 4 Mar, 4:30-5:30pm | Petros Pechlivanoglou, The Hospital for Sick Children (SickKids) Research Institute | Simulation and retrospective data for health economic decision making. | |
Thu 11 Mar, 4:30-5:30pm | Lucas Cherkewski, Canadian Digital Service | Using publicly-available data. | |
Thu 18 Mar, 4:30-5:30pm | Sofia Ruiz Suarez, National University of Comahue | Animal tracking data. | |
Thu 25 Mar, 4:30-5:30pm | Alex Cookson, Royal Canadian Mint | The power of great datasets. | |
Thu 1 Apr, 4:30-5:30pm | Vik Pant, Natural Resources Canada | Integrating science & policy through DS & AI. | |
Thu 8 Apr, 4:30-5:30pm | Faria Khandaker, University of Toronto | Object oriented process mining. | |
Thu 15 Apr, 4:30-5:30pm | Emily A. Sellars, Yale University | Data issues in Mexican demographic history. | |
Thu 22 Apr, 4:30-5:30pm | Aimee Schwab-McCoy, Creighton University, Ashley Juavinett, UC San Diego, Samantha-Jo Caetano, University of Toronto | Panel on teaching data-focused topics. |
Thanks to Hidaya Ismail for the brilliant maple leaf and dinosaur hex stickers.
Date (Toronto time) | Speaker | Topic | Recording |
---|---|---|---|
Thu, 3 Sep, 4-5pm | Erik Drysdale (The Hospital for Sick Children) | Using hospital data | |
Tue, 8 Sep, 3:30-4:30pm | Sophie Bennett (Industry data scientist) | UK A levels algorithm issues (jointly hosted with SRI) | |
Thu, 10 Sep, 4-5pm | A Mahfouz, Diego Mamanche Castellanos, Hidaya Ismail, Ke-Li Chiu & Paul Hodgetts (University of Toronto) | Various R packages and research developed by students | |
Thu, 17 Sep, 4-5pm | Amber Simpson (Queen’s University) | Cancer and AI | |
Thu, 24 Sep, 4-5pm | Chelsea Parlett-Pelleriti (Chapman University) | Talking to non-statisticians about statistics | |
Thu, 1 Oct, 4-5pm | Florence Vallée-Dubois (Université de Montréal) | Canadian demographics by riding (1991-2015) | |
Thu, 8 Oct, 4-5pm | Yim Register (University of Washington Data Lab) | Self-advocacy within machine learning systems | |
Thu, 22 Oct, 4-5pm | Jeff Waldman, Leanne Trimble, Leslie Barnes, & Lisa Strug (University of Toronto) | Panel on data-focused resources at U of T | |
Thu, 29 Oct, 4-5pm | Fei Chiang (McMaster University) | Data currency and applications | |
Thu, 5 Nov, 4-5pm | Andrew Whitby (Industry data scientist) | Censuses | |
Mon, 9 Nov, 4-5pm | Tom Cardoso (Globe and Mail) | Bias Behind Bars | |
Thu, 12 Nov, 4-5pm | Kevin Armstrong (University of Toronto) | Measuring poverty for NGOs | |
Thu, 19 Nov, 4-5pm | Michael Chong (University of Toronto) | High-throughput Bayesian modelling workflow | https://youtu.be/xM1vf_KT76g |
Thu, 26 Nov, 4-5pm | Postponed | ||
Thu, 3 Dec, 5-6pm | Monica Alexander (University of Toronto) | Using Facebook advertising data to estimate migration | https://youtu.be/xM1vf_KT76g |
Thu, 10 Dec, 4-5pm | Shabrina Mardevi (United Nations Population Fund & University of Toronto) & Romesh Silva (United Nations Population Fund) | Population data estimation | https://youtu.be/kfmKusnGDLI |
Thu, 17 Dec, 4-5pm | Liza Bolton (University of Toronto), Maria Tackett (Duke University), Nathalie Moon (University of Toronto), Teon Brooks (Mozilla Firefox) | Panel on teaching data-focused topics | https://youtu.be/c3R6pZisvm0 |
arxivdl
, aRianna
, cesR
, and more!pointblank
, which is an R package that allows workflows involving nice and easy data validation in reproducible documents.If you would like to receive invitations to the series, then you can subscribe here.