Data ingestation, data collection, and data analysis: key components in the statistics and data science analysis cycle


Tuesday, March 23rd, 20214:00 pm – 5:00 pm

Presented by: Mine Dogucu (UC Irvine) & Albert Y. Kim (Smith College)


Abstract

In the March CAUSE/Journal of Statistics and Data Science Education webinar series we will discuss two related papers on data ingestation, data collection, and data analysis.  

Mine Dogucu (UC Irvine) will discuss her paper "Web Scraping in the Statistics and Data Science Curriculum: Challenges and Opportunities" (https://github.com/mdogucu/web-scrape).

Albert Y. Kim (Smith College) will discuss his paper "'Playing the Whole Game': A Data Collection and Analysis Exercise With Google Calendar" (https://smithcollege-sds.github.io/sds-www/JSE_calendar.html)

Mine Dogucu is an Assistant Professor of Teaching in the Department of Statistics at University of California Irvine. Her work focuses on modern pedagogical approaches in the statistics curriculum, making data science education accessible, and undergraduate Bayesian education. She is the coauthor of the upcoming book Bayes Rules! An Introduction to Bayesian Modeling with R. She co-chairs the Undergraduate Statistics Project Competition and the Electronic Undergraduate Statistics Research Conference (USPROC+eUSR). She shares her thoughts about data science education on her Data Pedagogy blog.

Albert Kim is an Assistant Professor of Statistical & Data Sciences at Smith College as well as a Visiting Scholar at the ForestGEO network's Smithsonian Conservation Biology Institute (SCBI) large forest dynamics plot. His research centers on forest ecology, in particular modeling the impact of climate change on the growth of trees as well as ecological forecasting. He is a co-author of "Statistical Inference via Data Science: A ModernDive into R and the Tidyverse" (see moderndive.com).


Recording