Graduate students

  • When an experimenter is interested in the effects of two or more independent variables, it is usually more efficient to manipulate these variables in one experiment than to run a separate experiment for each variable. Moreover, only in experiments with more than one independent variable is it possible to test for interactions among variables.  Experimental designs in which every level of every variable is paired with every level of every other variable are called factorial designs. 

    0
    No votes yet
  • Within-subject designs are designs in which one or more of the independent variables are within-subject variables. Within-subjects designs are often called repeated-measures designs since within-subjects variables always involve taking repeated measurements from each subject. Within-subject designs are extremely common in psychological and biomedical research.

    0
    No votes yet
  • This chapter discusses a collection of tests called distribution-free tests, or nonparametric tests, that do not make any assumptions about the distribution from which the numbers were sampled. The main advantage of distribution-free tests is that they provide more power than traditional tests when the samples are from highly-skewed distributions. 

    0
    No votes yet
  • Measures of the size of an effect based on the degree of overlap between groups usually involve calculating the proportion of the variance that can be explained by differences between groups. This resource outlines different approaches to measuring this proportion.

    0
    No votes yet
  • As our economy, society, and daily life become increasingly dependent on data, work across nearly all fields is becoming more data driven, affecting both the jobs that are available and the skills that are required. At the request of the National Science Foundation, the National Academies of Sciences, Engineering, and Medicine were asked to set forth a vision for the emerging discipline of data science at the undergraduate level. The study committee considered the core principles and skills undergraduates should learn and discussed the pedagogical issues that must be addressed to build effective data science education programs. Data Science for Undergraduates: Opportunities and Options underscores the importance of preparing undergraduates for a data-enabled world and recommends that academic institutions and other stakeholders take steps to meet the evolving data science needs of students. 

     

    Watch the report release webinar here:  https://vimeo.com/269033724  

    0
    No votes yet
  • RStudio Cloud makes it easy for professionals, hobbyists, trainers, teachers and students to do, share, teach and learn data science using R.  Create analyses using RStudio directly from your browser - there is no software to install and nothing to configure on your computer.  Share your projects - and access those of others - without worrying about data transfer or package installation. Each project defines its own environment, and RStudio Cloud automatically reproduces that environment whenever anyone accesses the project.  It’s easy to share analyses with the world - but it’s also simple to collaborate with a select group in a private space. You control who can enter a space - and via roles, you have fine grained control over what each user can do.  There are also many learning materials available: interactive tutorials covering the basics of data science, cheatsheets for working with popular R packages, links to Datacamp courses, and a guide to using RStudio Cloud.

    0
    No votes yet
  • Gapminder seeks to educate all on the importance of "factfulness" and of knowing and contextualizing the statistics that describe the state of our world.  These data sets provide facts from across the globe such as average income, life expectancy, energy use, education levels, and much more.

    5
    Average: 5 (1 vote)
  • CATS, established in 1978, promotes the statistical sciences, statistical education, statistics applications, and related issues affecting the statistics community. The mission and scope of CATS evolved over time as interdisciplinary collaboration increasingly shaped the character of scientific research. After a brief hiatus, CATS was reconstituted in 2011 and has since focused on improving the visibility and practice of statistics within government agencies not well connected to statistics, increasing attention to statistical issues of big data and data science, and helping agencies identify bottlenecks impairing their analysis capabilities. Its multidisciplinary members are experts from statistics and related fields and leaders in diverse areas of interdisciplinary research, including the analysis of large-scale data, computational biology and bioinformatics, spatial data, environmental science, neuroscience, health care policy, and complex computer experiments.

    0
    No votes yet
  • This is the free online textbook for the Foundations of Data Science class at UC Berkeley for the Data 8 Project. Creators have used https://github.com/data-8/textbook to maintain this textbook (an open source project that allows for continual easy editing and maintenance).

    0
    No votes yet
  • This UC Berkeley Foundations of Data Science course combines three perspectives: inferential thinking, computational thinking, and real-world relevance. Given data arising from some real-world phenomenon, how does one analyze that data so as to understand that phenomenon? This course teaches critical concepts and skills in computer programming and statistical inference, in conjunction with hands-on analysis of real-world datasets, including economic data, document collections, geographical data, and social networks. It delves into social issues surrounding data analysis such as privacy and design.

    0
    No votes yet

Pages