One Numerical Variable

  • This article provides a data collection and analysis activity for illustrating simple linear regression and outlier analysis. The activity was designed to involve students in the process of data collection and to motivate studying the relationship between two quantitative variables. Students collect data on occurrences of letters in English text. These data are used to study the relationships between how often a letter occurs in English text, and: (1) the letter's Morse Code units and (2) the relative frequency of Scrabbleä‹¢ game tiles for the letter. Worksheets and answers to the activities are provided.
    0
    No votes yet
  • This article describes a dataset containing information on 308 diamond stones, which is useful when studying concepts in multiple linear regression analysis. Key Words: Categorical variables; Data transformation; Standardized residuals.
    0
    No votes yet
  • The dataset presented in this article provides the salary and performance data for non-pitchers for the 1992 Major League Baseball season. Exploratory data analysis is used to determine a suitable regression model for the data. Key Words: Model selection and validation; Stepwise model selection.
    0
    No votes yet
  • This article describes a dataset containing monthly household electric billing charges for ten years. The data can be used to illustrate graphing, descriptive statistics, correlation, seasonal decomposition, a variety of smoothing methods, ARIMA models, forecasting, and multiple regression.
    0
    No votes yet
  • The dataset presented in this article contains body measurements for 252 men and can be used to illustrate multiple regression and to provide practice in model building.
    0
    No votes yet
  • The dataset presented in this article referes to game-by-game information for Mark McGwire and Sammy Sosa during summer of 1998. This data can be used to demonstrate graphical displays, categorical data analysis, analysis of variance, logistic regression, and smoothing methods for Poisson and binomial data.
    0
    No votes yet
  • The dataset described in this article contains data on retired Major League Baseball players, eligible for the MLB Hall of Fame. The data can be used to illustrate descriptive statistical methods (numerical, graphical, and tabular) or inferential statistics (hypothesis testing, confidence intervals, etc.). The data is in .dat format.
    0
    No votes yet
  • The datasets described in this article contain information for all National Football League (NFL)regular season and playoff games played from 1993 to 1996. In addition to game scores, the data give oddsmakers' pointspreads and over/under values for each game. Key Words: Predictions; Wagering.
    0
    No votes yet
  • This article describes a dataset of days in office of US Presidents with outliers that are not mistakes or unusually high or low observations. The data illustrate that outliers need not be errors but could be particularly interesting cases and that data displays may differ in their ability to reveal interesting data structure. Key Words: Inliers; Interpretation in context.
    0
    No votes yet
  • This article describes a dataset on the readability of booklets about cancer and the reading levels of patients with cancer. Students should be familiar with scales of measurement, data reduction, measuring center, constructing and interpreting displays, and reaching conclusions in real problems. Key Words: Ordinal data, Means, Medians, Histograms
    0
    No votes yet

Pages