Data Management & Organization

  • Big data analysis is explained in this online course that introduces the user to the tools Hadoop and Mapreduce. These tools allow for the parallel computing necessary to analyze large amounts of data.

    0
    No votes yet
  • This is a web application framework for R, in which you can write and publish web apps without knowing HTML, Java, etc. You create two .R files: one that controls the user interface, and one that controls what the app does. The site contains examples of Shiny apps, a tutorial on how to get started, and information on how to have your apps hosted, if you don't have a server.

    0
    No votes yet
  • This software allows you to extract data from published graphs. There is a web-based app and a downloadable version. First, you provide the software with a picture of the graph in question. Then you give it two points on the x-axis and two points on the y-axis for reference. Then you click on the points on the graph that you want to extract. The points are put into a .csv file.

    0
    No votes yet
  • This complete lesson plan, which includes assessments, is based upon a data set partially discussed in the article "Female Hurricanes are Deadlier than Male Hurricanes." The data set contains archival data on actual fatalities caused by hurricanes in the United States between 1950 and 2012. Students analyze and explore this hurricane data in order to formulate a question, design and implement a plan to collect data, analyze the data by measures and graphs, and interpret the results in the context of the original question.
    0
    No votes yet
  • A reference for analyzing large, complex data sets. Helpful for various levels of students.

    0
    No votes yet
  • DataFerrett is a unique data analysis and extraction tool -- with recoding capabilities -- to customize federal, state, and local data to suit your requirements. Using DataFerrett, you can develop an unlimited array of customized spreadsheets that are as versatile and complex as your usage demands. The DataFerrett helps you locate and retrieve the data you need across the Internet to your desktop or system, regardless of where the data resides. You can then develop and customize tables. Selecting your results in your table you can create a chart or graph for a visual presentation into an html page. Save your data in the databasket and save your table for continued reuse. The DataFerrett is a Beta testing version that will incorporate the latest bug fixes, enhancements, and new functionality that will be rolled into the DataFerrett after testing has been completed.

    0
    No votes yet
  • Beware of bugs in the above code; I have only proved it correct, not tried it. is a quote from American computer scientist Donald E. Knuth (1938 - ). The quote was written on March 22, 1977 as the last sentence of a five-page memo entitled "Notes on the van Emde Boas construction of priority deques: An instructive use of recursion."
    0
    No votes yet
  • A sketch by Anastasia Mandel reinterpreting Cattle by a Lake by Sidney Richard Percy (1862) with the statistical caption "A multimodal distribution with small outliers." This is part of a collection of sketches by Anastasia Mandel and their accompanying statistical captions discussed in the paper "How art helps to understand statistics" (Model Assisted Statistics and Applications, 2009) by Stan Lipovetsky and Igor Mandel in volume 4 pages 313-324. Free to use in classrooms and on course websites.

    0
    No votes yet
  • This applet is designed to allow users to explore the relationship between histograms and the most typical summary statistics. The user can choose from several types of histograms (uniform, normal, symmetric, skewed, etc.), or can create their own by manipulating the bars of the histogram. The statistics available for display are mean, median, mode, range, standard deviation, and interquartile range. Also available is a "Practice Guessing" option, in which the values of the statistics are hidden until the user has entered guesses for each value.
    0
    No votes yet
  • A cartoon to teach about understanding large error bars (e.g. caused by the effect of outliers). The cartoon is #9 in the "Life in Research" series at www.vadio.com. Free to use with attribution in the classroom or on course websites.
    0
    No votes yet

Pages

register