Introduction

Today’s reading describes 10 different ways that statistical information can be used to mislead. For your reference, these 10 points are summarized below:

  1. Reading too much into correlations (ignoring third variables/confounding)
  2. Assuming relationships are constant over time (or over another dimension)
  3. Not considering the scale or axes of a chart
  4. Not considering the sample size (amount of data)
  5. Not considering a variety of different descriptive statistics
  6. Not considering limitations of the descriptive statistics used (ie: mean vs. median)
  7. Presenting information without a frame of reference
  8. Biased data collection
  9. Appeals to authority (ie: famous scientist as an author, study commissioned by Harvard, etc.)
  10. Focusing too much on one number

The aim of the following activity activity is for you to better understand these ideas by trying some of them out for yourself. That said, not all of them are applicable to the NYPD data or the graphics that can be generated by the two R Shiny apps we’ll use today.

Questions 1-5 will provide some practice looking at graphics that could be used as misleading evidence for certain arguments, and Question 6 will give you an opportunity to craft your own misleading argument.

\(~\)

Activity

Open the R Shiny app we used in our previous meeting:

For your reference, here is the link to more information about the app and the variables it displays: https://stat2labs.sites.grinnell.edu/Handouts/nypd/NYPDVariableDescriptions.pdf

\(~\)

For the following questions we’ll use a different R Shiny app to continue looking at the NYPD stop-and-frisk data.

This app uses the exact same underlying data as the previous app, but it uses spatial information to display the data at precinct level on a choropleth map. In this app you can click on a precinct to show a pop-up with additional information.

\(~\)

Assignment

Question #6: Your task is now to craft a misleading argument using a graphic created by one of the two R Shiny apps as evidence/support. You should intentionally try to exploit one or more of the ways that statistical information can be presented in a misleading manner as summarized in the introduction. You are welcome to borrow ideas from Questions 1-5 when coming up with your argument, or you may pursue something entirely different.

When you are finished with Questions 1-6, have one partner upload your responses to P-web.