Lying with
Statistics:
AKA Data Analysis
Gone Wrong
Let's start with
this example: Maybe you saw the
following (it was all over the internet in August 2013)
And now it's global COOLING! Record
return of Arctic ice cap as it grows by 60% in a year.Almost
a million more square miles of ocean covered with ice than
in 2012
WOW, RIGHT?
First, this statistic is
correct. The Arctic ice cap did grow 60%, which was indeed
almost a million square miles. But this says NOTHING about
possible global cooling, AND it conveys the wrong impression.
Here's why:
1. This is "Cherry Picked"
data. What they did was take the measure of the ice cap at
a specific point in time, namely that point in time when it had
reached it's highest level.
2. They left out the fact
that the ice cap melts and refreezes each year. So, of
course, climatologists would expect it to increase. The
question is not how much it increased but how much it increased
in comparison to other years. It didn't increase as much
in 2012 as it did in 2013, but that does not mean it is NOT
smaller than it used to be. Here is the appropriate
comparison graph that makes this point:
From this you
can see that, while the ice cap became smaller in 2012 than in
2013, in 2013 it is still a good bit smaller than it has been on average
for the past 31 years. Some might say that this does
support the notion of polar ice melt due to global warming, but
NO ONE (except someone wanting to mislead, perhaps even lie
using statistics) can say it supports the notion of global
cooling.
3. Always
question data provided in percentages. The quickest way we
can explain this is with the following example:
Imagine you have a 10 pound
bag of sand.
You pour out 5 pounds (half)
thereby reducing the bag's weight by 50%
You increase the bag's weight
by 50% (half of 5 pounds is 2.5 pounds, so that's how much sand
you put back in), bringing the bag's weight up to 7.5 pounds.
The point: Even though you
decreased the weight of the bag by 50%, then increased the
weight of the bag by 50%, it is still 25% lighter than its
heaviest weight.
Here's another example of misleading data presentation (taken from 1937 data).
Hmmm, looks
like a sudden and VERY BIG increase in government pay roll,
right?
The graph
above is actually accurate, as is the one below, BUT it doesn't
have a zero point.
Here's what
it looks like with a zero point:
Another example of the above:
So if you were investing in
the stock market, which of these two stocks would you buy, stock
A or stock B?
Well, as you no doubt have
guessed, they're both the same stock. The graph for Stock
"A" has no zero point.
Yet another example: How many people are
unemployed? 95 million?
Have you seen the various
numbers that have been bandied about on the internet? The
highest we've seen (as of December 2016) is about 95 million.
While this total is based on actual, verifiable data from the
Bureau of Labor Statistics (BLS) it is misleading (which is why
we've included it on our Lying with Statistics page).
So here's the deal, while
population data shows that around 95 million are not employed,
that number includes lots of people who likely aren’t looking
for work. It includes every American of retirement age -- 65 and
older. It includes every high-school student at least 16 years
of age. It includes every college and many graduate or
professional-school students currently enrolled and in classes.
It includes every person who has a disability that makes it
impossible for them to work. It includes parents who are
choosing to stay home to take care of their kids. It includes
every adult who’s gone back to school full-time. It even
includes trust-fund kids who are living off investments. While the following is from 2014, here's
a graph that shows
A final (for now) example (using made up names):
Perhaps
you've seen ads on TV telling how online dating
service "Meet Your Mate" has the most dates and marriages.
So, okay,
let's say that this is true. "Meet Your Mate" does
indeed have the most dates
and marriages of ALL online dating services.
But what they
left out is the fact that it is also the OLDEST such online
service, having been online twice as long as any other such
service. So OF COURSE it has the MOST in all categories.
What's wrong?
The RATE is more important than the total. What matters
his how many out of those who visit the site. In our
hypothetical example, it turns out that "Find Your Partner" has
a much higher rate of dates and marriages than "Meet Your
Mate," because one out of ten make dates and one out of a
hundred get married, compared to one out of fifteen and one out
of one hundred and fifty for "Meet Your Mate."
No
doubt you've heard it said, "Statistics don't lie. People
lie using statistics." That's what we're going to explore
here: Just how does this occur?
We
think
that sometimes it's by accident, but sometimes it's on purpose.
So we want to examine the ways to
tell whether data collection, reporting and analysis
can be valid, or invalid (i.e., legit, or not legit).
1. The primary tool for all
reality checking is critical thinking. It is so important
that we've devoted a whole article to it. So we're not
going to repeat it here. We're just going to tell you to
use it.
2. Always check to see
how much detail is provided, especially in terms of size of
samples and the breakdown (subsamples). For example,
saying that a survey shows 50% of the public approve, or
disapprove something is nearly useless. We need to know
who was asked (e.g., ages, ethnic background, etc.), how many
were asked (sample size), and what the actual questions were
asked. All these details can make a tremendous difference
in what the data means.
3. Remember that the
data (or statistics) may be valid, but if they are
cherry-picked, they can be useless.
4. Remember that even
if every aspect of the research methodology and data reporting
is valid, the interpretation of the conclusions are always
theoretical and thus debatable. Try to think of alternate
conclusions that might be drawn from the date.
5. Do a search to see
if there are alternate analyses and conclusions and apply
critical thinking to them as well.
6. Finally, if you know
anything about research methodology and the rules for validity,
apply them. If not, you can check for them online.
To
read about critical thinking.
To
see an online search concerning research methodology (will open
in new tab, or window)
To
go to the brief introduction to intentional efforts to obstruct, or avoid reality
To
go to the Articles Page
All written text on this website copyright
© Reality Check Online
|