Posts from October 2013 (67)

October 3, 2013

People who bought this theory also liked…

An improved version of study that Stuff and StatsChat reported on more than a year ago has now appeared in print. The study found that people who have non-standard beliefs about the moon landings or Princess Diana’s death are also likely to have non-standard beliefs about climate change or health effects of tobacco. It improves on the previous research by using a reasonably representative online survey rather than a sample of visitors to climate debate blogs.

Mother Jones magazine in the US summarised some of the results in this graph of correlations

conspiracies6_2

 

That’s a horrible graph partly because, contrary to what the footnote says, correlations are not in fact restricted to be between 0 and 1, but between -1 and 1: and in fact the three correlations shown were negative in the research and have been turned around for more convenient display.

The title is misleading: only one of the six `conspiracist ideation’ questions was about 9/11, and it wasn’t a yes/no question, and it wasn’t really about it being an inside job (ie, performed by the government), but about the government allowing it to happen. In the same way, the other three variables aren’t simple yes/no questions, but scores based multiple questions, each on a 5-point scale.

A more-technical point is that correlations, while appropriate in the paper as part of their statistical model, aren’t really a good way to describe the strength of association.  It’s easier to understand the square of the correlation, which gives the proportion of variability in one variable explained by the other.  That is, the conspiracy-theory score explains about 25% of the variation in the vaccine score,  just over 1% of the variation in the GM Foods score, and just under 1% of the variation in the climate change score.

(via @zentree)

October 2, 2013

Cough, choke, history

If the PubMed research database is still surviving the US government shutdown, you can read a paper published 63 years ago today on lung cancer

In England and Wales the phenomenal increase in the
number of deaths attributed to cancer of the lung provides
one of the most striking changes in the pattern of

mortality recorded by the Registrar-General. For example,
in the quarter of a century between 1922 and 1947 the
annual number of deaths recorded increased from 612 to
9,287, or roughly fifteenfold. This remarkable increase is,
of course, out of all proportion to the increase of population

Some people were arguing that the increase was just due to better diagnosis of lung cancer, and even  those who believed in a real increase weren’t sure of the reason

Two main causes have from time to time been put forward:
(1) a general atmospheric pollution from the exhaust

fumes of cars, from the surface dust of tarred roads, and
from gas-works, industrial plants, and coal fires; and
(2) the smoking of tobacco.

Richard Doll and Austin Bradford Hill decided to compare histories of smoking in lung cancer patients and those in hospital for other reasons. As you know, they found that the lung cancer patients were much more likely to be heavy smokers. It’s also interesting to read what other possibilities they considered, and how they tried to rule them out.

This sort of study isn’t completely definitive, and, famously, the eminent statistician and geneticist (and heavy smoker) R. A. Fisher was never convinced. He thought that genetic factors might well be responsible. Further evidence was provided by experiments in animals (such the ‘smoking beagles‘ of Duke University) showed that smoking really could cause cancer. Also, much more recently, studies of twins and studies that actually measured genotypes showed that genetic differences weren’t a big enough contributor to lung cancer to explain the correlation.

In contrast to, say, alcohol or opium, tobacco has been a public health problem only for about a century: tobacco smoking became very widespread in men during the first world war. With a bit of effort and some luck, future generations might see it as an inexplicable historical anomaly, like a deadly version of canasta.

Data journalism links

The Data Journalism handbook online

This book is intended to be a useful resource for anyone who thinks that they might be interested in becoming a data journalist, or dabbling in data journalism….

Lamentably the act of reading this book will not supply you with a comprehensive repertoire of all if the knowledge and skills you need to become a data journalist. This would require a vast library manned by hundreds of experts able to help answer questions on hundreds of topics. Luckily this library exists and it is called the internet. Instead, we hope this book will give you a sense of how to get started and where to look if you want to go further. Examples and tutorials serve to be illustrative rather than exhaustive.

And one of the additional resources on the internet: Cathy O’Neil’s On Being a Data Skeptic.

ITM Cup Predictions for Round 8

Team Ratings for Round 8

Here are the team ratings prior to Round 8, along with the ratings at the start of the season. I have created a brief description of the method I use for predicting rugby games. Go to my Department home page to see this.

Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Canterbury 21.12 23.14 -2.00
Wellington 11.86 6.93 4.90
Auckland 10.86 9.02 1.80
Counties Manukau 4.08 4.36 -0.30
Waikato 2.13 5.25 -3.10
Tasman -0.16 -6.29 6.10
Hawke’s Bay -1.94 -6.72 4.80
Taranaki -3.03 3.92 -7.00
Otago -4.82 -4.44 -0.40
Bay of Plenty -5.20 -1.96 -3.20
Southland -7.78 -11.86 4.10
Northland -9.69 -8.26 -1.40
North Harbour -9.73 -7.43 -2.30
Manawatu -10.98 -8.97 -2.00

 

Performance So Far

So far there have been 54 matches played, 41 of which were correctly predicted, a success rate of 75.9%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Canterbury vs. Manawatu Sep 25 72 – 7 35.20 TRUE
2 Northland vs. Tasman Sep 26 13 – 28 -3.10 TRUE
3 Waikato vs. Wellington Sep 27 14 – 19 -5.30 TRUE
4 Otago vs. Southland Sep 28 32 – 38 10.00 FALSE
5 Manawatu vs. Taranaki Sep 28 12 – 6 -5.30 FALSE
6 Auckland vs. Canterbury Sep 28 39 – 19 -10.70 FALSE
7 Hawke’s Bay vs. North Harbour Sep 29 55 – 10 6.10 TRUE
8 Bay of Plenty vs. Counties Manukau Sep 29 30 – 31 -5.50 TRUE

 

Predictions for Round 8

Here are the predictions for Round 8. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Tasman vs. Waikato Oct 02 Tasman 2.20
2 Southland vs. Hawke’s Bay Oct 03 Hawke’s Bay -1.30
3 Manawatu vs. Northland Oct 04 Manawatu 3.20
4 Bay of Plenty vs. Waikato Oct 05 Waikato -2.80
5 North Harbour vs. Otago Oct 05 Otago -0.40
6 Wellington vs. Auckland Oct 05 Wellington 5.50
7 Canterbury vs. Counties Manukau Oct 06 Canterbury 21.50
8 Taranaki vs. Tasman Oct 06 Taranaki 1.60

 

Currie Cup Predictions for Round 9

Team Ratings for Round 9

Here are the team ratings prior to Round 9, along with the ratings at the start of the season. I have created a brief description of the method I use for predicting rugby games. Go to my Department home page to see this.

Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Sharks 4.69 3.24 1.50
Western Province 3.46 4.47 -1.00
Lions 1.00 -1.22 2.20
Blue Bulls 0.89 0.59 0.30
Cheetahs -3.15 -2.74 -0.40
Griquas -9.04 -6.48 -2.60

 

Performance So Far

So far there have been 24 matches played, 12 of which were correctly predicted, a success rate of 50%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Cheetahs vs. Western Province Sep 27 27 – 29 1.50 FALSE
2 Lions vs. Sharks Sep 27 25 – 31 6.00 FALSE
3 Griquas vs. Blue Bulls Sep 28 10 – 52 6.20 FALSE

 

Predictions for Round 9

Here are the predictions for Round 9. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Western Province vs. Lions Oct 04 Western Province 10.00
2 Blue Bulls vs. Sharks Oct 04 Blue Bulls 3.70
3 Griquas vs. Cheetahs Oct 05 Griquas 1.60

 

NRL Predictions for the Grand Final

Team Ratings for the Grand Final

Here are the team ratings prior to this week’s games, along with the ratings at the start of the season. I have created a brief description of the method I use for predicting rugby games. Go to my Department home page to see this.

Current Rating Rating at Season Start Difference
Roosters 13.52 -5.68 19.20
Sea Eagles 8.65 4.78 3.90
Storm 8.36 9.73 -1.40
Cowboys 6.60 7.05 -0.40
Rabbitohs 6.12 5.23 0.90
Knights 5.11 0.44 4.70
Bulldogs 2.16 7.33 -5.20
Titans 2.11 -1.85 4.00
Sharks 1.87 -1.78 3.70
Warriors -0.89 -10.01 9.10
Panthers -2.57 -6.58 4.00
Broncos -5.15 -1.55 -3.60
Dragons -8.18 -0.33 -7.90
Raiders -10.23 2.03 -12.30
Wests Tigers -11.37 -3.71 -7.70
Eels -19.87 -8.82 -11.00

 

Performance So Far

So far there have been 200 matches played, 121 of which were correctly predicted, a success rate of 60.5%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Rabbitohs vs. Sea Eagles Sep 27 20 – 30 -0.67 TRUE
2 Roosters vs. Knights Sep 28 40 – 14 4.01 TRUE

 

Predictions for the Grand Final

Here are the predictions for the Grand Final. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Roosters vs. Sea Eagles Oct 06 Roosters 4.90

 

October 1, 2013

Making up numbers

Bob Jones, in the Herald

But although the offenders are male, 99.999 per cent of men are not rapists and feel just as outraged as women do about it.

He’s obviously just pulled that number out of his head (or somewhere else round and inappropriate), since it would imply only about 20 male rapists in New Zealand. About 20 times that number were convicted and sent to prison just last year for sexual assault or aggravated sexual assault.

A lot of men have a hard time believing that rape is as common as it is — either because they can’t imagine doing it, or because they do it and don’t think it’s rape. But there are good-quality high response-rate surveys showing that lots of women have been raped, and if only a tiny minority of men are rapists, they have to be very busy.

Bob Jones is off by at least three, and quite likely more than four orders of magnitude. That must be close to a record.

(update)