Posts from April 2013 (67)

April 26, 2013

Life expectancy doesn’t mean that

Tony Cooper has nominated a Bloomberg statistic (being reprinted in NZ) on life expectancy for Stat of the Week.

The `Sunset Index‘ purports to be the average number of years of life after you stop working, with figures ranging from 23.44 for Singapore to 1.49 for Nigeria. New Zealand is somewhere in the middle, with an index of 15.98.  It really isn’t credible that Nigerians who leave the workforce at age 50 die an average of 18 months later, so what have they done wrong?

Bloomberg have calculated life expectancy at birth, and subtracted the retirement age, but if you reach retirement, you’ve already avoided dying for a long time.  The life expectancy at retirement could be quite different from life expectancy at birth.  Since this difference is likely to vary between countries, the `Sunset Index’ won’t even be correct in relative terms.

So, how bad does it get?  If you look at life expectancy data for Nigeria you see that, indeed, life expectancy at birth is about 50 years, but that life expectancy at age 50 is 70.7 years for men and 72.6 for women. The true `Sunset Index’ value would be about 21, and Bloomberg are off by a couple of decades.

The error is less severe in other countries: infant and child mortality has a big impact on life expectancy at birth,  and in Nigeria about one child in seven dies before age 5. Here are a few corrected values for the Sunset Index

  • Singapore: 25
  • Nigeria: 21
  • Iran: 21
  • New Zealand: 20
  • USA: 16.5
  • Bangladesh: 13

The US is near the bottom of the corrected index because it combines a late retirement age (by Bloomberg’s definition — full Social Security eligibility) with only moderately good life expectancy.

That TV3 poll on racism? Bogus!

The Herald today ran this story claiming that people think New Zealand is a racist country, based on the results of a survey run  for TV3’s new show The Vote. Viewers voted through Facebook, Twitter, The Vote website or by text.

I haven’t watched The Vote, but I would like to know whether its journalist presenters, presumably fans of accuracy, point out that such self-selecting  polls are unscientific – the polite term for bogus. The best thing you can say is that such polls allow viewers to  feel involved.

But that’s not a good thing if claims made as a  result of these polls lead to way off-beam impressions being planted in the public consciousness; that’s often the way urban myths are born and prejudice stoked.

I’m not saying that racism doesn’t exist in New Zealand, but polls like this  offer no insight into the issue or, worse, distort the truth.

It’s disappointing to see the Herald, which still, presumably, places a premium on accuracy, has swallowed The Vote press release whole, without  pointing out its shortcomings or doing its homework to see what reliable surveys exist. TV3 must be very pleased with the free publicity, though.

April 25, 2013

Internet searches reveal drug interactions?

The New York Times has a story about finding interactions between common medications using internet search histories.  The research, published in the Journal of the American Medical Informatics Association, looks at search histories containing searches for two medication names and also for possible symptoms.  For example, their primary success was finding that people who searched for information on paroxetine (an antidepressant) and pravastatin (a cholesterol-lowering drug) were more likely to search for information on a set of symptoms that can be caused by high blood sugar.  These two drugs are now known to interact to cause high blood sugar in some people, although this wasn’t known at the time the internet searches took place.

This approach is promising, but like so many approaches to safety of medications it is limited by the huge number of possibilities.  The researchers knew where to look: they knew which drugs to examine and which symptoms to follow. With the thousands of different medications, leading to millions of possible interacting pairs and dozens or hundreds of sets of symptoms it becomes much harder to know what’s going on.

Drug safety is hard.

Infographic of the week

Every so often, someone comes up with a creative way to make pie charts less informative.  This week’s innovation comes to you from Wired magazine.

explodedpie

Note that it’s structured like a bar chart, except that all the `bars’ are the same height, and the wedges are turned at different angles, to make the widths harder to estimate.  The numbers are presented as if their heights mean something, but actually not.

There are also some subtleties to the design.  For example, at first glance you might think the left-to-right order of the wedges reflects the time period each one corresponds to, so that the fact they aren’t largest to smallest means something. Sadly, no.

(via @acfrazee and @kwbroman)

An exam with cheating allowed

Statistical decision theory is about making decisions in the presence of uncertainty. We can’t know everything, but we still need to make choices.  In decision theory we assume that the world isn’t out to get us — if cigarette smoke is toxic, it is so regardless of whether or not we study it, and whether or not we’re trying to stamp it out. Murphy’s Law is true, but only as an engineering design principle, not a fact about the malevolence of Nature.

Game theory is the evil twin of decision theory — it’s about making choices in the presence of competition, when the other players aren’t precisely out to get you, but they are out to do the best for themselves.  There are a few examples of game theory in medical statistics: how do you set up regulations so that making effective drugs is more profitable than making ineffective ones? how do you use new antibiotics, given that resistance will inevitably develop? Typically, though, game theory works best in ecology, where natural selection ensures that organisms behave as if they were trying to maximise their numbers of descendants given the behaviour of other organisms.

A UCLA professor teaching a course in behavioural ecology decided to try to make his students really appreciate the problems of cooperation and competition that arise in game theory:

A week before the test, I told my class that the Game Theory exam would be insanely hard—far harder than any that had established my rep as a hard prof.  But as recompense, for this one time only, students could cheat. They could bring and use anything or anyone they liked, including animal behavior experts. (Richard Dawkins in town? Bring him!) They could surf the Web. They could talk to each other or call friends who’d taken the course before. They could offer me bribes. (I wouldn’t take them, but neither would I report it to the Dean.) Only violations of state or federal criminal law such as kidnapping my dog, blackmail, or threats of violence were out of bounds.

 

April 24, 2013

Age-period-cohort

Stuff has a story using data from the NZ General Social Survey.  The data say that people 15-29 are more likely to feel lonely than older people.  The story says that this is a generational change caused by Facebook (they put it a bit less baldly, but that’s the message).

When you find differences between age groups, there are two broad classes of explanations. Epidemiologists call them “age” and “cohort” effects.  An age effect is actually due to age: teenagers today argue with their parents more than 40 year olds do, because that’s what teenagers are like.  Toddlers read less than adults, because they haven’t learned yet. When today’s teenagers get older they will argue with their parents less than they now do; when today’s toddlers get older they will read more than they now do.

Cohort effects are common to a group of people born at the same time.  For example, older New Zealanders are more likely to regard Anzac Day as very important, but we wouldn’t expect today’s teenagers to develop a greater appreciation for it as they get older.  Older people also, on average, have less formal education than 25-35 year olds, but again, todays young Bachelors and Masters graduates are not going to lose their degrees as they age.

So, is the difference in feelings of loneliness between 15-29 year olds and older people an age effect or a cohort effect?  Stuff seems pretty sure it’s a cohort effect — “Generation Net” — but the data say absolutely nothing one way or the other.  Since the NZ General Social Survey started in 2009, there isn’t enough historical data to answer the question, and the US General Social Survey (which has been going since 1972) doesn’t routinely ask a ‘loneliness’ question. The British Social Attitudes Survey, as is so often the case with UK publically-funded data collection, won’t let you see much without becoming a registered user.

There are good reasons to be skeptical about the cohort interpretation.  In 1995, well before social media, Robert Putnam wrote an essay, Bowling Alone, about a rapid decline in social connectedness in the US.  And in the 1920s, the Middletown studies blamed the same sort of changes on the new technologies of radio, film, and automobiles.

However tempting it is to say that the kids these days are different, you do actually need some evidence. More evidence than their constant facebooking and twittering, and showing no respect, and look at what they wear and their music is just noise and they need to get off your lawn.

 

 

[update: The Herald has a more balanced story]

Note with worrying statistics at Pakiri horse riding

fiftypercentrule

NRL Predictions, Round 7

Team Ratings for Round 7

Here are the team ratings prior to Round 7, along with the ratings at the start of the season. I have created a brief description of the method I use for predicting rugby games. Go to my Department home page to see this.

Current Rating Rating at Season Start Difference
Storm 11.58 9.73 1.90
Sea Eagles 8.88 4.78 4.10
Rabbitohs 5.06 5.23 -0.20
Cowboys 3.66 7.05 -3.40
Roosters 2.59 -5.68 8.30
Broncos 2.01 -1.55 3.60
Knights 1.91 0.44 1.50
Bulldogs -0.23 7.33 -7.60
Titans -0.27 -1.85 1.60
Dragons -1.21 -0.33 -0.90
Sharks -2.34 -1.78 -0.60
Raiders -4.25 2.03 -6.30
Wests Tigers -4.92 -3.71 -1.20
Panthers -7.12 -6.58 -0.50
Eels -9.18 -8.82 -0.40
Warriors -9.91 -10.01 0.10

 

Performance So Far

So far there have been 48 matches played, 32 of which were correctly predicted, a success rate of 66.67%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Broncos vs. Cowboys Apr 12 12 – 10 3.06 TRUE
2 Roosters vs. Bulldogs Apr 12 38 – 0 -0.36 FALSE
3 Knights vs. Panthers Apr 13 8 – 6 16.41 TRUE
4 Raiders vs. Warriors Apr 13 20 – 16 11.70 TRUE
5 Rabbitohs vs. Storm Apr 13 10 – 17 -0.78 TRUE
6 Sea Eagles vs. Sharks Apr 14 25 – 18 17.91 TRUE
7 Wests Tigers vs. Dragons Apr 14 12 – 13 1.25 FALSE
8 Titans vs. Eels Apr 14 28 – 22 15.26 TRUE

 

Predictions for Round 7

Here are the predictions for Round 7. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Roosters vs. Dragons Apr 25 Roosters 8.30
2 Storm vs. Warriors Apr 25 Storm 26.00
3 Sea Eagles vs. Rabbitohs Apr 26 Sea Eagles 8.30
4 Cowboys vs. Raiders Apr 27 Cowboys 12.40
5 Wests Tigers vs. Broncos Apr 27 Broncos -2.40
6 Titans vs. Knights Apr 28 Titans 2.30
7 Sharks vs. Bulldogs Apr 28 Sharks 2.40
8 Panthers vs. Eels Apr 29 Panthers 6.60

 

Super 15 Predictions, Round 11

Team Ratings for Round 11

This year the predictions have been slightly changed with the help of a student, Joshua Dale. The home ground advantage now is different when both teams are from the same country to when the teams are from different countries. The basic method is described on my Department home page.

Here are the team ratings prior to Round 11, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Crusaders 7.58 9.03 -1.40
Chiefs 5.37 6.98 -1.60
Sharks 4.10 4.57 -0.50
Bulls 3.92 2.55 1.40
Stormers 3.70 3.34 0.40
Brumbies 2.77 -1.06 3.80
Blues 2.08 -3.02 5.10
Reds 0.47 0.46 0.00
Hurricanes -0.14 4.40 -4.50
Cheetahs -1.30 -4.16 2.90
Waratahs -5.09 -4.10 -1.00
Highlanders -6.13 -3.41 -2.70
Force -8.34 -9.73 1.40
Kings -10.51 -10.00 -0.50
Rebels -13.27 -10.64 -2.60

 

Performance So Far

So far there have been 61 matches played, 41 of which were correctly predicted, a success rate of 67.2%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Hurricanes vs. Force Apr 19 22 – 16 13.40 TRUE
2 Waratahs vs. Chiefs Apr 19 25 – 20 -8.60 FALSE
3 Crusaders vs. Highlanders Apr 20 24 – 8 16.20 TRUE
4 Reds vs. Brumbies Apr 20 19 – 19 0.20 FALSE
5 Sharks vs. Cheetahs Apr 20 6 – 12 10.60 FALSE
6 Kings vs. Bulls Apr 20 0 – 34 -7.70 TRUE

 

Predictions for Round 11

Here are the predictions for Round 11. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Hurricanes vs. Stormers Apr 26 Hurricanes 0.20
2 Reds vs. Blues Apr 26 Reds 2.40
3 Chiefs vs. Sharks Apr 27 Chiefs 5.30
4 Brumbies vs. Force Apr 27 Brumbies 13.60
5 Bulls vs. Waratahs Apr 27 Bulls 13.00
6 Cheetahs vs. Kings Apr 27 Cheetahs 11.70
7 Crusaders vs. Rebels Apr 28 Crusaders 24.80

 

The “stupidest, most outrageous statistic I ever heard this week”

Graeme Hill has sent in this clip from his Radio Live show on Sunday. Listen from about 4:30 until 7:00 elapsed.

He calls this quote from ONE News the “stupidest, most outrageous statistic I ever heard this week”:

“Experts say the odds of having two disasters like those in Boston and Texas in the same week are 1 in every 4,800 years.”

The statistic seems to have come from an ABC News story, but there’s no attribution there for the “experts” either.