Posts from July 2013 (70)

July 17, 2013

Some data visualisation links

These are from a long list of recommended links at Health Intelligence (I wouldn’t recommend all of the long list).

Super 15 Predictions for the Qualifying Finals

Team Ratings for the Qualifying Finals

This year the predictions have been slightly changed with the help of a student, Joshua Dale. The home ground advantage now is different when both teams are from the same country to when the teams are from different countries. The basic method is described on my Department home page.

Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Crusaders 8.85 9.03 -0.20
Bulls 4.69 2.55 2.10
Sharks 4.43 4.57 -0.10
Stormers 3.99 3.34 0.70
Chiefs 3.61 6.98 -3.40
Brumbies 2.84 -1.06 3.90
Waratahs 1.15 -4.10 5.30
Reds 0.65 0.46 0.20
Cheetahs -1.69 -4.16 2.50
Hurricanes -2.28 4.40 -6.70
Blues -3.22 -3.02 -0.20
Highlanders -5.85 -3.41 -2.40
Force -7.69 -9.73 2.00
Rebels -8.66 -10.64 2.00
Kings -15.61 -10.00 -5.60

 

Performance So Far

So far there have been 120 matches played, 83 of which were correctly predicted, a success rate of 69.2%.

Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Crusaders vs. Hurricanes Jul 12 25 – 17 14.70 TRUE
2 Rebels vs. Highlanders Jul 12 38 – 37 1.20 TRUE
3 Blues vs. Chiefs Jul 13 16 – 26 -3.30 TRUE
4 Waratahs vs. Reds Jul 13 12 – 14 4.00 FALSE
5 Force vs. Brumbies Jul 13 21 – 15 -10.70 FALSE
6 Sharks vs. Kings Jul 13 58 – 13 18.30 TRUE
7 Stormers vs. Bulls Jul 13 30 – 13 -1.10 FALSE

 

Predictions for the Qualifying Finals

Here are the predictions forthe Qualifying Finals. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Crusaders vs. Reds Jul 20 Crusaders 12.20
2 Brumbies vs. Cheetahs Jul 21 Brumbies 8.50

 

July 16, 2013

Benefits numbers context

According to Stuff, Paula Bennett says the number on benefits is down by 10,000 since last year . Whether this is good or bad depends on where they ended up instead (as the story points out) but it is what the Government was attempting.

What the story doesn’t point out is that by MSD numbers there were drops of 10,000 or more in the number on benefits in the years ending June 2004, 2005, 2006, 2007 (and over 7000 in the year ending 2012, and over 8000 in the year ending 2003). And that’s as early as the data file goes.

Numbers on benefits have been going down for a long time, with an interruption for the recession, when they (obviously) went up a lot. Some of what we’re seeing now is just economic recovery, some is new rules, some is long-term changes in society. It’s hard to split the credit or blame, but it’s useful to know that a fall of 10,000 in a year isn’t  unusual.

If you hold a seashell to your ear

From XKCD and Bayes’ Theorem

seashell

 

 

Briefly

Numbers have a magnitude as well as an order

Nathan Yau, at Flowing Data, shows this barchart-like object

Speaking-the-world-625x622

 

The most obvious problem is that the bar for Chinese is about 3 times the area of the bar for English, and should be either about a third bigger, or, if you really believe there are 1 trillion Chinese speakers, 1300 times bigger. The bars for English and Spanish are obviously out of proportion as well.

I tried to find the original source of this graph, which Nathan doesn’t give, perhaps because he couldn’t find it either.  However, Google image search did show a related graph that explains some of the problems.  I found it at a range of sites, but the one at 10-most.com is where it looks most original.

The-most-common-languages

 

This version is still ugly, but the bars match the labels.

Comparing the numbers to Wikipedia, it looks as though someone edited the labels from ‘top ten by number of native speakers’ to ‘top ten by total number of speakers’ either as a deliberate change or because they didn’t understand the original graph (and they also inflated the Chinese count by a factor of 1000). Sadly, the editor didn’t appreciate that the bars are more than just decoration; they are supposed to convey numeric information.

PS: Yes, we could also get picky about ‘Chinese’ as a single language, but they clearly mean Mandarin Chinese, the group of dialects that includes the official language of both the PRC and the ROC.

July 15, 2013

Think of a number and multiply by eight

We haven’t had a good denominator post in a while, but I was struck by Stuff’s story on ‘social lending’.  By next April, an NZ company hopes to be able to start up peer-to-peer lending here, after changes in the law, and we’re told

“This will be a way for individuals to play alongside the big institutions,” Milsom said

Just how that can happen – and the scale that is possible – is shown from what has happened in the UK, where the largest peer-to-peer lender has lent more than £330 million since it launched in 2005.

That’s an average of just over £40 million per year over 2005-2012, or about 66p per capita per year in the UK.  On the same scale in New Zealand, that would come to perhaps $6 million per year in loans.  The UK lender in question appears to be Zopa, and they make their money by charging a 1% borrowing fee and 1% lending fee.  Under the same setup, $6 million per year in loans would mean $120 000 per year in income for an NZ equivalent, before paying any costs.

It’s certainty possible that peer-to-peer lending will take off, and it might not be long before it exceeds 1% of all unsecured consumer debt. But the idea makes more sense spun as ‘room for expansion’ not as ‘look at the scale in the UK’.

Stat of the Week Competition: July 13 – 19 2013

Each week, we would like to invite readers of Stats Chat to submit nominations for our Stat of the Week competition and be in with the chance to win an iTunes voucher.

Here’s how it works:

  • Anyone may add a comment on this post to nominate their Stat of the Week candidate before midday Friday July 19 2013.
  • Statistics can be bad, exemplary or fascinating.
  • The statistic must be in the NZ media during the period of July 13 – 19 2013 inclusive.
  • Quote the statistic, when and where it was published and tell us why it should be our Stat of the Week.

Next Monday at midday we’ll announce the winner of this week’s Stat of the Week competition, and start a new one.

(more…)

Stat of the Week Competition Discussion: July 13 – 19 2013

If you’d like to comment on or debate any of this week’s Stat of the Week nominations, please do so below!

July 13, 2013

Visualising the Bechdel test

The Bechdel Test classifies movies according to whether they have two female characters, who at some point talk to each other, about something other than a man.

It’s not that all movies should pass the test — for example, a movie with a tight first-person viewpoint is unlikely to pass the test if the viewpoint character is male, and no-one’s saying such movies should not exist.  The point of the test is that surprisingly few movies pass it.

At Ten Chocolate Sundaes there’s an interesting statistical analysis of movies over time and by genre, looking at the proportion that pass the test.  The proportion seems to have gone down over time, though it’s been pretty stable in recent years.