Posts from June 2018 (16)

June 11, 2018

Who gets counted?

Last week, Microsoft (who you’ve probably heard of) bought GitHub (who you may well not have heard of) with $7.5 billion in Microsoft stock.  Github is a site that cross-bred version-control software (‘track changes’ for programmers) with social media, providing a place to share and promote code.

Stuff and the Herald quoted the same number

More than 28 million developers around the world use GitHub, with Microsoft ranking as the most active organisation on GitHub.

You might wonder what proportion of developers use GitHub. If you search a bit on Google, you’ll find that the total number of developers is estimated at about 21 million, so roughly 130% of them use GitHub.

Obviously there’s something wrong there.  The problem is how to define ‘developers’.

In the US, the Bureau of Labor Statistics reports how many people do each job. They say, currently,  that there are 1,617,400 “Software Developers and Programmers”.  The figure of 28 million worldwide is actually based on a subset of those, the 1.2 million “Software Developers, Applications” and “Software Developers, Systems”.  This sort of official classification has to be narrow, because the goal is for  every job to end up in exactly one category.

Lots of people in the US who are developers in the GitHub sense aren’t developers in the Bureau of Labor Statistics sense. Some of them write software only in their spare time. Others write software as part of their jobs, but their jobs are classified somewhere else in the BLS system. The same is true in New Zealand. Stats NZ reports 31,860 jobs in ANZSIC06 category 7000 “Computer Systems Design and Related Services”, which is a bit broader than the US category.  Even though I’m a developer in the GitHub sense, I’m not one of them. I’m in 8102, “Higher Education”.

Other people who probably in the 21 million count include statisticians, data scientists, computational biologists, ornithologists, journalists, linguists, and many more.

Official statistics are usually pretty accurate, but they are only accurate for what they are trying to measure, which might not be what you are looking for.

June 7, 2018

Looking at the numbers

The new QS university rankings are out, and there’s a story in the Herald.  It starts

Staff cuts despite growing student numbers have dragged most New Zealand universities down in the latest world rankings.

The biggest six of the country’s eight universities have all tumbled in the London-based QS rankings, which are regarded as the most important for attracting international students.

“Tumbled” is an exaggeration — for example, the University of Auckland has ‘tumbled’ from 82nd to 85th in the rankings. But the message that it’s staff numbers does seem to be backed up with a quote from QS

“The increase in enrolments – and the decrease in faculty numbers – reported by the country’s universities sees all eight receive a lower score for faculty/student ratio,”

QS don’t make it easy to find older numbers, but an archive of their webpage in March last year said there were 29,930 students and 2025 academic staff, and a ranking of 81. The current figures are  29,641  students (ie, fewer) and 2,047 academic staff (ie, more), for an improvement in staff:student ratio from 14.8 to 14.5. 

That’s over a two year period, but last year, the story at Stuff said

New Zealand universities performed well in research outputs – Waikato ranked 133rd, Otago 174th and Canterbury 178th – but showed “uniformly deteriorating” faculty to student ratios. The exception was Lincoln University, which featured among the top 200 universities globally in that measure.

So, cumulatively over this two year period the staff:student ratio at UoA (as measured by QS) improved, but the reporting said it worsened in both years.

My guess as to what’s going on is that these rankings are rankings. What they mean by “lower score for faculty/student ratio” isn’t that the ratio got worse here, but that it got better here by less than it did at some competing universities.

The other strange thing in the Herald story is this:

The worsening staff/student ratio in NZ universities was entirely due to cuts of 203 academics at Massey and 74 at Lincoln.

It could be true that these are the only NZ universities with a worsening staff/student ratio — the other universities could be seeing the same sort of change that UoA did — but if it is, the apparent contradiction with the lead should have been noticed.

June 6, 2018

Super 15 Predictions for Round 17

Team Ratings for Round 17

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Crusaders 15.52 15.23 0.30
Hurricanes 12.56 16.18 -3.60
Lions 8.48 13.81 -5.30
Highlanders 7.90 10.29 -2.40
Chiefs 7.64 9.29 -1.60
Waratahs 1.05 -3.92 5.00
Sharks 0.78 1.02 -0.20
Jaguares 0.62 -4.64 5.30
Stormers -0.58 1.48 -2.10
Bulls -2.90 -4.79 1.90
Blues -2.99 -0.24 -2.80
Brumbies -3.06 1.75 -4.80
Rebels -8.00 -14.96 7.00
Reds -9.21 -9.47 0.30
Sunwolves -15.24 -18.42 3.20

 

Performance So Far

So far there have been 99 matches played, 69 of which were correctly predicted, a success rate of 69.7%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Highlanders vs. Hurricanes Jun 01 30 – 14 -3.50 FALSE
2 Blues vs. Rebels Jun 02 10 – 20 11.60 FALSE
3 Chiefs vs. Crusaders Jun 02 20 – 34 -3.10 TRUE
4 Reds vs. Waratahs Jun 02 41 – 52 -6.20 TRUE
5 Brumbies vs. Sunwolves Jun 03 41 – 31 17.00 TRUE

 

Predictions for Round 17

Here are the predictions for Round 17. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Blues vs. Reds Jun 29 Blues 10.20
2 Rebels vs. Waratahs Jun 29 Waratahs -5.60
3 Highlanders vs. Chiefs Jun 30 Highlanders 3.80
4 Brumbies vs. Hurricanes Jun 30 Hurricanes -11.60
5 Sunwolves vs. Bulls Jun 30 Bulls -8.30
6 Sharks vs. Lions Jun 30 Lions -4.20
7 Jaguares vs. Stormers Jun 30 Jaguares 5.20

 

Briefly

    • According to the Daily Mail “Brisk ten-minute walk, around 1,000 steps, can increase mortality by 15 per cent.”  They mean decrease.
    • Wikipedia thinks Amanda Cox, the data visualisation expert from the New York Times, isn’t “notable”. They are wrong and they should feel bad. But also, published evidence that she is notable would be useful.
    • The FDA has issued guidance on how to include pregnant women in clinical trials.
    • A monthly collection of the best data visualisation from around the web
    • Black cats are less popular

Methamphetamine testing

The report from Dr Anne Bardsley and Dr Felicia Low for the Office of the Chief Science Adviser makes clear that testing of houses for methamphetamine has been a complete failure for the sort of evidence-based risk-benefit analysis the NZ governments are claiming to care about.

It’s not just that Housing NZ used an Australian guideline for how clean a former meth lab should be after you’ve cleaned it as a screening threshold. Or that there’s an explicit 300-fold safety factor underlying that threshold even for the most susceptible people (toddlers crawling around and putting things in their mouths). Or that they not only evicted people but sometimes took away their personal belongings as too unclean to touch.  

In a situation where Housing NZ now claims they knew their standard was not very well founded, they didn’t try to do any better.  Faced with a huge testing and remediation bill, whose necessity was — at the most generous evaluation — unclear, they didn’t spend the relatively small amounts that would be needed to find out whether they were wasting public money. They didn’t even ask for help from, say,  the Chief Science Adviser, or the Royal Society Te Apārangi.

More importantly, though, they ignored the harm done by evicting vulnerable people.  The fundamental assumption of any cost-benefit or risk-benefit analysis is that you’ve got the costs and risks and benefits right, or at least that you’ve made an honest effort to get them right and been explicit about your uncertainty.  There are difficult second-order questions of whose costs and benefits you include, and how you account for hard-to-quantify factors like cultural preferences and reputational costs. But if you don’t even put in some of the major costs of a policy, you’re admitting up front that you don’t care about the right answer.

There currently seems pretty broad consensus among people who don’t work there that Housing NZ needs to be remediated. But the ability of the meth screening policies to last that long will — and should — raise doubts about evidence-based decision-making across the NZ public sector. Which is a pity.

 

June 5, 2018

NRL Predictions for Round 14

Team Ratings for Round 14

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Storm 8.88 16.73 -7.90
Panthers 5.75 2.64 3.10
Rabbitohs 4.03 -3.90 7.90
Dragons 3.94 -0.45 4.40
Sharks 3.10 2.20 0.90
Roosters 1.86 0.13 1.70
Cowboys 0.54 2.97 -2.40
Broncos 0.15 4.78 -4.60
Raiders -0.15 3.50 -3.70
Wests Tigers -0.70 -3.63 2.90
Sea Eagles -2.08 -1.07 -1.00
Bulldogs -2.16 -3.43 1.30
Warriors -4.13 -6.97 2.80
Eels -6.11 1.51 -7.60
Knights -7.42 -8.43 1.00
Titans -7.81 -8.91 1.10

 

Performance So Far

So far there have been 100 matches played, 59 of which were correctly predicted, a success rate of 59%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Sea Eagles vs. Cowboys May 31 12 – 26 2.70 FALSE
2 Rabbitohs vs. Sharks Jun 01 22 – 14 3.30 TRUE
3 Eels vs. Knights Jun 02 4 – 30 9.20 FALSE
4 Roosters vs. Wests Tigers Jun 03 16 – 14 6.10 TRUE

 

Predictions for Round 14

Here are the predictions for Round 14. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Raiders vs. Panthers Jun 08 Panthers -2.90
2 Titans vs. Rabbitohs Jun 08 Rabbitohs -8.80
3 Sea Eagles vs. Warriors Jun 09 Sea Eagles 6.50
4 Knights vs. Roosters Jun 09 Roosters -6.30
5 Eels vs. Cowboys Jun 09 Cowboys -3.70
6 Sharks vs. Wests Tigers Jun 10 Sharks 6.80
7 Storm vs. Broncos Jun 10 Storm 11.70
8 Bulldogs vs. Dragons Jun 11 Dragons -3.10