August 9, 2022

NRL Predictions for Round 22

Team Ratings for Round 22

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Panthers 12.80 14.26 -1.50
Storm 10.75 19.20 -8.50
Rabbitohs 7.23 15.81 -8.60
Roosters 4.04 2.23 1.80
Cowboys 3.86 -12.27 16.10
Sharks 3.05 -1.10 4.20
Eels 2.60 2.54 0.10
Sea Eagles 0.17 10.99 -10.80
Raiders -1.56 -1.10 -0.50
Broncos -1.60 -8.90 7.30
Dragons -3.94 -7.99 4.10
Bulldogs -5.68 -10.25 4.60
Titans -7.46 1.05 -8.50
Knights -7.63 -6.54 -1.10
Wests Tigers -8.32 -10.94 2.60
Warriors -10.30 -8.99 -1.30

 

Performance So Far

So far there have been 160 matches played, 108 of which were correctly predicted, a success rate of 67.5%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Roosters vs. Broncos Aug 04 34 – 16 7.30 TRUE
2 Storm vs. Titans Aug 05 32 – 14 21.80 TRUE
3 Sea Eagles vs. Eels Aug 05 20 – 36 2.80 FALSE
4 Rabbitohs vs. Warriors Aug 06 48 – 10 21.00 TRUE
5 Raiders vs. Panthers Aug 06 6 – 26 -10.10 TRUE
6 Sharks vs. Dragons Aug 06 24 – 18 10.70 TRUE
7 Bulldogs vs. Cowboys Aug 07 14 – 28 -5.40 TRUE
8 Wests Tigers vs. Knights Aug 07 10 – 14 3.50 FALSE

 

Predictions for Round 22

Here are the predictions for Round 22. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Panthers vs. Storm Aug 11 Panthers 5.10
2 Warriors vs. Bulldogs Aug 12 Warriors 0.90
3 Eels vs. Rabbitohs Aug 12 Rabbitohs -1.60
4 Roosters vs. Cowboys Aug 13 Roosters 3.20
5 Wests Tigers vs. Sharks Aug 13 Sharks -11.40
6 Broncos vs. Knights Aug 13 Broncos 9.00
7 Raiders vs. Dragons Aug 14 Raiders 5.40
8 Titans vs. Sea Eagles Aug 14 Sea Eagles -4.60

 

Bunnings NPC Predictions for Wednesday Game for Round 1

Team Ratings for Wednesday Game for Round 1

Because of the unusual scheduling of games in the NPC, I will be giving a forecast for the Wednesday game on a Tuesday and forecasts for the weekend games after the Wednesday game.

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Hawke’s Bay 5.73 5.87 -0.10
Tasman 5.45 5.94 -0.50
Auckland 5.03 4.50 0.50
Canterbury 3.64 2.00 1.60
Wellington 3.18 3.58 -0.40
Taranaki 2.88 3.63 -0.80
Waikato 2.14 2.00 0.10
North Harbour 1.78 2.31 -0.50
Bay of Plenty 0.50 0.10 0.40
Otago -1.95 -1.63 -0.30
Northland -5.92 -6.68 0.80
Counties Manukau -6.00 -6.32 0.30
Southland -6.52 -7.01 0.50
Manawatu -7.61 -5.97 -1.60

 

Performance So Far

So far there have been 7 matches played, 4 of which were correctly predicted, a success rate of 57.1%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Manawatu vs. Canterbury Aug 05 15 – 62 -5.50 TRUE
2 Counties Manukau vs. Otago Aug 06 23 – 22 -2.20 FALSE
3 Waikato vs. Hawke’s Bay Aug 06 32 – 32 -1.40 FALSE
4 Auckland vs. North Harbour Aug 06 36 – 26 4.70 TRUE
5 Taranaki vs. Northland Aug 07 11 – 13 12.80 FALSE
6 Tasman vs. Southland Aug 07 27 – 20 15.50 TRUE
7 Wellington vs. Bay of Plenty Aug 07 37 – 35 6.00 TRUE

 

Predictions for Wednesday Game for Round 1

Here are the predictions for Wednesday Game for Round 1. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Manawatu vs. Auckland Aug 10 Auckland -10.10

 

August 5, 2022

Briefly

  • There’s a new version of ESR’s Wastewater Covid dashboard. It has information on which variants are being found, by location and over time
  • Hashigo Zake, the Wellington craft beer bar, has a new Twitter bot tweeting out the CO2 concentration inside the bar. I summarised a couple of days of it:
  • How far can you go by train in 5 hours? A map of Europe
  • How likely are people to win the lottery: the Washington Post did a quiz
  • Jamie Morton in the Herald has a good discussion of the Stats NZ review of the population denominator used in Covid vaccine stats.  The HSU undercounts somewhat, especially for Māori and Pacific Peoples, but it has the virtue of counting ethnicity the same way that the vaccination data does, and of including people in NZ who are not residents.
August 4, 2022

Bunnings NPC Predictions for Week 1

Team Ratings for Week 1

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Tasman 5.94 5.94 -0.00
Hawke’s Bay 5.87 5.87 0.00
Auckland 4.50 4.50 0.00
Taranaki 3.63 3.63 -0.00
Wellington 3.58 3.58 0.00
North Harbour 2.31 2.31 -0.00
Canterbury 2.00 2.00 -0.00
Waikato 2.00 2.00 0.00
Bay of Plenty 0.10 0.10 0.00
Otago -1.63 -1.63 0.00
Manawatu -5.97 -5.97 0.00
Counties Manukau -6.32 -6.32 -0.00
Northland -6.68 -6.68 -0.00
Southland -7.01 -7.01 0.00

 

Predictions for Week 1

Here are the predictions for Week 1. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Manawatu vs. Canterbury Aug 05 Canterbury -5.50
2 Counties Manukau vs. Otago Aug 06 Otago -2.20
3 Waikato vs. Hawke’s Bay Aug 06 Hawke’s Bay -1.40
4 Auckland vs. North Harbour Aug 06 Auckland 4.70
5 Taranaki vs. Northland Aug 07 Taranaki 12.80
6 Tasman vs. Southland Aug 07 Tasman 15.50
7 Wellington vs. Bay of Plenty Aug 07 Wellington 6.00
8 Manawatu vs. Auckland Aug 10 Auckland -8.00

 

August 2, 2022

NRL Predictions for Round 21

Team Ratings for Round 21

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Panthers 12.17 14.26 -2.10
Storm 11.06 19.20 -8.10
Rabbitohs 6.21 15.81 -9.60
Sharks 3.42 -1.10 4.50
Roosters 3.36 2.23 1.10
Cowboys 3.31 -12.27 15.60
Eels 1.48 2.54 -1.10
Sea Eagles 1.29 10.99 -9.70
Broncos -0.93 -8.90 8.00
Raiders -0.93 -1.10 0.20
Dragons -4.32 -7.99 3.70
Bulldogs -5.13 -10.25 5.10
Wests Tigers -7.72 -10.94 3.20
Titans -7.76 1.05 -8.80
Knights -8.23 -6.54 -1.70
Warriors -9.27 -8.99 -0.30

 

Performance So Far

So far there have been 152 matches played, 102 of which were correctly predicted, a success rate of 67.1%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Sea Eagles vs. Roosters Jul 28 10 – 20 2.50 FALSE
2 Warriors vs. Storm Jul 29 12 – 24 -15.40 TRUE
3 Eels vs. Panthers Jul 29 34 – 10 -11.70 FALSE
4 Titans vs. Raiders Jul 30 24 – 36 -2.60 TRUE
5 Sharks vs. Rabbitohs Jul 30 21 – 20 0.10 TRUE
6 Broncos vs. Wests Tigers Jul 30 18 – 32 12.90 FALSE
7 Knights vs. Bulldogs Jul 31 10 – 24 1.80 FALSE
8 Dragons vs. Cowboys Jul 31 8 – 34 -1.80 TRUE

 

Predictions for Round 21

Here are the predictions for Round 21. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Roosters vs. Broncos Aug 04 Roosters 7.30
2 Storm vs. Titans Aug 05 Storm 21.80
3 Sea Eagles vs. Eels Aug 05 Sea Eagles 2.80
4 Rabbitohs vs. Warriors Aug 06 Rabbitohs 21.00
5 Raiders vs. Panthers Aug 06 Panthers -10.10
6 Sharks vs. Dragons Aug 06 Sharks 10.70
7 Bulldogs vs. Cowboys Aug 07 Cowboys -5.40
8 Wests Tigers vs. Knights Aug 07 Wests Tigers 3.50

 

Homelessness statistics

Radio NZ reported an estimate by the charity Orange Sky “One in six kiwis have been homeless and tonight about 41,000 of us will bed down without adequate access to housing”.  I saw some skepticism of these figures on Twitter, so let’s take a look.

Based on the 2018 Census, researchers at the University of Otago estimated

  • 3,624 people who were considered to be living without shelter (on the streets, in improvised dwellings – including cars – and in mobile dwellings). 
  • 7,929 people who were living in temporary accommodation (night shelters, women’s refuges, transitional housing, camping grounds, boarding houses, hotels, motels, vessels, and marae). 
  • 30,171 people who were sharing accommodation, staying with others in a severely crowded dwelling. 
  • 60,399 people who were living in uninhabitable housing that was lacking one of six basic amenities: tap water that is safe to drink; electricity; cooking facilities; a kitchen sink; a bath or shower; a toilet.

So, the figure of 41,000 is a surprisingly close match to the Census data for those first three groups — if you’d only count the first group or the first two, you would obviously get a smaller number.  Because it would be hard to estimate current homelessness from a YouGov survey panel, I suspect the number did come from the Census,  and the ‘new study’ the story mentions is responsible for the ‘one in six’, though Orange Sky actually gives the number as ‘more than one in five (21%)’.

Do the two figures match? Well, if about a million people had ever been homeless (in the broad sense) and 41,000 currently are, that’s a ratio of 25.  The median age of adults (YouGov interviews adults) is probably in the 40s, so if the typical person who was ever homeless spent less than a couple of years homeless the figures would match.  The People’s Project NZ say that homelessness in NZ is mostly short-term — in the sense that most people who are ever homeless are only that way for a relatively short time (which isn’t the same as saying most people who are currently homeless will be that way for a short time).

So, the figures aren’t obviously implausible, and given that they’re presented as the result of research that should be able to get reasonable estimates, they may well be reasonably accurate.

July 28, 2022

Counting bots better

I wrote before about estimating the proportion of spam bots among the apparent people on Twitter.  The way Twitter does it seems ok. According to some people in the internet who seem to know about Delaware mergers and acquisitions law it doesn’t even matter if the way Twitter does it is ok, as long as it roughly matches what they have claimed they do.  But it’s still interesting from a statistics point of view to ask whether it could be done better given the existence of predictive models (“AI”, if you must).  It’s also connected to my research.

Imagine we have a magic black box that spits out “Bot” or “Not” for each user.  We don’t know how it works (it’s magic) and we don’t know how much to trust it (it’s magic). We feed in the account details of 217 million monetisable daily active users and it chugs and whirrs for a while before saying “You have 15696969 bots.”

We’re not going to just tell investors “A magic box says we have 15696969 bots among our daily active users“, but it’s still useful information.  We also have reviewed a genuine random sample of 1000 accounts by hand, over a couple of weeks, and we get 54 bots. We don’t want to just ignore the magic box and say “we have 5.4% bots” What should our estimate be, combining the two? It obviously depends on how accurate the magic box is!  We can get some idea by looking at what the magic box says for the 1000 accounts reviewed by hand.

Maybe the magic box says 74 of the 1000 accounts are bots: 50 of the ones that really are, and 24 others. That means it’s fairly accurate, but it overcounts by about 40%.  Over all of Twitter, you probably don’t have 15696969 bots; maybe you have more like 11,420,000 bots.   If we want the best estimate that doesn’t require trusting the magic box and only requires trusting the random sampling, we can divide up Twitter into accounts the box says are bots and ones that it says aren’t bots, estimate the true proportion in each group, and combine.   In this example, we’d get 5.3% with a 95% confidence interval of  (4.4%, 6.2%). If we didn’t have the magic box at all, we’d get an estimate of 5.4% with a confidence interval of (4.0%, 6.8%).  The magic box has improved the precision of the estimate.  With this technique, the magic box can only be helpful. If it’s accurate, we’ll get a big improvement in precision. If it’s not accurate, we’ll get little or no improvement in precision, but we still won’t introduce any bias.

The techique is called post-stratification, and it’s the simplest form of a very general approach to using information about a whole population to improve an estimate from a random sample.  Improving estimates of proportions or counts with post-stratification is a very old idea (well, very old by the standards of statistics).  More recent research in this area includes ways to improve estimation of more complicated statistical estimates, such as regression models. We also look at ways to use the magic box to pick a better random sample  — in this example, instead of picking 1000 users at random we might pick a random sample of 500 accounts that the magic box says are bots and 500 accounts that it says are people. Or maybe it’s more reliable on old accounts than new ones, and we want to take random samples from more new accounts and fewer old accounts.

In practical applications the real limit on this idea is the difficulty of doing random sampling.  For Twitter, that’s easy. It’s feasible when you’re choosing which medical records from a database to check by hand, or which frozen blood samples to analyse, or which Covid PCR swabs to send for genome sequencing.  If you’re sampling people, though, the big challenge is non-response. Many people just won’t fill in your forms or talk to you on the phone or whatever. Post-stratification can be part of the solution there, too, but the problem is a lot messier.

 

July 27, 2022

Attendance figures

Chris Luxon said today on RNZ Morning Report that “55% of kids aren’t going to school regularly”.  On Twitter, Simon Britten said “In Term 1 of 2022 the Unjustified Absence rate was 6.4%, up from 4.1% the year prior. Not great, but also not 50%.”

It’s pretty unusual for NZ politicians to make straightforwardly false statements about publicly available statistics, so if there are numbers that seem to disagree or are just surprising, the most likely explanation is that the number doesn’t mean what you think it means.   It sounds like we have a disagreement about facts here, but we actually have a disagreement about which summary is most useful.

New Zealand does have an ongoing problem with school attendance — according to the Government, not just the Opposition.  The new Attendance and Engagement Strategy document (PDF) says that the percentage of regular attendance was  59.7% in 2021, down from  69.5% in 2015. The aim is to raise this to 70% by 2024 and 75% by 2026.

So if the unjustified absence rate is 6.4%, how can the regular attendance rate be 59.7% or 45%?  “Regular attendance” is defined as attending school at least 90% of the time — so if you miss more than one day per fortnight, or more than one week per term, you are not attending regularly.

For example, suppose half the kids in NZ missed one week and one day in term 1. The absence rate would be about 12% but the regular attendance rate would be 50%.  The unjustified absence rate could be anything from 0% to 12%. It’s quite possible to have a 5% unjustified absence rate and a 50% regular attendance rate.

Now we want more details. They are available here.  The regular attendance rate is down dramatically this year, from 66.8% in term 1 last year to 46.1% in term 1 this year. The proportion of half-days attended is down less dramatically, from 90.5% in term 1 last year to 84.5% in term 1 this year.  Justified absences are up 4.5 percentage points and unjustified absences up by just under 2 percentage points.

What’s different between term 1 this year and term 1 last year?

Well…

It wouldn’t be surprising if a fair fraction of NZ kids took a week off school in term 1, either because they had Covid or because they were in isolation as household contacts.  That’s what should have happened, from a public health point of view.  It’s actually a bit surprising to me that justified absences weren’t even higher. Term 1, 2022, shouldn’t really representative of the long-term state of schools in NZ.  Attendance rates were higher before the Omicron spike; they will probably be higher in the future even without anti-truancy interventions.

It’s reasonable to be worried about school attendance, as the Government and Opposition both claim they are. I don’t think “55% of kids aren’t going to school regularly”  is a particularly good way to describe a Covid outbreak.  Last year’s figures are more relevant if you want to talk about the problem seriously.

July 26, 2022

Briefly

  • Derek Lowe writesLate last week came this report in Science about doctored images in a series of very influential papers on amyloid and Alzheimer’s disease. That’s attracted a lot of interest, as well it should, and as a longtime observer of the field (and onetime researcher in it), I wanted to offer my own opinions on the controversy.”  As he says, the interest in amyloid is not just (or primarily) driven by the allegedly fraudulent research. There’s a lot of support for the importance of beta-amyloid from genetics: mutations that cause early-onset Alzheimer’s, and perhaps even more convincingly, a mutation found in Icelanders that protects against Alzheimers. The alleged fraud is bad, as is the current complete failure of research into treatments, but the link between the two isn’t as strong as some people are implying.
  • Prof Casey Fiesler, who teaches in the area of tech ethics and governance, is developing a TikTok-based tech ethics and privacy course
  • ESR’s Covid wastewater dashboard is live.  This is important because Everyone Poops. We don’t have an exact conversion from measured viruses to active cases, and the conversion could vary with the strain of Covid and with age of the patients, but at least it won’t depend on who decides to get tested and report their test results.
  • The wastewater data will be an excellent complement for the prevalence survey that the Ministry of Health is starting up. The survey, assuming that a reasonable fraction of people go along with getting tested, will give a direct estimate of the true population infection rate, but it will not be as detailed as the wastewater data, which can give estimates for relatively small areas and short time frames.
  • Briefing on the Data and Statistics Bill from the NZ Council of Civil Liberties. If you follow StatsChat you’ve seen these points before. And you will see them again.

NRL Predictions for Round 20

 

 

Team Ratings for Round 20

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Panthers 14.17 14.26 -0.10
Storm 11.32 19.20 -7.90
Rabbitohs 6.28 15.81 -9.50
Sharks 3.35 -1.10 4.50
Roosters 2.59 2.23 0.40
Sea Eagles 2.07 10.99 -8.90
Cowboys 1.90 -12.27 14.20
Broncos 0.62 -8.90 9.50
Eels -0.52 2.54 -3.10
Raiders -1.53 -1.10 -0.40
Dragons -2.91 -7.99 5.10
Bulldogs -6.09 -10.25 4.20
Titans -7.16 1.05 -8.20
Knights -7.27 -6.54 -0.70
Wests Tigers -9.27 -10.94 1.70
Warriors -9.54 -8.99 -0.60

 

Performance So Far

So far there have been 144 matches played, 98 of which were correctly predicted, a success rate of 68.1%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Eels vs. Broncos Jul 21 14 – 36 5.00 FALSE
2 Dragons vs. Sea Eagles Jul 22 20 – 6 -4.10 FALSE
3 Knights vs. Roosters Jul 22 12 – 42 -3.80 TRUE
4 Raiders vs. Warriors Jul 23 26 – 14 13.80 TRUE
5 Panthers vs. Sharks Jul 23 20 – 10 14.50 TRUE
6 Rabbitohs vs. Storm Jul 23 24 – 12 -4.00 FALSE
7 Bulldogs vs. Titans Jul 24 36 – 26 2.90 TRUE
8 Cowboys vs. Wests Tigers Jul 24 27 – 26 16.00 TRUE

 

Predictions for Round 20

Here are the predictions for Round 20. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Sea Eagles vs. Roosters Jul 28 Sea Eagles 2.50
2 Warriors vs. Storm Jul 29 Storm -15.40
3 Eels vs. Panthers Jul 29 Panthers -11.70
4 Titans vs. Raiders Jul 30 Raiders -2.60
5 Sharks vs. Rabbitohs Jul 30 Sharks 0.10
6 Broncos vs. Wests Tigers Jul 30 Broncos 12.90
7 Knights vs. Bulldogs Jul 31 Knights 1.80
8 Dragons vs. Cowboys Jul 31 Cowboys -1.80