Archives (240)

August 25, 2020

NRL Predictions for Round 16

Team Ratings for Round 16

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Storm 14.06 12.73 1.30
Roosters 10.82 12.25 -1.40
Raiders 6.16 7.06 -0.90
Panthers 6.04 -0.13 6.20
Eels 5.46 2.80 2.70
Rabbitohs 4.00 2.85 1.20
Sharks 0.59 1.81 -1.20
Knights -0.11 -5.92 5.80
Wests Tigers -2.54 -0.18 -2.40
Sea Eagles -2.67 1.05 -3.70
Dragons -3.37 -6.14 2.80
Warriors -4.59 -5.17 0.60
Bulldogs -6.32 -2.52 -3.80
Cowboys -6.77 -3.95 -2.80
Broncos -11.05 -5.53 -5.50
Titans -11.72 -12.99 1.30

 

Performance So Far

So far there have been 120 matches played, 83 of which were correctly predicted, a success rate of 69.2%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Eels vs. Storm Aug 20 14 – 0 -8.70 FALSE
2 Panthers vs. Sharks Aug 21 38 – 12 5.50 TRUE
3 Broncos vs. Dragons Aug 21 24 – 28 -6.00 TRUE
4 Titans vs. Raiders Aug 22 16 – 36 -15.20 TRUE
5 Wests Tigers vs. Roosters Aug 22 16 – 38 -10.10 TRUE
6 Rabbitohs vs. Sea Eagles Aug 22 56 – 16 5.80 TRUE
7 Bulldogs vs. Warriors Aug 23 14 – 20 3.90 FALSE
8 Knights vs. Cowboys Aug 23 12 – 0 8.10 TRUE

 

Predictions for Round 16

Here are the predictions for Round 16. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Eels vs. Rabbitohs Aug 27 Eels 3.50
2 Dragons vs. Titans Aug 28 Dragons 10.40
3 Roosters vs. Broncos Aug 28 Roosters 23.90
4 Knights vs. Warriors Aug 29 Knights 9.00
5 Sharks vs. Cowboys Aug 29 Sharks 9.40
6 Panthers vs. Wests Tigers Aug 29 Panthers 10.60
7 Storm vs. Sea Eagles Aug 30 Storm 16.70
8 Raiders vs. Bulldogs Aug 30 Raiders 14.50

 

August 24, 2020

A lanyard vs a piece of paper

Staged debate, as a format, has problems for actual discussion. So I wasn’t expecting the Stuff for/against on the COVID card to be more informative than the various discussions I’ve seen on Twitter.

This, from Ian Taylor was more than I’d anticipated, though

The Government has budgeted $210m for the 2023 census. That’s $210m to get a piece of paper to every New Zealander.

If you were a nasty suspicious person you might think I’ve quoted this out of context, and cut out all the things apart from the mail-out that go into making the value of the census over a billion dollars (PDF) — developing a sampling frame for dwellings, the hardware and software computer systems, data entry and validation,  monitoring of response rates, employing people to go door-to-door to catch up on non-response, the post-census enumeration survey, estimation of under-coverage, imputation of missing data, and so on.

You might think I’d left those out. But I didn’t. Describing the next Census as $210m to get a piece of paper to every New Zealander is like describing the COVID card as $100m to get a lanyard to every New Zealander. It leaves out all the stuff that makes it work.

And it’s not as if there aren’t other, more relevant, comparisons to make. A better comparison for the $100m cost of the COVID card would be the cost a of a few days more for Auckland at Level 3. If the COVID card could save us a week at level 3 in total, over the next couple of years, and there isn’t another solution that would be better or cheaper, then it easily makes sense.

I’m basically in favour of Bluetooth proximity measures as an adjunct to tracing in the current situation of mostly-successful elimination. I don’t think they come anywhere close to allowing us to relax the isolation/quarantine process, as some people had suggested earlier.

For the COVID card in particular I’d like to see some evidence about realistic fractions of people carrying the thing a year from now, and about how many false-positive ‘close contacts’ it generates. This information might exist, but it hasn’t been pushed by proponents. I’m not convinced by the opposing argument that it will take months to roll out:  the optimistic estimates for mass vaccination are probably eighteen months away; we’ve got plenty of time to improve. I think there are questions about cost and reliability and acceptability of COVID card relative to other Bluetooth and non-Bluetooth options, but I’ll leave them to the engineers and designers (preferably people who won’t simply dismiss any reluctance to wear the thing as ‘fashion’).  

The polling spectrum

I’ve had two people already complain to me on Twitter about the Stickybeak polling at The Spinoff. I’m a lot less negative than they are.

To start with, I think the really important distinction in surveys is between those that are actually trying to get the right answer, and those that aren’t.  Stickybeak are on the “are trying” side.

There’s also a distinction between studies that are really only making an effort to get internally-valid comparisons and those that are trying to match the population.  Internally-valid comparisons can still be useful: if you have a big self-selected internet sample you won’t learn much about what proportion of people take drugs, but you might be able learn how the proportion of cannabis users trying to cut down compares with the proportion of nicotine users trying to cut down, or whether people who smoke weed and drink beer do both at once or on separate days, or other useful things.

Stickybeak are clearly trying to get nationally representative estimates (at least for their overall political polling): they talk about reweighting to match census data by gender, age, and region, and their claimed secret sauce is chatbots to raise response rates for online surveys.

Now, just because you’re trying to get the right answer doesn’t mean you will. There are plenty of people who try to predict Lotto results or earthquakes, too.  And there, it’s too soon to say.  We know that online panels can give good answers: YouGov has done well with this technique, where their respondents are not necessarily representative, but they have a lot of information about them.   We’re also pretty sure that pure random sampling for political opinion doesn’t work any more; response rates are so low that either quota sampling or weighting is needed to make the sample look at all like the population.

So what do I think?  I would have hoped to see more variables used to reweight (ethnicity, and finer-scale geography), with total sample size larger, not smaller, than the traditional polls.  I’d also like to see a better uncertainty description. The Spinoff is quoting

For a random sample of this size and after accounting for weighting the maximum sampling error (using 95% confidence) is approximately ±4%.

The accounting for weighting is not always done by NZ pollsters, so that’s good to see, but ‘For a random sample of this size’ seems a bit evasive.  Either they’re claiming 4% is a good summary of the (maximum) sampling error for their results, in which case they should say so, or they aren’t, in which case they should stop hinting that it is.    Still, we know that the 3.1% error claimed by traditional pollsters is an underestimate, and they largely get a pass on it.

If you want to know whether to trust their results, I can’t tell you. Stickybeak are new enough that we don’t really know how accurate they are.

August 22, 2020

Causation and fair comparisons

This is a version of a graph I saw on Twitter, posted by something who I think was trolling. The purple arrows are the lockdown decisions; the curve is the number of active COVID cases in New Zealand.  As you can see both lockdowns have been followed by a clear increase in the number of active cases, and no such increase has occurred any time when we haven’t imposed a lockdown. So, lockdowns cause COVID? Yeah nah.

Some of you are probably gearing up to say “correlation isn’t causation”; yes, well done. But that’s not the issue here. The relationship between number of active cases and lockdown is not a coincidence. There is a direct causal relationship. It just goes the other way: outbreaks cause lockdowns.

If we’re trying to estimate the effect of the California fires or the potential Gulf of Mexico hurricanes on COVID cases, it does make sense to compare infections shortly after and shortly before the event.  It obviously doesn’t for lockdown, but what (apart from “I know it when I see it”) is the distinction?

Economists would say the lockdown is endogenous (it’s coming from inside the epidemic). Epidemiologists, who have a more detailed taxonomy of bias, would talk about confounding by indication. People who take blood pressure drugs tend to have higher blood pressure than those who don’t; someone with  a headache is more likely to have taken paracetamol than someone without a headache. Interventions look bad precisely because you use them when they’re needed.  My bedroom tends to be warmer when the air conditioning is on (in summer) than when the heat pump is on (in winter).

We need a fair comparison to what actually happens after lockdown, and it isn’t business as usual.  This is where a model is useful.  We know roughly what happens to COVID case numbers with no intervention, because we have a fairly good mathematical model for how the disease spreads.  With no intervention, the number of new cases wouldn’t peak early and decline; it would keep going up.  With alternative, milder, interventions we’d need models on both sides of the comparison.  We have some data to validate the models, including genome sequencing to confirm which people were really infected as part of the same cluster, but the model does a lot of the work.

So, yes, we really can conclude that a New Zealand-style lockdown has worked.  This doesn’t mean it would work everywhere — just having the government say “lockdown” doesn’t do anything unless people cooperate — but the comparison to what we’d expect without it is evidence to say it worked here.

You get the same sort of problems in estimating the cost of lockdowns.  The cost compared to business as usual is relatively easy to estimate. That’s even a fair comparison for some policy questions: if we’re evaluating how much money it’s worth spending on infection control at the border, it’s a useful benchmark to know that a two-week Auckland lockdown won’t leave you much change out of a billion dollars.

But if you have an outbreak already and you want to estimate the cost of a lockdown compared to no lockdown, you can’t do a fair comparison to business as usual.  The economy will suffer during a prolonged outbreak: people will be reluctant to eat out or go to movies or rugby; jobs will be lost; less money will be available for spending.  Even before you add in the economic value of health, just the economic value of the economy will be down.  If you want to talk about the economic cost of the lockdown vs just letting the coronavirus run free, you need to do that comparison.  You can’t just compare to business as usual, any more than you can compare to business as usual and decide that lockdowns cause outbreaks.

August 21, 2020

Pro14 Predictions for Round 14

Team Ratings for Round 14

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Leinster 16.52 12.20 4.30
Munster 9.90 10.73 -0.80
Glasgow Warriors 5.66 9.66 -4.00
Edinburgh 5.49 1.24 4.20
Ulster 4.58 1.89 2.70
Scarlets 1.98 3.91 -1.90
Connacht 0.70 2.68 -2.00
Cardiff Blues 0.08 0.54 -0.50
Cheetahs -0.46 -3.38 2.90
Ospreys -2.82 2.80 -5.60
Treviso -3.50 -1.33 -2.20
Dragons -7.85 -9.31 1.50
Southern Kings -14.92 -14.70 -0.20
Zebre -15.37 -16.93 1.60

 

Performance So Far

So far there have been 101 matches played, 69 of which were correctly predicted, a success rate of 68.3%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Dragons vs. Treviso Feb 16 25 – 37 2.20 FALSE

 

Predictions for Round 14

Here are the predictions for Round 14. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Treviso vs. Zebre Aug 22 Treviso 16.90
2 Scarlets vs. Cardiff Blues Aug 23 Scarlets 6.90
3 Edinburgh vs. Glasgow Warriors Aug 23 Edinburgh 4.80
4 Leinster vs. Munster Aug 23 Leinster 11.60
5 Ospreys vs. Dragons Aug 23 Ospreys 10.00
6 Connacht vs. Ulster Aug 24 Connacht 1.10

 

August 19, 2020

Comparing two natural product Covid proposals

The Herald has a story headlined Pineapples could be key to treating virus about a proposed treatment for early COVID infection.  CNN (and various other sources) has a story about oleander extract† as a proposed treatment.

In both cases, the proposal is based on lab studies that have not be formally published, though the oleander one is available as a preprint.

More importantly, though, the Australian pineapple-based treatment is heading into proper clinical trials.  In lab tests it has been shown to disable the spike protein that the coronavirus uses for docking to cells.  A safety study is about to start, and has an entry in the ANZ clinical trial registry.  The treatment isn’t especially dangerous (it’s the enzyme that makes the inside of your mouth sore when you eat raw pineapple) but there’s no good data yet on the exact range of side-effects it is when squirted up your nose on a regular basis. If the side-effects aren’t too bad, the next step will presumably be a controlled trial looking at the effects on the virus, and then a larger controlled trial looking at whether it actually benefits the patients. That’s the usual testing procedure for new treatments.

The US oleander-based treatment† has lab test data showing it stops cells being infected. The manufacturer wants to get the drug into use rapidly, either in controlled trials or, if necessary, by calling it a ‘dietary supplement’† and evading drug approval rules.

In this case, the need for safety studies is obvious.  The data on viral replication in the lab looked at concentrations down to 0.05 mg/ml, or 50ng/ml. The lethal† blood concentration has been estimated as 10ng/ml.  A previous study of it as a cancer treatment saw ‘dose-limiting toxicities’ at about 2ng/ml — and this is ‘dose-limiting toxicity’ in the context of untreatable cancers, so it has to be pretty brutal.

There will obviously be some low enough dose that’s safe, but there’s currently not the slightest reason to expect those low doses to be effective. You should want lab studies at plausible human doses, and probably animal studies, before you tried giving this to patients, and before you tried advertising it to the President of the United States and the broader public.

 

† Do not eat/drink/smoke oleander. It will kill you unpleasantly.

August 18, 2020

Super Rugby Australia Predictions for Round 8

Team Ratings for Round 8

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Brumbies 2.23 4.67 -2.40
Reds -1.95 -0.31 -1.60
Rebels -2.94 -5.52 2.60
Waratahs -3.84 -7.12 3.30
Force -11.78 -10.00 -1.80

 

Performance So Far

So far there have been 14 matches played, 11 of which were correctly predicted, a success rate of 78.6%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Force vs. Waratahs Aug 14 8 – 28 -6.00 TRUE
2 Reds vs. Rebels Aug 15 19 – 3 3.80 TRUE

 

Predictions for Round 8

Here are the predictions for Round 8. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Reds vs. Force Aug 21 Reds 14.30
2 Brumbies vs. Waratahs Aug 22 Brumbies 10.60

 

Rugby Premiership Predictions for Round 15

Team Ratings for Round 15

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Exeter Chiefs 10.01 7.99 2.00
Saracens 7.01 9.34 -2.30
Sale Sharks 5.77 0.17 5.60
Wasps 2.25 0.31 1.90
Gloucester 0.91 0.58 0.30
Bristol 0.01 -2.77 2.80
Bath -1.12 1.10 -2.20
Northampton Saints -1.44 0.25 -1.70
Harlequins -1.47 -0.81 -0.70
Leicester Tigers -3.08 -1.76 -1.30
London Irish -6.14 -5.51 -0.60
Worcester Warriors -6.50 -2.69 -3.80

 

Performance So Far

So far there have been 84 matches played, 55 of which were correctly predicted, a success rate of 65.5%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Harlequins vs. Sale Sharks Aug 15 16 – 10 -3.90 FALSE
2 Worcester Warriors vs. Gloucester Aug 15 15 – 44 -0.10 TRUE
3 Exeter Chiefs vs. Leicester Tigers Aug 15 26 – 13 18.20 TRUE
4 Bath vs. London Irish Aug 16 34 – 17 8.50 TRUE
5 Bristol vs. Saracens Aug 16 16 – 12 -3.40 FALSE
6 Northampton Saints vs. Wasps Aug 17 21 – 34 2.50 FALSE

 

Predictions for Round 15

Here are the predictions for Round 15. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Sale Sharks vs. Exeter Chiefs Aug 22 Sale Sharks 0.30
2 Gloucester vs. Bristol Aug 22 Gloucester 5.40
3 Wasps vs. Worcester Warriors Aug 22 Wasps 13.20
4 Saracens vs. Harlequins Aug 22 Saracens 13.00
5 London Irish vs. Northampton Saints Aug 22 Northampton Saints -0.20
6 Leicester Tigers vs. Bath Aug 23 Leicester Tigers 2.50

 

NRL Predictions for Round 15

Team Ratings for Round 15

The basic method is described on my Department home page.
Here are the team ratings prior to this week’s games, along with the ratings at the start of the season.

Current Rating Rating at Season Start Difference
Storm 15.10 12.73 2.40
Roosters 10.18 12.25 -2.10
Raiders 5.83 7.06 -1.20
Panthers 5.07 -0.13 5.20
Eels 4.43 2.80 1.60
Rabbitohs 2.59 2.85 -0.30
Sharks 1.56 1.81 -0.20
Knights -0.38 -5.92 5.50
Sea Eagles -1.25 1.05 -2.30
Wests Tigers -1.90 -0.18 -1.70
Dragons -3.20 -6.14 2.90
Warriors -5.15 -5.17 0.00
Bulldogs -5.76 -2.52 -3.20
Cowboys -6.49 -3.95 -2.50
Broncos -11.22 -5.53 -5.70
Titans -11.40 -12.99 1.60

 

Performance So Far

So far there have been 112 matches played, 77 of which were correctly predicted, a success rate of 68.8%.
Here are the predictions for last week’s games.

Game Date Score Prediction Correct
1 Roosters vs. Storm Aug 13 6 – 24 -1.30 TRUE
2 Panthers vs. Warriors Aug 14 18 – 12 15.80 TRUE
3 Eels vs. Dragons Aug 14 12 – 14 11.00 FALSE
4 Sharks vs. Titans Aug 15 30 – 18 15.50 TRUE
5 Cowboys vs. Rabbitohs Aug 15 30 – 31 -7.90 TRUE
6 Raiders vs. Broncos Aug 15 36 – 8 17.90 TRUE
7 Wests Tigers vs. Bulldogs Aug 16 29 – 28 6.60 TRUE
8 Knights vs. Sea Eagles Aug 16 26 – 24 3.10 TRUE

 

Predictions for Round 15

Here are the predictions for Round 15. The prediction is my estimated expected points difference with a positive margin being a win to the home team, and a negative margin a win to the away team.

Game Date Winner Prediction
1 Eels vs. Storm Aug 20 Storm -8.70
2 Panthers vs. Sharks Aug 21 Panthers 5.50
3 Broncos vs. Dragons Aug 21 Dragons -6.00
4 Titans vs. Raiders Aug 22 Raiders -15.20
5 Wests Tigers vs. Roosters Aug 22 Roosters -10.10
6 Rabbitohs vs. Sea Eagles Aug 22 Rabbitohs 5.80
7 Bulldogs vs. Warriors Aug 23 Bulldogs 3.90
8 Knights vs. Cowboys Aug 23 Knights 8.10

 

August 15, 2020

Lotto, luck, and risk perception

There’s a big Lotto jackpot today, so it is my duty as a statistician to  write something about how people misunderstand probability. I don’t make the rules.

So, last week, I did a bogus poll on Twitter

The results don’t tell you anything about any useful population,  because it was a bogus poll on Twitter, but it’s still interesting how many responses fell into the  trap.

Last week, we had headlines about ‘lucky’ stores to buy Lotto tickets.  Now, as you know, the probability that one of your chosen combinations wins does not depend at all on where you  bought the ticket, or how you chose the numbers. However, it is true that a shop which has sold winning tickets in the past is more likely to sell winning tickets in the future. Selling a winning ticket in the past is more likely for an outlet that sells lots of tickets, and an  outlet that sells lots of tickets is more  likely to sell  winning tickets in the future.  On top of that, it appears that people like to buy their tickets from outlets that have sold winning tickets in the past, which will increase the sales and therefore increase the number of future winners.  It doesn’t help the gamblers, but it does help the outlet.

In fact, saying it doesn’t help the  gamblers isn’t quite correct.   You’d hope, given the odds, that many people buying Lotto tickets were doing it primarily for entertainment (the cash return on tickets is negative, but that’s also true of beer and movies and rugby games and restaurant dinners).  Given that, anything that increases the entertainment value helps the gamblers,  and buying  from a ‘lucky’ store could count as a plus.

I also want to talk about a story in the Herald. It’s good on the  odds of winning and the impact of a ‘must win’ draw  — and quotes Dr Matt Parry, from Otago, which is generally a good indicator.  However, a separate part of the story says

Your chances of winning Powerball – one in 38 million – are less likely than you being struck by lightning – one in 280,000 – on your way to buy the ticket.

That seems not just wrong but incredibly wrong.   The story says “[m]ore than 1.9 million tickets were sold for the previous $50m must-be-won Powerball jackpot” (and that’s tickets, not lines), so at one in 280,000 we’d expect six or seven people to have been struck by lightning on their way to buy tickets.  According to this Radio NZ story, there were 13 ACC claims for lightning injury  in 3 years, and while that would leave out fatal injuries, the story also says a minority are fatal.  There’s no way you have a 1 in  280,000  chance of being struck by lightning on a routine shopping trip.

So where does this number come from? Well, the US also has a Powerball lottery, which has even lower odds of winning: one in  292 million. And there are news stories there with vaguely similar numbers.

The odds of grabbing the grand prize are 1 in 292.2 million, according to the game’s own assessment. To put this in context, your chances of being killed by a lightning strike are approximately 1 in 161,000. The odds of being killed in a shark attack are 1 in 3.7 million.

Even getting hit by a meteorite is more likely than winning the Powerball — 1 in 1.9 million.

The lightning-strike number looks to be a lifetime risk in the US, where lightning is more common than New Zealand, not the risk per shopping trip.

There are other problems with the numbers.  The 1 in 1.9 million for getting hit by a meteorite  is staggeringly wrong given that only one person in US history has been hit by a meteorite,  back in 1954.

How did the  1 in 1.9 million figure get past editing? Well, it probably wasn’t checked, but  there is a related number that’s arguably correct.  If you calculate the probability of dying due to a meteorite impact,  you have to consider the entire range of impacts from something the size of a golf  ball up to a significant asteroid.  The dinosaurs were wiped out (in part, probably) by an asteroid impact, 66 million years ago, so it would be reasonable to assume a risk in the ballpark of 1 in 100 million per year, giving a lifetime risk of experiencing the impact for an individual of one in a million or so.   Putting that together with expected  deaths from a major impact, it’s not unreasonable to get a 1 in two million risk for an individual  of dying because of an asteroid impact. On  the other hand, that’s not getting hit by a meteorite, and they shouldn’t be giving the number to two digits accuracy when even the order of magnitude must be uncertain.

So, if you’re in New Zealand and doing a careful risk assessment before buying a lottery ticket today, you probably don’t need to worry about lightning or low-flying rocks, but you should wear a mask. And if you’re in Auckland, maybe go to a local store or buy online.