Can a referee cost a team a win?

Complaining about the performance of the referee goes hand in hand with sporting contests. It’s currently very topical in the rugby world, as in the weekend just past a controversial referring decision denied England a try against the All Blacks late in the game. This has many, especially some in the English media, claiming the referee cost England the game.

It seems like a good time then for us to answer the age old question, can a referee cost a team a win?

We’ve all heard people claim that the referee cost their team the game. We’ve probably all also heard the extra qualification that is sometimes added, that a referee can cost a team the game, but only if they lose by less points than the decision cost them.

An example of this type of thinking comes from the coach of the Australian national rugby team, Michael Cheika.

“We can only call it a turning point if we lose by less than one score, prehaps?” Michael Cheika

Cheika is referring to a decision that cost his team, the Wallabies, a try. The Wallabies then went on to be well beaten by more than a try in that game, and as a result Cheika feels he can’t call this a turning point. But if they had of lost by a try or less, he would presumably be comfortable with calling it a turning point.

Now I can’t be sure what Michael Cheika intended by a turning point, but in this post we need an objective definition of a turning point. We’ll assume that in a sporting contest a turning point is a significant shift in the probability of winning from one team to another (apologies if that’s not what you meant coach Cheika).

So let’s consider not only the question of whether or not a referee can cost a team a game, but also whether or not they can only cry foul if they lose by less than the decision cost them.

To answer these questions we don’t actually need to do any calculations or simulations. The question can be answered with a conceptual understanding of probability. However, we will use a simulation based approach as it will help guide us toward this understanding in a more visual way.

Note that we’ll be assuming that the refereeing decisions we discuss here were incorrect. Whether they were or not is a debate for another day. The simulation results presented will also be for matches between two evenly matched professional level teams. By extension we will also be assuming any real world games discussed below were between relatively evenly matched teams. This might be a stretch in some cases, but it doesn’t detract from the points we are trying to illustrate here. In the future we will look at simulations between teams with varying degrees of mismatching. Finally, we will restrict ourselves here to considering situations where we assume an incorrect referring decision almost immediately resulted in the scoring of points which we assume to be attributable to the decision.

With that said, let’s get to it.

The graph below was composed from 5000 Monte Carlo simulations of full rugby matches. It shows the probability of going on to win a game when trailing by 1 to 7 points from the 40th to the 76th minute of the match.

Probability of winning when trailing by 1 to 7. Values calculated by counting values in bin widths of 4 min from 5000 Monte Carlo simulations. Data points are plotted at the center of the time range of each bin.

The graph shows that at the beginning of the half being behind by 1 to 7 drops a teams chances of winning the match to about 35%. From there the chances of winning decline as the match progresses. This decline becomes noticeably steeper as the final 15 minutes of the match are entered. This probability will tend to zero as the final whistle approaches. But in a one score game, it will never quite get there. There will always be hope as long as there is time on the clock.

Note, that if we did more simulations the trend in the graph would likely be smoother, and the values more precise. As it stands they should be considered to have a precision of about plus or minus 3%.

Now, from this position of losing by 1 to 7 points, let’s assume that a bad refereeing decision directly results in a try scored by our opponents under the posts, so that the conversion is assumed a formality. We go down by an additional 7 points, and now move to the category of losing by 8 to 14 points. The graph below shows this transition.

Probability of winning when trailing by 1 to 7 and 8 to 14. Values calculated by counting values in bin widths of 4 min from 5000 Monte Carlo simulations. Data points are plotted at the center of the time range of each bin.

The trend in the graph remains similar, but it shifts down considerably, a clear turning point. In fact it shifts down by a fairly consistent margin of about 20% at all time values. So, a bad refereeing call which results in 7 points against us in the second half when we are already trailing by 1 to 7, reduces our chances of winning by about 20%. If this bad call occurs at the beginning of the half, our chances of winning drop to about 18%. But if it occurs around the 76th minute, they are reduced to around 1%.

Let’s go one final step further, by considering the impact of a bad call which transitions a team already losing by 8 to 14 to losing by 15 to 21. This is shown  in the graph below.

Probability of winning when trailing by 8 to 14 and 15 to 21. Values calculated by counting values in bin widths of 4 min from 5000 Monte Carlo simulations. Data points are plotted at the center of the time range of each bin.

We see that the bad decision early in the half drops our chances of winning from about 18% to around 8%. Late in the half they drop from around 1% to pretty much 0%. A drop to 0% will occur whenever it is not practically possible for us to win the game in the time remaining.

This analysis allows us to make three important statements about the impact of poor refereeing decisions of the type we are considering, and in doing so answers our question of whether or not the referee can cost us the game.

1. Reducing the probability of winning – the most common result of a bad decision is to reduce the probability of winning, but still leave us with a chance to win the game.

2. Denying the opportunity to win – the reduction in the probability of winning can result in it becoming zero. This occurs when there is not sufficient time left in the match to score the points required to win. In this case the referee has denied us the opportunity to win. This is subtly different from costing us the game, since if the decision had not been made both teams would still have a chance to win and therefore we can’t be sure we would have won.

3. Costing the game – if a refereeing decision denies us the chance to put our opponent in a position in which their probability of winning would be reduced to zero, and they then go onto win the match, that decision will have categorically cost us a win.

So the referee can cost us the game.

What about the idea that a referee can only cost a team the game if they lose by less points than the decision cost their team?

Well, interestingly there is some truth to this.

Consider the situation where the criteria for costing a game is met as specified above. A team needs $latex x$ scores to give them the win, which will put them a single score ahead of their opposition. There isn’t time for $latex x + 1$ scores. So if the team who is $latex x$ scores ahead is denied an opportunity to go $latex x + 1$ scores ahead due to a poor refereeing decision, and their opponent then goes onto win, it can only be by scoring $latex x$ times, since that is all their is time for. They will win be one score (or less in a multipoint scoring system).

Notice I said there is some truth to this. That’s because there can be many situations where a refereeing decision will cost a team a given amount of points and they then go onto lose by less than that amount, but the criteria for costing the game will not be met. For example, we saw earlier that at the beginning of the half when we are trailing by 1 to 7 a bad decision which results in a try against us reduces our chances of winning to 18%. We still have a chance to win, so even if we go onto lose by less that the bad decision cost us, we can’t claim it cost us the game. So, while a team who has been cost the game will lose by less than the decision cost them, losing by less than a decision cost a team does not guarantee they were cost the game.

What about a turning point? Do we have to lose by less than the amount a bad call cost us in order to consider ourselves disadvantaged by the decision. In terms of a turning point, which we consider to be a swing in the probability of winning, the answer is obviously, no. We have seen a bad refereeing call can result in a large drop in the probability of winning, regardless of what happens next. The final score is irrelevant. Likewise, the referee can deny us the opportunity to win without the final score having any relevance.

In fact, in a sport like rugby, large turning points will be by far the most common way a poor refereeing decision will impact a team. Being denied the opportunity to win will be less common, and being cost the game even less common again. So when we fixate on decisions where a team may have been cost the game, or very nearly so, we miss out on identifying many other opportunities for refereeing improvement.

In my opinion, we should be focusing on turning points. They are much more common, and therefore provide many more opportunities for improvement. The situation is often made worse by the fact that teams disadvantaged by a turning point can still go onto win. Unfortunately, most teams don’t complain as loud, if at all, when they win. So a lot of opportunities to highlight and improve our refereeing slip under the radar. One of the advantages of quantifying the effect of bad calls in the way we have done here, is that they are less likely to be overlooked because we can put a value on their effect, regardless of the end result of the match.

To highlight what we have discussed, let’s have some fun by looking at how much some famous real life controversial refereeing decisions impacted the teams involved. I have prepared a look up table below to assist us, and will look at a few well known games that I found interesting (I’m a New Zealander so forgive my bias selection of games involving the All Blacks). Feel free to apply the same process to any other games that interest you. Once again, treat the values as about plus or minus 3% precision (which explains why, for example, we see some small upward fluctuations when moving from one cell to the one below it).

Probabilities of winning (%) when losing or winning by various margins at different times in the match. L = Losing, W = Winning, D = Draw.

Match 1 – New Zealand 37 Australia 10 – October 18 2016 Eden Park, Auckland.

This is the game Michael Cheika was referring to in the quote at the beginning of this article. With about 45 minutes on the game clock the Wallabies were trailing by 15 to 10 when Henry Speight scored a try which would subsequently be disallowed to the dismay of many.

Using the table above the Wallabies had a 32.4% chance of winning (Row >44 to <= 48, Column L 4-6), and were denied the opportunity to move to a 51.6% chance of winning (Row >44 to <= 48, Column D/W 0-3). In making his decision the referee reduced the Wallabies chances of winning by 19.2%.

The decision did not cost the game, and it did not deny the opportunity to win. It did result in a major turning point, which disadvantaged the Wallabies. By focusing on whether or not they lost by less than the decision cost them, coach Cheika and the Wallabies missed the opportunity to identify a crucial turning point. The failure to highlight this makes it less likely that an opportunity for refereeing improvement was identified and subsequent action taken (action is always more likely when people complain).

Match 2 – England 15 New Zealand 16 – November 10 2018, Twickenham.

At the time of writing, this game occurred in the weekend just past. With about 75 minutes on the game clock England were trailing by 15 to 16 when Sam Underhill scored a try which would subsequently be disallowed due to an offside infringement.

Using the table above England had a 28.2% chance of winning (Row >72 to <= 76, Column L 1-3), and were denied the opportunity to move to a 76.1% chance of winning (Row >72 to <= 76, Column W 4-6). In making his decision the referee reduced England’s chances of winning by q whopping 47.9%.

This decision is still being debated. Assuming it is incorrect, while it did not cost England the game, or deny them an opportunity to win, you can see it had a huge impact on England’s chances of winning. A very big turning point indeed.

Match 3 – France 20 New Zealand 18 – October 6 2007, Cardiff.

This game was a quarter final match in the 2007 Rugby World Cup. With just over 68 minutes on the game clock New Zealand were winning by 18 to 13 when France scored a try after a forward pass went unnoticed by the referee and his touch judges.

Using the table above New Zealand had a 72.3% chance of winning (Row >68 to <= 72, Column W 4-6), which was reduced to a 33.7% chance of winning (Row >68 to <= 72, Column L 1-3). New Zealand’s chances of winning were reduced by about 38.6%.

This game is still talked about on a regular basis in New Zealand. Just buy a New Zealander a beer, and ask him about this game. When he tells you that referee Wayne Barnes cost New Zealand the game, have some fun correcting him. Tell him New Zealand were not cost the game, or denied the opportunity to win. It was simply a turning point which reduced their chances of winning. In fact they still had about a 1 in 3 chance of winning. They just didn’t take it!

Let’s summarise.

A referee can cost a team a win. When this happens, that team will lose by a score or less. However, losing by a score or less even when a team was cost a score by a poor refereeing decision doesn’t guarantee they were cost the game.

In addition to being cost they game, a refereeing decision can also deny a team the opportunity to win. However, the most common result of a poor refereeing decision is to cause a turning point which shifts the probability of winning toward one team and away from the other. If we focus on and quantitatively understand turning points, as we have done here, we identify many more opportunities for the improvement of referee decisions in crucial situations. This in turn, provides us with more opportunities to improve the spectacle of the game for all involved.

Finally, note that although we have restricted ourselves to situations in which points were scored. We need not. In the future we could also readily determine what the effect of a incorrect decision such as a knock on call has on the probability of a team winning at various stages of the game and various locations on the field. We can also look at player decisions. For example should a player take a tackle or chip ahead in a certain situation? Should we kick for touch or take a shot at goal in a certain situation? In other words, which option will maximise our chances of winning?

How much does size matter in rugby?

We all know size matters in rugby. In fact it’s a big part of what makes the sport such a spectacle. There’s something primal about high speed collisions.

But how much does size matter? How important is it to winning a rugby game?

With the next Rugby World Cup rapidly approaching, we’ll try and answer this question by taking a look back to the very first Rugby World Cup which took place in New Zealand in 1987.

The world has changed a lot since 1987, and the size of rugby players is no exception. To see how this has affected the game, we will take the world cup winning All Black teams from both 1987 and 2015 and play them against each other in 2000 Monte Carlo simulations. For those looking for an explanation of how such simulations work, refer back to the first post.

Because we are interested only in the effects of size (height and weight), we’ll give  all the players equal skills (e.g. handling and tackling) and physical abilities (e.g. speed and acceleration), but allow their height and weight to be set to their actual values. For simplicity, their equal skills and physical abilities will be set at the level of a typical modern day rugby player in each position, since we already have reasonable input data for this.

The table below presents the starting lineups, including a comparison of the heights and weights of the 1987 and 2015 All Black sides. The bracketed values give the relative increase or decrease of the 2015 side over the 1987 side.

Comparison of the heights and weights of the starting lineups of the 1987 and 2015 Rugby World Cup winning teams.
 1987Height
(cm)
Weight
(kg)
2015Height
(cm)
Weight
(kg)
1Steve
McDowall
182102Joe
Moody
188
(+6)
120
(+18)
2Sean
Fitzpatrick
183105Dane
Coles
184
(+1)
108
(+3)
3John
Drake
18399Owen
Franks
185
(+2)
118
(+19)
4Murray
Pierce
198107Brodie
Retallick
204
(+6)
121
(+14)
5Gary
Whetton
198105Sam
Whitelock
203
(+5)
120
(+15)
6Alan
Whetton
193100Jerome
Kaino
195
(+2)
109
(+9)
7Michael
Jones
18598Richie
McCaw
187
(+2)
107
(+9)
8Wayne
Shelford
189107Kieran
Read
193
(+4)
111
(+4)
9David
Kirk
17373Aaron
Smith
169
(-4)
83
(+10)
10Grant
Fox
17572Dan
Carter
180
(+5)
92
(+20)
11Craig
Green
17879Julian
Savea
192
(+14)
103
(+24)
12Warwick
Taylor
17979Ma'a
Nonu
182
(+3)
108
(+29)
13Joe
Stanley
17883Conrad
Smith
187
(+9)
95
(+12)
14John
Kirwan
191100Nehe
Milner-Skudder
180
(-11)
90
(-10)
15John
Gallagher
19085Ben
Smith
186
(-4)
94
(+9)

On average the 2015 All Blacks are taller than their 1987 counterparts by a fairly modest 3 cm. All but three players in the 2015 side are taller than the player in the same position on the 1987 side. The biggest difference between the same position is in the left wing spot where 2015’s Julian Savea is 14 cm taller than 1987’s Craig Green.

The differences in weight are more striking.

On average the 2015 All Blacks are heavier by 12 kg. All but one player in the 2015 side is heavier than the player in the same position on the 1987 side. The biggest difference between the same position is at inside centre where 2015’s Ma’a Nonu is a whopping 29 kg heavier then 1987’s Warwick Taylor. In the fly-half position, two All Black greats square off in 2015’s Dan Carter and 1987’s Grant Fox. Carter is not considered exceptionally big in the modern game, but still weighs in 20 kg heavier than Fox.

It’s not all bad news for 1987. One player is taller and heavier than his 2015 version. All Black legend John Kirwan is 11 cm taller and 10 kg heavier than 2015’s Nehe Milner-Skudder in the right wing position. Perhaps this is part of the reason why Kirwan was so good in his era?

So, let’s run our 2000 match simulations and see how much of the size advantage of the 2015 team translates into winning advantage. The chart below presents the results.

The results show that the 2015 All Blacks win 59% of the games, 38% are won by the 1987 All Blacks, with the remaining 3% ending in draws. Two perfectly evenly matched sides will each win around 48.5% of matches, so through increased size alone the 2015 All Blacks have increased their chances of winning over 1987 side by about 10%.

So what causes this advantage? Is it the increased height, increased weight, or a combination of the two?

To answer this question we will return to running some simulations with input consistent with two evenly matched professional rugby sides. The same input used in previous posts. When doing this but allowing one team to be 10 cm taller than the other in every position and running 2000 match simulations, no significant difference in winning percentage was observed. This tells us that a typical professional rugby team is already sufficiently tall, and that height for heights sake is not of much further use to them.

Why might this be? Well, most of the aerial contests in rugby in general play involve the ball coming down from a relatively steep angle. They are contested by jumping, and it is a players ability to jump high rather than be tall that is important in these contests. Tall players might be more useful in the lineout, but it would appear that a 200 cm lock is already sufficiently tall, being 210 cm tall does not appear to improve the teams chances of winning. At least not in a statistically significant way over 2000 matches.

Assuming no significant interaction between height and mass, it must be that the improvement in winning is a result of increased mass only. In this regard we note that height is still a useful attribute to a rugby team, as a large frame likely comes with more mass. We also note that mass is not important to a rugby team in and of itself. It’s when is combines with velocity to create momentum that it becomes, well, hard to stop (excuse the physics pun).

The graph below gives us a closer look at the effects of increased mass by comparing our two evenly matched rugby sides whilst allowing one of them to progressively increase the weight of every player on the team, whilst holding all of their other attributes constant. Each data point represents 2000 match simulations.

The effect of increasing player weight on the winning percentage of a typical professional rugby team.

The results show that the winning percentage increases from around 48.5% to 63% over a 20 kg range. The line is in fact highly linear ($latex R^2=0.999$), with a slope of 0.74 win percent per kg. This means that, all other things being equal, our typical professional rugby team will gain about 3/4 of a percent in winning probability for each kg in weight they gain over the range we have tested. Another way of stating this is for every 1.4 kg they gain they will increase their probability of winning by 1%. The key here is that this weight must be gained in such a way that all other things remain constant. For example, they must not lose speed, fitness or have the quality of their skill execution decline. Achieving mass gain without any such reductions will require skill on the part of the strength, conditioning, and coaching staff.

In any case, we have made an important step in this post by moving from knowing intuitively that size is important to performance in rugby, to being able to quantify its value. We have also importantly been able to prioritise mass as being more important than height. Among other explorations, we’ll continue to progress toward the goal of determining not just what is important, but how important it is. For example, we know mass is important, but is it more important the speed or handling? The answer will of course be different for different teams depending on their relative strengths and future potential, but whatever the case, determining relative importance of any parameter that is related to performance will ultimately allow us to prioritise our training efforts and maximise our chances of winning. After all, that’s what it’s all about.

If you enjoyed this post, don’t forget to subscribe by entering your email above. Feel free to share this article too.