Accessibility / Report Error

SIMULATION-BASED METHODOLOGY FOR PREDICTING FOOTBALL MATCH OUTCOMES CONSIDERING EXPERTS' OPINIONS: THE 2010 AND 2014 FOOTBALL WORLD CUP CASES

ABSTRACT

In this paper we propose a simulation-based method for predicting the 2010 and 2014Football World Cup. Adopting a bayesian perspective, we modeled the number of goals of two opposing teams as a Poisson distribution whose mean is proportional to the relative technical level of opponents. FIFA ratings were taken as the measure of technical level of teams and experts' opinions about scores of matches were taken to construct prior distribution of parameters. Just before each round, tournament simulations were performed in order to estimate probabilities of events of main interest for audience and bettors such as qualifying to the knockout stage, reaching semi-finals, reaching the final match, winningthe tournament, among others.

Keywords:
Bayesian inference; Football World Cup; forecasting; poisson distribution; simulation

1 INTRODUCTION

The World Cup tournament (WCT) organized by FIFA (Fédération Internationale de Football Association, French for International Federation of Football Association) is the most important international football championship, which occurs every four years. The first tournament took place in Uruguay in 1930 with 13 invited countries and Uruguay became the first champion. In the current format, the competition gathers 32 teams, the host nation(s) has a guaranteed place and the others are selected from a qualifying phase which occurs in the three-year period preceding the tournament.

The tournament is composed by a group stage followed by a knockout stage. In the group stage, the teams play against each other within their group and the top two teams in each group advance to the next stage. In the knockout stage, 16 teams play one-off matches in a single-elimination system, with extra time of 30 minutes (divided in 2 halves of 15 minutes each) and penalty shootouts used to decide the winners when necessary.

In the literature, several articles are directed to football prediction, mostly applied to championship leagues. Keller (1994) has fitted the Poisson distribution to the number of goals scored by England, Ireland, Scotland and Wales in the British International Championship from 1883 to 1980. Lee (1997) also considers the Poisson distribution, but allows for the parameter to depend on a general home-ground effect and individual offensive and defensive effects. (Karlis & Ntzoufras, 200910 KARLIS D & NTZOUFRAS I. 2009. Bayesian modelling of football outcomes: Using the Skellam's distribution for the goal difference. IMA J. Manag. Math., 20: 133-145.) applied the Skellam's distribution to model the goal difference between home and away teams. The authors argue that this approach does not rely neither on independence nor on the marginal Poisson distribution assumptions for the number of goals scored by the teams. A bayesian analysis for predicting match outcomes for the English Premiere League (2006-2007 season) is carried out using a log-linear link function and non-informative prior distributions for the model parameters.

Taking another approach, Brillinger (2008) proposed to model directly the win, draw and loss probabilities by applying a trinomial regression model to the Brazilian 2006 Series A championship. By means of simulation, it is estimated for each team the total points, the probability of winning the championship and the probability of ending the season in the top four places.

To model the number of goals scored by teams in each match, (Baio & Blangiardo, 20102 BAIO G & BLANGIARDO M. 2010. Bayesian hierarchical model for the prediction of footballresults. Journal of Applied Statistics, 37(2): 253-264.) proposes a Bayesian hierarchical model, while (Koopman & Lit, 201512 KOOPMAN SJ & LIT R. 2015. A dynamic bivariate poisson model for analysing and forecasting match results in the english premier league. Journal of the Royal Statistical Society Series A, 178(1): 167-186.) assumes a bivariate Poisson distribution with intensity coefficients that change stochastically over time. Where as (Louzada et al., 201413 LOUZADA F, SUZUKI AK & SALASAR LEB. 2014. Predicting match outcomes in the englishpremier league: Which will be the final rank? Journal of Data Science, 12: 235-254.) assumes a univariate Poisson distribution and consider linear models that express the sum and the difference of goals scored in terms of covariates. Moreover, (Alves et al., 20111 ALVES AM, DE MELLO JCCBS, RAMOS TG & SANT'ANNA AP. 2011. Logit models for the probability of winning football games. Pesquisa Operacional, 31(3): 459-465.) propose logit models for the probability of winning football games, (Calôba & Lins, 20065 CALÔBA GM & LINS MPE. 2006. Performance assessment of the soccer teams in brazil using dea. Pesquisa Operacional, 26(3): 521-536.) assess the performance of soccer teams in Brazil using DEA, (Dobson & Goddard, 20037 DOBSON S & GODDARD J. 2003. Persistence in sequences of football match results: A monte carlo analysis. European Journal of Operational Research, 148(2): 247-256.) provided a a Monte Carlo analysis of the persistence in sequences of football match results, (Koning et al., 200311 KONING RH, KOOLHAAS M, RENES G & RIDDER G. 2003. A simulation model for footballchampionships. European Journal of Operational Research, 148(2): 268-276.) provides a simulation model for football championships, (Scarf & Shi, 200816 SCARFF PA & SHI X. 2008. The importance of a match in a tournament. Computers & Operations Research, 35(7): 2406-2418.) determines the importance of a match in a tournament, and (Sant'Anna, 200815 SANT'ANNA AP. 2008. Rough sets analysis with antisymmetric and intransitive attributes: Classification of brazilian soccer clubs. Pesquisa Operacional, 28(2): 217-230.) proposes to classify Brazilian soccer clubs by considering rough sets analysis with antisymmetric and intransitive attributes.

In spite of the vast literature directed to League Championship prediction, few articles concern score predictions for the WCT (Dyte & Clarke, 2000; Volf, 200918 VOLF P. 2009. A random point process model for the score in sport matches. IMA J. Manag. Math., 20: 121-131.; Suzuki et al., 200917 SUZUKI AK, SALASAR LEB, LOUZADA-NETO F & LEITE JG. 2009. A bayesian approach for predicting match outcomes: The 2006 (association) football world cup. Journal of the Operational Research Society, 61: 1530-1539 (October 2010).; Bastos & da Rosa, 20133 BASTOS LS & DA ROSA JMC. 2013. Predicting probabilities for the 2010 fifa world cup games using a poisson-gamma model. Journal of Applied Statistics, 40(7): 1553-1544.). Probably, the shortage of researches on the WCT is due to the limited amount of valuable data related to international matches and also to the fact that few competitions confronts teams from different continents.

Dyte & Clarke (2000) presents a log-linear Poisson regression model which takes the FIFA ratings as covariates. The authors present some results on the predictive power of the model and also present simulation results to estimate probabilities of winning the championship for the 1998 WCT. (Volf, 200918 VOLF P. 2009. A random point process model for the score in sport matches. IMA J. Manag. Math., 20: 121-131.), using a counting processes approach, modeled the development of a match score as two interacting time-dependent random point processes. The interaction between teams are modeled via a semi-parametric multiplicative regression model of intensity. The author has applied his model to the analysis of the performance of the eight teams that reached the quarter-finals of the 2006 WCT. (Suzuki et al., 200917 SUZUKI AK, SALASAR LEB, LOUZADA-NETO F & LEITE JG. 2009. A bayesian approach for predicting match outcomes: The 2006 (association) football world cup. Journal of the Operational Research Society, 61: 1530-1539 (October 2010).) proposed a bayesian methodology for predicting match outcomes using experts' opinions and the FIFA ratings as prior information. The method is applied to calculate the win, draw and loss probabilities for each match and also to estimate classification probabilities in group stage and winning tournament chances for each team on the 2006 WCT. (Bastos & da Rosa, 20133 BASTOS LS & DA ROSA JMC. 2013. Predicting probabilities for the 2010 fifa world cup games using a poisson-gamma model. Journal of Applied Statistics, 40(7): 1553-1544.) present a Bayesian Poisson-Gamma model where the priors distributions are chosen using historical information for predicting the 2010 WCT.

In this article, we proposed a bayesian method for predicting match outcomes with use of the experts' opinions and the FIFA ratings as prior information, but differently from (Suzuki et al., 200917 SUZUKI AK, SALASAR LEB, LOUZADA-NETO F & LEITE JG. 2009. A bayesian approach for predicting match outcomes: The 2006 (association) football world cup. Journal of the Operational Research Society, 61: 1530-1539 (October 2010).), for all 48 matches of the first phase (group stage in which each team played three matches) the experts' opinions were provided before the beginning of the 2010 and 2014 WCTs. The motivation for such purpose was the difficulty to get the experts' opinions (4 sportswriters contributed with their opinions) at the end of each round of group stage. The drawback is that these matches were played within 15 days, mostly in different dates and times. For the second phase, the experts' opinions were provided before each round. An attractive advantage of ourapproach is the possibility of calibrating the experts' opinions, directing for a control on the model prediction capability.

We used the predictive distributions to perform a simulation based on 10,000 runs of the whole competition, with the purpose of estimating various probability measures of interest, such as the probability that a given team wins the tournament, reaches the final, qualifies to the knockout stage and so on.

The paper is outlined as follows. In Section 2, we present the probabilistic model and expressions for priors and posterior distribution of parameters, as well predictive distributions. In Section 3, we present the method used to estimate the probabilities of winning the tournament and of reaching the final match. In Section 4, we present the results. In Section 5, we give our final considerations about the results and further work.

2 PROBABILISTIC MODEL

The probabilistic model is derived as follows. Consider a match between teams A and B with respective FIFA ratings RA and Rb. In the following we shall assume that, given the parameters λA and λB, the number of goals XAB and XBA scored by team A and B, respectively, are two independent random variables with

In this model, the ratings are used to quantify each team's ability and the mean number of goals A scores against B is directly proportional to team A's rating and inversely proportional to team B's rating. If A and B have the same ratings (RA = RB), then the mean score for that match is (λA, λB). So, the parameter λA can be interpreted as the mean number of goals team A scores against a team with the same ability and an analogous interpretation applies to λB.

2.1 Prior distribution

In order to formulate the prior distribution, a number of experts provide their expected final scores for the incoming matches which we intend to predict. This kind of elicitation procedure is natural and simple, since the model parameters are directly related to the number of goals and the requested information is readily understandable by the respondents not requiring any extra explanation. Differently from the usual empirical Bayes methods in which considers that the prior distribution is estimated from the data (Migon et al., 2014), we have adopted multiple experts since we believe it aggregates more information than using only one expert.

Assuming independent experts' opinions and following a Poisson distribution, we shall obtain the prior distribution for the parameters using a procedure analogous to the power prior method (Chen & Ibrahim, 2000) with the historical data replaced by the experts' expected scores. The proposed elicitation process is based on the assumption that the experts are able to provide plausible outcomes for the incoming matches that could be observed but in fact were not. This elicitation process is in accordance with the bayesian paradigm for prior elicitation as discussed in (Genest & Zidek, 19868 GENEST C & ZIDEK JV. 1986. Combining probability distributions: a critique and an annotated bibliography. Statistical Science, 1(1): 114-148.) and (Clemen & Winkler, 19996 CLEMEN RT & WINKLER RL. 1999. Combining probability distributions from experts in risk analysis. Risk Analysis, 19(2): 187-103.) as we shall see later in this section. Moreover, although the independence assumption is taken mainly because of mathematical simplicity, we can argue that, at least approximately, the independence assumption holds in our case since the selected experts work at different media and do not maintain any contact. They are sport experts on TV and newspapers specialised in sports.

Indeed, for predicting football match outcomes, various factors should be needed, such aschanges in tactical patterns of teams, scoring or conceding a goal, injury or expulsion of an important player, presence of crisis in a team, performance of a team against other in former matches, among others. This type of data, although important, can be difficult to access. Inparticular thinking about teams that are competing in a WCT, where the amount of matches that were played between them may be very small. For instance, Brazil only once played against Ghana in 2006. It is at this juncture, data difficult access and small amount, that fits the use of information arising subjective priori experts. It fills the lack of information when there is shortage of objective information (data) available. In or case, we shall emphasize the importance of the experts information on the quality of the predictions in Section 4.1 (please see Fig. 2).

Suppose we intend to predict a match between teams A and B. Consider that s experts provide their expected scores for m incoming matches of team A and B. Denote by yi, j the j-th expert's expected number of goals scored by team A against opponent OAi and by zi, j the expected number of goals scored by team B against opponent OBi, i=1,...,m, j=1,...,s. This information is summarized in Table 1.

Table 1
Experts' expected scores.

In the following, we shall assume as the initial information about the parameters the probability density function given by

and

where δ0 > 0 and β0 ≥ 0. Note that if δ0 = 1/2 and β0 = 0 we have the Jeffreys' prior for the Poisson model, if δ0 = 1 and β0 = 0 we have an uniform distribution over the interval (0, ∞) and if δ0 > 0 and β0 > 0 we have a proper Gamma distribution. The first two cases are usual choices to represent noninformative distribution for the parameter.

Updating this initial prior distribution with the experts' expected scores, we obtain the power prior of λA

where 0 ≤ a0 ≤ 1 represents a "weight" given to experts' information and D0 denotes all the experts' expected scores. Thus, if 0 ≤ a0 ≤ 1, the prior distribution of λA is

and if a0 = 0 the prior for λA is the initial Jeffreys' prior (3) and corresponds to disconsider all the experts' information. In particular if a0 = 1 the prior for λA equals to the posterior which would be obtained if all the expected scores were in fact real data. Thus, the a0 parameter can be interpreted as a degree of confidence in the experts' information.

The elicitation of the prior distribution (5) can also be viewed in the light of the bayesian paradigm of elicitation (Genest & Zidek, 19868 GENEST C & ZIDEK JV. 1986. Combining probability distributions: a critique and an annotated bibliography. Statistical Science, 1(1): 114-148.), when we consider for the experts information the likelihood

and combine it with the initial noninformative prior distribution (3) by applying the Bayes theorem. The likelihood (6) provides information for the parameter through the experts' information, which are treated like data, i.e, we assume them to follow a Poisson distribution.

Analogously, the prior distribution of λB is

It is important to note that by the way the experts present their opinions there is possibility of contradictory information. There is some literature on prior elicitation of group opinions directed towards to remove such inconsistencies. According to (O'Hagan et al., 200614 O'HAGAN A, BUCK CE, DANESHKHAH A, EISER JR, GARTHWAITE PH, JENKINSON DJ, OAKLEY JE & RAKOW T. 2006. Uncertain judgements: eliciting experts' probabilities. John Wiley, London.), they range from informal methods, such as Delphi method (Pill, 1971), which encourage the experts to discuss the issue in the hope of reaching consensus, to formal ones, such as weighted averages, opinion polling or logarithmic opinion pools. For a review of methods of pooling expert opinions see (Genest & Zidek, 19868 GENEST C & ZIDEK JV. 1986. Combining probability distributions: a critique and an annotated bibliography. Statistical Science, 1(1): 114-148.).

2.2 Posterior and predictive distributions

Our interest is to predict the number of goals that team A scored against team B, using all the available information (hereafter denoted by D). This information is originated from two sources: the experts' expected score and the actual scores of matches already played. So, we may be in two distinct situations: (i) we do have the experts' information but no matches have been played; (ii) we have both the experts' opinions and the scores of played matches.

In situation (i), we only have the experts' information. So, from the model (1) and the prior distribution (5), it follows that the prior predictive distribution of XAB is

where NB(r, γ) denotes the Negative Binomial distribution with probability function given by

with parameters r>0 and 0< γ <1.

Analogously, from model (2) and the prior distribution (7), it follows that the prior predictive distribution of XBA is given by

In situation (ii), assume that team A has played k matches, the first against team C1, the second against team C2, and so on until the k-th match against team Ck. Suppose also that, given λA, XA, C1, ..., XA, Ck are independent Poisson random variables with parameters

Hence, from model (1) it follows that the likelihood is given by

where xi, A are the number of goals scored by A against the i-th opponent, i=1,...,k.

From the likelihood (10) and the prior distribution (5), it follows that the posterior distribution of λA is

which implies by the model (1) that the posterior predictive distribution of XAB is

Analogously, the posterior distribution of λB is given by

where is the number of goals team B scores against the l-th opponent, Dl, l = 1, ..., k. Hence, from the model (2) and the posterior (13), it follows that the posterior predictive distribution of XBA is

At this point, it is important to note that matches taken to construct the prior distribution are distinct from those considered to the likelihood function, that is, the matches already played have their contribution (through their final scores) included in the likelihood function but not in the prior.

3 METHODS

In this section, we shall consider the competition divided into 7 rounds, where the first three rounds are in the group stage (first phase) and the last four in the knockout stage (second phase). The 4 experts' were asked for their expected final scores for the matches in 5 distinct times: just before the beginning of tournament and just before each of the 4 rounds in the knockout stage. At the beginning of competition, experts provided their expected final scores for all matches in the group stage at once, while in the knockout stage they provide their expected final scores only for matches in the incoming round. To account for the mean experts' opinion, we have chosen a0 = 1/(3 * 4) = 1/12 (for the group stage) and a0 = 1/4 (for the knockout stage), in the sense that the posterior distribution of the parameter is the same as that, which would be obtained if we took one observation equal to the mean expected score from the sampling distribution. It is important to note that the experts were selected from different sports media, in order to make their guesses as independent as possible. In the Section 4 in the tournament prediction, we addressed the issue of the a0 values choice.

In this paper, we have considered as the initial prior distribution (equations (3) and (4)) the Jeffreys' prior, i.e., δ0 = 1/2 and β0 = 0. We also have investigated other two possibilities for noninformative distributions for the parameters: the uniform prior distribution over the interval (0, ∞) (δ0 = 1 and β0 = 0), a vague Gamma prior (δ0 = 0.01 and β0 = 0.01) and a vague Exponential prior (δ0 = 1 and β0 = 0.01).

For the knockout stage, teams can play for additional 30 minutes if they remain level after the 90 minutes regulation time and if the result persists the teams proceed to a penalty shootout decision. For the extra time, we considered a Poisson distribution with parameter multiplied by 1/3 to account for the shrinkage of time, which is equivalent to 30 minutes of extra time. That is, 1/3 of the overall match time (90 minutes). For penalty shootout, we assumed that the probability of victory of team A versus team B equals the ratio a/(a+b) with a and b the posterior means of the number of goals of teams A and B, respectively.

The exact calculation of probabilities is possible just for the case of a single match prediction. We calculate the probabilities exactly from the predictive distributions. The probabilities regarding qualifying chances, winning tournament chances among others must be performed by simulation, since they may involves many combinations of match results.

3.1 Predictions for single matches

For a given match played by teams A and B, the probabilities of win (PW), draw (PD) and loss (PL) for team A against B is obtained from the predictive distributions by using the expressions

and

It is useful to consider the set of all possible forecasts given by the simplex set

S = {(PW, PD, PL) ∈ [0, 1]3: PW + PD + PL = 1}.

Observe that the vertices (1,0,0),(0,1,0) and (0,0,1) of S represent the outcomes win, draw and loss, respectively. Thus, a method used to measure the goodness of a prediction is to calculate the De Finetti distance (DeFinetti, 1972) which is the square of the Euclidean distance between the point corresponding to the outcome and the one corresponding to the prediction, e.g., if a prediction is (0.2,0.65,0.15) and the outcome is a draw (0,1,0), then the De Finetti distance is (0.2 - 0)2 + (0.65 - 1)2 + (0.15 - 0)2 = 0.185. Also, we can associate to a set of predictions the average of its De Finetti distances, known as the De Finetti measure. So, we shall consider the best among some prediction methods the one with the least De Finetti measure. Furthermore, the De Finetti value 2/3 can be considered as a reference for comparison, because an equiprobable predictor which assigns equal probability for all outcomes (PW = PD = PL = 1/3) has 2/3 as its De Finetti measure.

For the matches of the first stage, the De Finetti measure for the Gamma, Uniform and Exponential priors were 43.90%, 50.18% and 44.70% higher than De Finetti measure for the Jeffreys' prior, justifying our choice for the Jeffreys' prior.

Another standard way of measuring the goodness of a prediction method is calculating the percentage of correct forecasts, where a forecast shall be considered correct if the forecast with the greatest probability coincides with the observed outcome.

3.2 Predictions for the whole tournament

Besides the prediction of outcomes for a single match, we also use the predictive distributions to perform a simulation of 10.000 replicas of the whole competition. The purpose of this simulation is to estimate probabilities of events related to the competition as a whole such as the probability that a given team qualifies to the knockout stage, reaches the final, wins the tournament, and so on. The probabilities are estimated by the percentage of times the event of interest occurred and are presented in Section 4.

4 THE 2010 AND 2014 WCTS

The 2010 FIFA WCT took place in South Africa from 11 June to 11 July 2010. The following 32 teams were qualified for the tournament: Algeria, Argentina, Australia, Brazil, Cameroon, Chile, Denmark, England, France, Germany, Ghana, Greece, Honduras, Italy, Ivory Coast, Japan, Mexico, Netherlands, New Zealand, Nigeria, North Korea, Paraguay, Portugal, Serbia, Slovakia, Slovenia, South Africa, South Korea, Spain, United States, Uruguay, Switzerland. In the final, Spain defeated Netherlands 1-0 after extra time, to win the tournament.

The 2014 FIFA WCT took place in Brazil from 12 June to 13 July 2014. The following 32 teams were qualified for the tournament: Algeria, Argentina, Australia, Belgium, Bosnia & Herzegovina, Brazil, Cameroon, Chile, Colombia, Costa Rica, Croatia, Ecuador, England, France, Germany, Ghana, Greece, Honduras, Iran, Italy, Ivory Coast, Japan, Mexico, Holland, Nigeria, Portugal, Russia, South Korea, Spain, Switzerland, United States, Uruguay. In the final, Germany defeated Argentina 1-0 to win the tournament, securing the country's fourth world title. Brazil lost to Germany in the semi-finals and finished the tournament in fourth place.

In this section apply the methodology present in the paper for both 2010 and 2014 FIFA WCTs. We present some of the results obtained by the application of our modelling for the cases of a single match prediction with and without previous observed data. We also illustrate the simulation methods performed to obtain the probabilities of qualifying to the knockout stage, reaching quarter finals, semi finals, the final match and being the champion.

4.1 2010 WCT - Single Match Prediction

We illustrate the predictions for the match Argentina versus Nigeria in Group B which occurredin the first round. The mean experts' expected final score for this match was 2-0.75 for Argentina against Nigeria. We observe a very large difference in skill between the two teams expressed by their different FIFA ranking positions, 7-th and 18-th (please see Table 10 in appendix APPENDICE A: FIFA RATINGS Table 10 FIFA ratings prior to 2010 WCT. Team Group FIFA ratings Team Group FIFA ratings Brazil G 1611 Australia D 886 Spain H 1565 Nigeria B 883 Portugal G 1249 Switzerland H 866 Netherlands E 1231 Slovenia C 860 Italy F 1184 Côte d'Ivoire G 856 Germany D 1082 Algeria C 821 Argentina B 1076 Paraguay F 820 England C 1068 Ghana D 800 France A 1044 Slovakia F 777 Greece B 964 Denmark E 767 USA C 957 Honduras H 734 Serbia D 947 Japan E 682 Uruguay A 899 Korea Republic B 632 Mexico A 895 New Zealand F 410 Chile H 888 South Africa A 392 Cameroon E 887 Korea DPR G 285 ). Table 2 presents the calculated score probabilities for this match. The observed result was 1-0 for Argentina, which is one of the most plausible results according to Table 2 (6.27%). The winning probabilities for Argentina is 55.47%, for Nigeria is 17.62% and a draw has probability 26.91%.

Table 2
Score chances for the match Argentina versus Nigeria.

Table 3 displays the outcomes probabilities, the De Finetti measures and the correct prediction indicator for all matches in group G when we have the observed matches before the first round, the second round and the third round. For the the 1st, 2nd and 3rd round the De Finetti measures are 0.4423, 0.4445 and 0.53, while the proportion of correct prediction are 66.67%, 75% and 50%, respectively. All the outcome predictions change slightly with the incorporation of observed data, except from the prediction for the match Portugal versus Korea DPR in the second round. Portugal win probability has decreased from 77.5% to 55% and the Korea win raise from 11.3% to 23.8% with the observation of outcomes of the first round.

Table 3
Predictions for the matches of Group G.

The reason for such great change in the probabilities is that Portugal scored no goals against Côte d'Ivoire and Korea DPR scored one against the Brazilian team, which was at that moment the highest FIFA rating team. Then, the analysis confirms the expected favoritism of Brazil and Portugal in this group.

Table 4 shows the probabilities attached to the outcomes (results in 90 minutes, in other words, in regulation time), the De Finetti measures and correct prediction indicator for matches of Round of 16 phase. The DeFineti measure for this round is 0.4944 and the correct prediction proportion 75%.

Table 4
Predictions for the round of 16.

For the final matches (first and third place matches), the chances for each finalist are presented in Table 5. Despite the optimism of experts in favor of Spain (mean expected score 1.25-0.75) and the best FIFA ranking (2nd against 4th), the lower number of goals scored in the tournament in comparison with Netherlands (7 versus 12) resulted in a slight favoritism to the Netherlands (56.79% Netherlands victory against 43.21% for Spain victory).

For the third place match, Germany versus Uruguay, we observe a widespread predicted favoritism of Germany (72.32% Germany win against 27.68% Uruguay win), which can be explained by its best performance in previous matches (13 goals scored against 9), its highest ranking position (6th against 13th) and the significant wins against strong and traditional teams such as England (Germany 4-1 England) and Argentina (Germany 4-0 Argentina), which also affected the experts' opinions in favor of Germany (the mean expected score was 1.75 against 1.25).

Table 5
Prediction for the final and third place match.

To assess the goodness of fit of the proposed model for the number of goals, we realized two proper goodness of fit procedures. The first one is a usual chi-square test for adherence comparing the observed and expected number of goals. The p-value equals to 0.525 indicating the data fit well the assumed model. This result is corroborated by the bar plots of the observed and expected number of goals shown in the upper panel of the Figure 1. As a second goodness of fit procedure, following (Bittner et al., 20094 BITTNER E, NUSSBAUMER A, JANKE W & WEIGEL M. 2009. Football fever: Goal distributions and non-gaussian statistics. European Physical Journal B, 67(3): 459-471.) and (Janke et al., 20099 JANKE W, BITTNER E, NUSSBAUMER A & WEIGEL M. 2009. Football fever: self-affirmation model for goal distributions. Condensed Matter Physics, 12(4): 739-752.), we plot in the lower panel of Figure 1 the relative observed frequency of the number of goals, superimposed by the lower and upper quantiles (0.025 and 0.975) of the predictive density over 1,000 replications of the whole tournament. There is no important deviations, indicating the data fit well the assumed model.

For sake of comparison we also considered a model where a discrete uniform distribution in the interval 0 to 4 is assumed for the number of goals of any team in any match in the competition, the a0 is assumed to be equal to 0 and the FIFA rank is assumed to be equal to all teams. Such model can be regarded as a sort of random guessing model. In this case, the usual chi-square test for adherence comparing the observed and expected number of goals produced a p-value smaller than 0.00001, indicating that the deviations of the random guessing model from the truth are substantially bigger than what would be expected if it is the true model.

Figure 1
Upper panel: observed and expected relative frequency of the number of goals. Lower panel: relative observed frequency of the number of goals, superimposed by the lower and upper quantiles of the predictive density.

In order to assess the impact of the experts' information on the quality of the predictions, Figure 2 displays the DeFinetti measure for our pool of experts fixing a0 at 0 and 0.25, denoting respectively the total absence of experts' opinion and the amount of experts' information as considered to account for the mean of their opinion in our method. Moreover, for sake of comparison with the best possibility, we replaced our experts' information by a fictitious pool of "perfect" experts, who always forecast the exact score for each one of the matches, with a0 fixed 0.25.

At the initial rounds, the use of experts' information greatly improves prediction, with the DeFinetti measure with a0 = 25 always closer to the results of the "perfect" experts' opinion. However, as observed data enters the model, with the progress of the competition, the gain of using expert information is decreasing. This feature is in fully agreement with our initial motivation to consider experts' information in our modeling: filling the lack of information when there is shortage of objective information (data) available. For the knockout stage matches, experts' information did not improve prediction, which can be explained in part by the small difference of skill level between teams and by the lack of confidence on Spain and Netherlands teams who defeated the traditional teams of Germany and Brazil, respectively.

Figure 2
De Finetti measure considering our pool of experts with a0 = 0 (dashed line) and a0 = 25 (dotted line). The solid line represents the case of a fictitious pool of "perfect" experts with a0 = 25.

4.2 2010 WCT - Classification Predictions at Group Stage

In this section, we illustrate how the simulation procedure is applied to calculate the qualifying probabilities for Group D, compose by the teams: Autralia, Germany, Ghana and Serbia.

Table 6 displays, for each team, the classification probabilities in all the four positions and also the qualifying probabilities for the knockout stage. From these results, we see that before any match occur, Germany's team has a great favoritism in this group, which can be explained by the high FIFA ranking (6th) and by the confidence of the experts in its performance. Ghana is the team with the second highest qualifying chances, despite having the lowest FIFA ranking (24th). This fact can be explained by the experts' optimism about Ghana performance.

After 1st round, we can see that the Germany team confirmed the favoritism and thrashed Australia (4-0), increasing its qualifying chance and more than doubling the chance of finishing in the first place. At the 1st round, Serbia was defeated by Ghana and its qualifying chances has decreased from 45.45% to 8.31%. On the other hand, Ghana increased its qualifying chances from 55.83% to 78.22%.

Table 6
Classification forecasts for group D.

After the second round, the "unexpected" (1-0) Serbia win against Germany, increased considerably Serbia's qualifying chances (from 8.31% to 59.81%) and decreased Germany's chances from 97.94% to 71.54%. The great increase in Serbia's qualifying chances can be explained by the fact that, besides of winning three points, Serbia beat the very strong German team and will play at the final round against the weak Australia team. The (1-1) draw in the match Australia versus Ghana let all teams with chances of qualifying for the next stage, with Serbia and Ghana having very similar qualifying chances.

After the third round, Germany confirmed the predictions and defeated Ghana (1-0). On the other hand, Serbia lost its match against Australia and finished in the fourth place. Despite its defeat, Ghana classified in the second place.

4.3 2010 WCT - Tournament Prediction

In this section, we present the results of the simulation study performed to estimate probabilities of events related to the tournament. A quantity of 10,000 tournament replicas has been carried out in each simulation.

Just before the beginning of tournament, a simulation was performed to calculate the chances of reaching each round of the knockout stage and also of winning the championship. Table 7 displays the chances for the top 16 teams with the greatest chances of qualifying for the knockout stage. Based on this results, we observe a large favouritism of Spain and Netherlands, and also an equilibrium among Argentina, England and Portugal. Surprisingly, the traditional teams of France and Italy (who joined the 2006 WCT final match) have less chances than Chile and Portugal. Actually, France and Italy did not qualified for the knockout stage.

Table 7
Probabilities of the 16 top teams for the next phases.

Considering only the four finalists (Spain, Netherlands, Germany and Uruguay) we obtained, just before each round of the knockout stage, the winning tournament probabilities assuming different values for the a0 weight attached to the experts' information. Table 9 presents the obtained results for this simulation study. In all situations, varying the a0 value we obtain different predictions for the probability of tournament win for the teams, which means that predictions are sensible to different degrees of confidence on experts' information.

4.4 2014 WCT - Group Stage Classification Probalities

From now on we focus on the 2014 WCT. In this section, we illustrate how the simulation procedure is applied to calculate the qualifying probabilities for Groups A (Brazil Group), B (Spain Group, the champion of the 2010 WCT), D (Group of Death with the 3 ex-champions: Italy, England and Uruguay), F (Argentina Group, in the actual 2nd place) and G (German Group, actual champion).

Table 8 displays, for each team, the classification probabilities in all the four positions and also the qualifying probabilities for the knockout stage for Groups A, B, D, F and G. From these results, we see that Brazil's team has a great favouritism in its group, which can be explained by its third highest last pre-tournament FIFA world ranking as well as by the confidence of the experts in its performance. The same conclusions seems to be true to Argentina, Germany and Portugal, classified in second and fourth places in the last pre-tournament FIFA world ranking.

Table 8
Classification forecasts for Groups A, B, D, F and G of the 2014 WCT.

4.5 2014 WCT - Tournament Prediction

In this section, we present the results of the simulation study performed to estimate probabilities of events related to the 2014 tournament. A quantity of 10,000 tournament replicas has been carried out in each simulation.

Before the beginning of tournament, a simulation was performed to calculate the chances of winning the championship. Figure 3 displays the Box-plot for the chances of winning the 2010 WCT for the top 16 teams. Based on this results, we observe a large favouritism of Germany, following by an equilibrium between Argentina and Portugal and sequencially Brazil. Netherlands appears more ahead but still with a good chance. Indeed, the four finalists were Germany, Argentina, Netherlands and Brazil.

Figure 3
Box-plot for the chances of winning the 2014 WCT before the tournament starts.

5 FINAL REMARKS

In this paper, we propose a bayesian simulation methodology for predicting match outcomes of the 2010 and 2014 WCTs, which makes use of the FIFA ratings and experts' opinions. FIFA ratings system are based on previous four year performance of teams. The drawbacks of this ratings system is the great changes in the formation of teams in such a large period of time and the small number of games played between teams of different continent in comparison with those played by teams of the same continent. Other measures of strength of teams should be considered further and compared with the FIFA ratings. Moreover, the development of two ratings for teams, possibly via experts' information or even FIFA documentation, one for attack and another for defense, could improve prediction since there are teams which have strong defense but weak attack and vice-versa. A simple possibility to incorporate those abilities would be to add one parameter for each team directly on the Poisson model, as it was made for instance in (Volf, 200918 VOLF P. 2009. A random point process model for the score in sport matches. IMA J. Manag. Math., 20: 121-131.). This major embedding can be seen as a direct generalization of our modeling and should be considered in future research in the field.

The prior distributions are updated every round, providing flexibility to the modeling, once the experts' opinion are influenced by all previous events to the match. The use of expert's opinions may compensate, at least in part, for the lack of information of the factors which can influence a football match during a competition, such as tactic disciplines, team psychological conditions, referee, player injured or suspended, amongst others.

The method may be used to calculate the win, draw and loss probabilities at each single match, as well as to simulate the whole competition in order to estimate, for instance, probabilities of classification at group stage, of reaching the knockout stage or the final match, and of winning the tournament.

Moreover, the method presents a high performance within a simulation structure since known predictive distributions are obtained. This enables a rapid generation of predictive distribution values and consequently the probabilities of interest are obtained quickly.

For sake of comparison of our approach with other methodologies applied for predicting match outcomes of WCTs, we consider the methodology proposed by (Bastos & da Rosa, 20133 BASTOS LS & DA ROSA JMC. 2013. Predicting probabilities for the 2010 fifa world cup games using a poisson-gamma model. Journal of Applied Statistics, 40(7): 1553-1544.). While the overall DeFinetti measure for their best predictive model for the 2010 WCT are 0.626 and 0.611, respectively for group stage and overall, our modeling performance is better with 0.568 and 0.570, respectively for the group stage and overall. Besides, our modeling joined a prediction model competition for the matches of the group stage of the 2010 WCT organized by the Brazilian Society of Operational Research, the World Cup 2010 Football Forecast Competition, reaching the first place. We believe the inclusion of subjective information into our modeling through expert's opinions was crucial for such achievements.

For the 2010 WCT, the forecasts presented in this paper as well as others results were available and updated before each round on the website www.copa2010.ufscar.br. The internet users could also provide their guess on the match scores, interacting with our model through a chances simulator. The simulator uses an unique and innovative technology which makes an interaction between the programming languages such as PHP, HTML, CSS, JavaScript, SQL and R, to provide real time answers. In this context, the internet users were provided with all the teams' chances of winning in each stage of the knockout stage as well as the chance of each team to be the champion. At that time, according to our counter, we had over 53,000 site visits with over 11,000 simulations made. Thus, a large number of internet users and viewers had been exposed to our methodology.

For the 2014 WCT, the forecasts presented in this paper as well as others results were available and updated before each round on the website www.previsaoesportiva.com.br/copa. At that time, according to our counter, we had almost 50,000 site visits.

In our analysis the weights of the experts' opinion are fixed and known. As further work it may be considered one distinct value of a0 for each expert and round allowing changes of the values over the rounds. One interpretation that can be made is that, for a particular expert, if a0 increases over the rounds, the confidence of the information given by this expert increases as well. Alternatively, if a0 decreases, that means the information provided by this expert in previous rounds were not reliable. Furthermore, we can assume that a0 is a random variable and use a full hierarchical structure specifying a parametric distribution for the parameter a0, like a beta distribution, as suggested in Chen & Ibrahim (2000).

Future developments of the model should also consider the changes in tactical patterns of teams which occur during the match due to several factors such as scoring or conceding a goal, injury or expulsion of an important player, among others. The dynamics of the match are specially distinct for the knockout matches and for the third round matches of the group stage, since, for instance, one of the teams may be classified in a given moment and can adopt a defensive posture to not risk its qualification for the next round. Another possible improvement of our modeling would be the use of a measure for the skill of teams different from the FIFA ratings and to procedure a comparison between their predictive accuracies. Besides, previous to know experts' information, we considered here noninformative Jeffreys' prior distributions for the Poisson parameters λA and λB. However, other prior distributions could be work well such as a bounded gamma one. Moreover, here the predictive distributions were computed separately, since our model and prior distribution are factorable in terms of function of λA and λB. However, another approach would be the consideration of bivariate models as discussed in (Karlis & Ntzoufras, 200910 KARLIS D & NTZOUFRAS I. 2009. Bayesian modelling of football outcomes: Using the Skellam's distribution for the goal difference. IMA J. Manag. Math., 20: 133-145.), which allows for dependence between the number of goals of teams.

In our modeling we assume the presence of independence between the expert's opinions, which simplifies the obtained mathematical expressions. However, a more realistic approach would assume an exchangeable distribution or a multivariate distribution with dependence structure modeled via copulas or other means. Future developments of the model should also consider such model enlargement.

Table 9
Percentage of tournament wins.

Considering the proposed modeling, we observe that teams which score more goals seem to have their probability of win increased. At least in principle, the dependence on the number of goals seems to be present in models based on the number of goals scored. This is particularly true when a WCT is considered, due to the small number of matches per team, up to six matches for teams that remain in the tournament until the finals. Therefore, a change in the result of a match may impact the probability of win of the teams. In order to verify the impact of the number the goals scored by a team in our modeling, we change some match results by adding goals artificially. For instance, Spain won Paraguay by 1-0 in the quarter-finals. Considering the final match, Netherlands and Spain probabilities of win were 56.79% and 43.21%, respectively, pointing to a balanced match. We however changed the result of the Spain and Paraguay match to 2-0, 3-0 and 4-0. The Netherlands probability of win decreases to 53.57%, 51.20% and 47.64%, respectively, while the Spain probability of win increases to 46.43%, 48.80% and 52.36%, respectively.That is, the changes in the result of Spain and Paraguay match leads to changes, but does not drastic, in the final results, yet pointing to a balanced match. Indeed, the last match was very hard to Spain, which scored a goal only in the extra time. There is now a number of methods on how to predict the outcomes of football matches or even matches of other kind of sport, some of then cited in the references. A large sample comparison of their accuracy could be an important future research theme in the light of our modeling.

Finally, at least in principle, the proposed methodology can be used for prediction of match outcomes of other sports, for instance, indoor soccer, basketball, water polo, handball and hockey, but taking into account the differences of tournament structure of each one.

ACKNOWLEDGMENTS

The authors thanks the Editorial Boarding and the anonymous referees for their criticisms, comments and suggestions on an early version of the manuscript. The research is partially supported by CNPq, CAPES and FAPESP.

REFERENCES

  • 1
    ALVES AM, DE MELLO JCCBS, RAMOS TG & SANT'ANNA AP. 2011. Logit models for the probability of winning football games. Pesquisa Operacional, 31(3): 459-465.
  • 2
    BAIO G & BLANGIARDO M. 2010. Bayesian hierarchical model for the prediction of footballresults. Journal of Applied Statistics, 37(2): 253-264.
  • 3
    BASTOS LS & DA ROSA JMC. 2013. Predicting probabilities for the 2010 fifa world cup games using a poisson-gamma model. Journal of Applied Statistics, 40(7): 1553-1544.
  • 4
    BITTNER E, NUSSBAUMER A, JANKE W & WEIGEL M. 2009. Football fever: Goal distributions and non-gaussian statistics. European Physical Journal B, 67(3): 459-471.
  • 5
    CALÔBA GM & LINS MPE. 2006. Performance assessment of the soccer teams in brazil using dea. Pesquisa Operacional, 26(3): 521-536.
  • 6
    CLEMEN RT & WINKLER RL. 1999. Combining probability distributions from experts in risk analysis. Risk Analysis, 19(2): 187-103.
  • 7
    DOBSON S & GODDARD J. 2003. Persistence in sequences of football match results: A monte carlo analysis. European Journal of Operational Research, 148(2): 247-256.
  • 8
    GENEST C & ZIDEK JV. 1986. Combining probability distributions: a critique and an annotated bibliography. Statistical Science, 1(1): 114-148.
  • 9
    JANKE W, BITTNER E, NUSSBAUMER A & WEIGEL M. 2009. Football fever: self-affirmation model for goal distributions. Condensed Matter Physics, 12(4): 739-752.
  • 10
    KARLIS D & NTZOUFRAS I. 2009. Bayesian modelling of football outcomes: Using the Skellam's distribution for the goal difference. IMA J. Manag. Math., 20: 133-145.
  • 11
    KONING RH, KOOLHAAS M, RENES G & RIDDER G. 2003. A simulation model for footballchampionships. European Journal of Operational Research, 148(2): 268-276.
  • 12
    KOOPMAN SJ & LIT R. 2015. A dynamic bivariate poisson model for analysing and forecasting match results in the english premier league. Journal of the Royal Statistical Society Series A, 178(1): 167-186.
  • 13
    LOUZADA F, SUZUKI AK & SALASAR LEB. 2014. Predicting match outcomes in the englishpremier league: Which will be the final rank? Journal of Data Science, 12: 235-254.
  • 14
    O'HAGAN A, BUCK CE, DANESHKHAH A, EISER JR, GARTHWAITE PH, JENKINSON DJ, OAKLEY JE & RAKOW T. 2006. Uncertain judgements: eliciting experts' probabilities. John Wiley, London.
  • 15
    SANT'ANNA AP. 2008. Rough sets analysis with antisymmetric and intransitive attributes: Classification of brazilian soccer clubs. Pesquisa Operacional, 28(2): 217-230.
  • 16
    SCARFF PA & SHI X. 2008. The importance of a match in a tournament. Computers & Operations Research, 35(7): 2406-2418.
  • 17
    SUZUKI AK, SALASAR LEB, LOUZADA-NETO F & LEITE JG. 2009. A bayesian approach for predicting match outcomes: The 2006 (association) football world cup. Journal of the Operational Research Society, 61: 1530-1539 (October 2010).
  • 18
    VOLF P. 2009. A random point process model for the score in sport matches. IMA J. Manag. Math., 20: 121-131.

APPENDICE A: FIFA RATINGS

Table 10
FIFA ratings prior to 2010 WCT.

Publication Dates

  • Publication in this collection
    Sep-Dec 2015

History

  • Received
    10 Jan 2014
  • Accepted
    11 Sept 2015
Sociedade Brasileira de Pesquisa Operacional Rua Mayrink Veiga, 32 - sala 601 - Centro, 20090-050 Rio de Janeiro RJ - Brasil, Tel.: +55 21 2263-0499, Fax: +55 21 2263-0501 - Rio de Janeiro - RJ - Brazil
E-mail: sobrapo@sobrapo.org.br