On the use of simulation methods to compute probabilitiesapplication to the first division Spanish soccer league

  1. Núñez Antón, Vicente A.
  2. Díaz-Emparanza, Ignacio
Journal:
Sort: Statistics and Operations Research Transactions

ISSN: 1696-2281

Year of publication: 2010

Volume: 34

Issue: 2

Pages: 181-200

Type: Article

More publications in: Sort: Statistics and Operations Research Transactions

Sustainable development goals

Abstract

We consider the problem of using the points a given team has in the First Division Spanish Soccer League to estimate its probabilities of achieving a specific objective, such as, for example, staying in the first division or playing the European Champions League. We started thinking about this specific problem and how to approach it after reading that some soccer coaches indicate that a team in the first division guarantees its staying in that division if it has a total of 42 points at the end of the regular season. This problem differs from the typical probability estimation problem because we only know the actual cumulative score a given team has at some point during the regular season. Under this setting a series of different assumptions can be made to predict the probability of interest at the end of the season. We describe the specific theoretical probability model using the multinomial distribution and, then, introduce two approximations to compute the probability of interest, as well as the exact method. The different proposed methods are then evaluated and also applied to the example that motivated them. One interesting result is that the predicted probabilities can then be dynamically evaluated by using data from the current soccer competition.

Bibliographic References

  • Agresti, A. (1990).Categorical Data Analysis. New York: Wiley.
  • Brillinger, D. R. (2008). Modelling game outcome of the Brazilian 2006 Series A Championship as ordinalvalued.Brazilian Journal of Probability and Statistics, 22, 89-104.
  • Cottrell, A. and Lucchetti, R. (2009) Gretl User’s Guide. Gnu Regression, Econometrics and Time Series. http://sourceforge.net/projects/gretl/files/manual/ [Online; November, 2009 version].
  • Cryer, J. B. and Miller, R. B. (1991).Statistics for Business: Data Analysis and Modelling. Boston: PWSKENT publishing Company.
  • Dı́az-Emparanza, I. (2002). Is a small Monte Carlo analysis a good analysis? Checking the size, power and consistency of a simulation-based test.Statistical Papers, 43(4), 567-577.
  • Hogg, R. W. and Tanis, E. A. (1988).Probability and Statistical Inference. New York: Macmillan Publishing Company.
  • Karlis D. and Ntzoufras J. (2000). On modelling soccer data.Student, 3, 229-245.
  • Karlis, D. and Ntzoufras, I. (2009). Bayesian modelling of football outcomes: using the Skellam’s distribution for the goal difference.IMA Journal of Management Mathematics, 20, 133-145
  • Kleijnen, J. P. C. (1987).Statistical Tools for Simulation Practitioners. New York: Marcel Dekker, Inc.
  • Lee, A. J. (1997). Modeling scores in the premier league: is Manchester Unitedreally the best?Chance, 10, 15-19.
  • Morris, C. (1975). Central limit theorems for multinomial sums.The Annals of Statistics, 14(1), 165-188.
  • Rao, C. R. (1973).Linear Statistical Inference and its Applications. New York: Wiley.
  • Rue, H. and Salvesen, O. (2000). Prediction and retrospective analysis of soccer matches in a league. Journal of the Royal Statistical Society-Series D (The Statistician), 49, 399-418.