References
Aitchison, J., & Brown, J. A. C. (1957). The lognormal distribution. Cambridge University Press.
Anscombe, F. J. (1973). Graphs in statistical analysis. The American Statistician, 27(1), 17–21. https://doi.org/10.1080/00031305.1973.10478966
Blattberg, R. C., Kim, B.-D., & Neslin, S. A. (2008). Database marketing: Analyzing and managing customers. Springer. https://doi.org/10.1007/978-0-387-72579-6
Bortkiewicz, L. von. (1898). Das Gesetz der kleinen Zahlen: Untersuchungen über die Verteilung der seltenen Ereignisse. G. Fischer.
Breusch, T. S., & Pagan, A. R. (1979). A simple test for heteroskedasticity and random coefficient variation. Econometrica, 47(5), 1287–1294. https://doi.org/10.2307/1911963
Casella, G., & Berger, R. L. (2002). Statistical inference (2nd ed.). Duxbury.
Cleveland, W. S. (1979). Robust locally weighted regression and smoothing scatterplots. Journal of the American Statistical Association, 74(368), 829–836. https://doi.org/10.1080/01621459.1979.10481038
Cleveland, W. S., & Devlin, S. J. (1988). Locally weighted regression: An approach to regression analysis by local fitting. Journal of the American Statistical Association, 83(403), 596–610. https://doi.org/10.1080/01621459.1988.10478639
Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd ed.). Routledge.
De Moivre, A. (1733). The doctrine of chances: Or, a method of calculating the probabilities of events in play. W. Pearson.
Fader, P. S., & Hardie, B. G. S. (2009). Probability models for customer-base analysis. Journal of Interactive Marketing, 23(1), 61–69. https://doi.org/10.1016/j.intmar.2008.11.003
Fox, J., & Weisberg, S. (2019). An R companion to applied regression (3rd ed.). Sage.
Freedman, D., Pisani, R., & Purves, R. (2007). Statistics (4th ed.). W. W. Norton & Company.
Friedl, J. E. F. (2006). Mastering regular expressions (3rd ed.). O’Reilly Media.
Galton, F. (1886). Regression toward mediocrity in hereditary stature. Journal of the Anthropological Institute of Great Britain and Ireland, 15, 246–263. https://doi.org/10.2307/2841583
Gauss, C. F. (1809). Theoria motus corporum coelestium in sectionibus conicis solem ambientium. Perthes et Besser.
Gelman, A., Hill, J., & Vehtari, A. (2020). Regression and other stories. Cambridge University Press.
GitHub, Inc. (2024). GitHub. https://github.com
Gohel, A. (2026). flextable: Functions for tabular reporting (Version 0.7.6) [Computer software]. https://cran.r-project.org/package=flextable
Greene, W. H. (2018). Econometric analysis (8th ed.). Pearson.
Hald, A. (1998). A history of mathematical statistics from 1750 to 1930. Wiley.
Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7
Havil, J. (2014). John Napier: Life, logarithms, and legacy. Princeton University Press.
Hunter, J. D. (2007). Matplotlib: A 2D graphics environment. Computing in Science & Engineering, 9(3), 90–95. https://doi.org/10.1109/MCSE.2007.55
Irizarry, R. A. (2024). Introduction to data science: Data wrangling and visualization with R (2nd ed.). Chapman & Hall/CRC.
Jolliffe, I. T. (2002). Principal component analysis (2nd ed.). Springer.
Jupyter Development Team. (n.d.). Jupyter. https://jupyter.org/
Kotlarski, I. (1967). Pareto distribution. In S. Kotz & N. L. Johnson (Eds.), Encyclopedia of statistical sciences (Vol. 7, pp. 384–388). Wiley.
Kuhn, M., & Johnson, K. (2013). Applied predictive modeling. Springer.
Legendre, A.-M. (1805). Nouvelles méthodes pour la détermination des orbites des comètes. F. Didot.
Limpert, E., Stahel, W. A., & Abbt, M. (2001). Log-normal distributions across the sciences: Keys and clues. BioScience, 51(5), 341–352. https://doi.org/10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2
McDonald, J. H. (2014). Handbook of biological statistics (3rd ed.). Sparky House Publishing.
McElreath, R. (2020). Statistical rethinking: A Bayesian course with examples in R and Stan (2nd ed.). CRC Press.
Microsoft. (n.d.). Visual Studio Code. https://code.visualstudio.com/
Montgomery, D. C., Peck, E. A., & Vining, G. G. (2021). Introduction to linear regression analysis (6th ed.). Wiley.
Moore, D. S., McCabe, G. P., & Craig, B. A. (2021). Introduction to the practice of statistics (10th ed.). W. H. Freeman & Company.
NumPy Developers. (n.d.). NumPy. https://numpy.org/
Pareto, V. (1897). Cours d’économie politique. F. Rouge.
Pearl, J., & Mackenzie, D. (2018). The book of why: The new science of cause and effect. Basic Books.
Pearson, K. (1896). Mathematical contributions to the theory of evolution. III. Regression, heredity, and panmixia. Philosophical Transactions of the Royal Society A, 187, 253–318. https://doi.org/10.1098/rsta.1896.0007
pandas Development Team. (n.d.). pandas documentation. https://pandas.pydata.org/docs/
Python Software Foundation. (n.d.). datetime — Basic date and time types. https://docs.python.org/3/library/datetime.html
Python Software Foundation. (n.d.). History of Python. https://www.python.org/doc/essays/blurb/
Python Software Foundation. (n.d.). math — Mathematical functions. https://docs.python.org/3/library/math.html
Python Software Foundation. (n.d.). Python documentation. https://docs.python.org/
R Core Team. (2026). R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.r-project.org/
R Core Team. (2026). mtcars dataset. R Foundation for Statistical Computing. https://www.r-project.org/
Ross, S. M. (2014). Introduction to probability and statistics for engineers and scientists (5th ed.). Academic Press.
Seabold, S., & Perktold, J. (2010). Statsmodels: Econometric and statistical modeling with Python. In Proceedings of the 9th Python in Science Conference (pp. 92–96).
Sievert, C. (2023). plotly: Create interactive web graphics via plotly.js (Version 4.11.1) [Computer software]. https://cran.r-project.org/package=plotly
Stigler, S. M. (1986). The history of statistics: The measurement of uncertainty before 1900. Belknap Press of Harvard University Press.
Student. (1908). The probable error of a mean. Biometrika, 6(1), 1–25. https://doi.org/10.2307/2331554
Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.
Urdan, T. C. (2022). Statistics in plain English (5th ed.). Routledge. https://doi.org/10.4324/9781003196582
Vaughan, D. (2017). Statistics for the pharmaceutical sciences. CRC Press.
Virtanen, P., et al. (2020). SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nature Methods, 17(3), 261–272. https://doi.org/10.1038/s41592-019-0686-2
Waskom, M. (2021). seaborn: Statistical data visualization (Version 0.11.2) [Computer software]. https://doi.org/10.5281/zenodo.4569847
Wasserstein, R. L., & Lazar, N. A. (2016). The ASA statement on p-values: Context, process, and purpose. The American Statistician, 70(2), 129–133. https://doi.org/10.1080/00031305.2016.1154108
White, H. (1980). A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrica, 48(4), 817–838. https://doi.org/10.2307/1912934
Wickham, H. (2014). Tidy data. Journal of Statistical Software, 59(10), 1–23. https://doi.org/10.18637/jss.v059.i10
Wooldridge, J. M. (2022). Introductory econometrics: A modern approach (8th ed.). Cengage Learning.