References

Aitchison, J., & Brown, J. A. C. (1957). The lognormal distribution. Cambridge University Press.

Anscombe, F. J. (1973). Graphs in statistical analysis. The American Statistician, 27(1), 17–21. https://doi.org/10.1080/00031305.1973.10478966

Blattberg, R. C., Kim, B.-D., & Neslin, S. A. (2008). Database marketing: Analyzing and managing customers (International Series in Quantitative Marketing, Vol. 18). Springer. https://doi.org/10.1007/978-0-387-72579-6

Bortkiewicz, L. von. (1898). Das gesetz der kleinen zahlen: Untersuchungen über die Verteilung der seltenen ereignisse. G. Fisher.

Breusch, T. S., & Pagan, A. R. (1979). A simple test for heteroscedasticity and random coefficient variation. Econometrica, 47(5), 1287–1294. https://doi.org/10.2307/1911963

Casella, G., & Berger, R. L. (2002). Statistical inference (2nd ed.). Duxbury.

Cleveland, W. S. (1979). Robust locally weighted regression and smoothing scatterplots. Journal of the American Statistical Association, 74(368), 829–836. https://doi.org/10.1080/01621459.1979.10481038

Cleveland, W. S., & Devlin, S. J. (1988). Locally weighted regression: An approach to regression analysis by local fitting. Journal of the American Statistical Association, 83(403), 596–610.

Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences (3rd ed.). Routledge.

Cotton, R. (2017). rebus: Build regular expressions in a human readable way (R package version 0.1-3) [Computer software]. CRAN. https://CRAN.R-project.org/package=rebus

De Moivre, A. (1733). The doctrine of chances: Or, a method of calculating the probabilities of events in play (1st ed.). W. Pearson.

Diedenhofen, B., & Musch, J. (2015). Cocor: A comprehensive solution for the statistical comparison of correlations. PLOS ONE, 10(4), e0121945. https://doi.org/10.1371/journal.pone.0121945

Fader, P. S., & Hardie, B. G. S. (2009). Probability models for customer-base analysis. Journal of Interactive Marketing, 23(1), 61–69. https://doi.org/10.1016/j.intmar.2008.11.003

Fox, J., & Weisberg, S. (2019). An R companion to applied regression (3rd ed.). Sage.

Freedman, D., Pisani, R., & Purves, R. (2007). Statistics (4th ed.). W. W. Norton & Company.

Friedl, J. E. F. (2006). Mastering regular expressions (3rd ed.). O’Reilly Media.

Galton, F. (1886). Regression toward mediocrity in hereditary stature. Journal of the Anthropological Institute of Great Britain and Ireland, 15, 246–263. https://doi.org/10.2307/2841583

Gauss, C. F. (1809). Theoria motus corporum coelestium in sectionibus conicis solem ambientium. Perthes et Besser.

Gelman, A., Hill, J., & Vehtari, A. (2020). Regression and other stories. Cambridge University Press.

GitHub, Inc. (2024). GitHub [Computer software]. https://github.com

Gohel, A. (2026). flextable: Functions for tabular reporting (R package version 0.7.6) [Computer software]. https://cran.r-project.org/package=flextable

Greene, W. H. (2018). Econometric analysis (8th ed.). Pearson.

Hald, A. (1998). A history of mathematical statistics from 1750 to 1930. Wiley.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning: Data mining, inference, and prediction (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7

Havil, J. (2014). John Napier: Life, logarithms, and legacy. Princeton University Press.

Hothorn, T., Zeileis, A., & Graham, N. (2023). lmtest: Testing linear regression models (R package version 0.9-41) [Computer software]. CRAN. https://CRAN.R-project.org/package=lmtest

Ihaka, R., & Gentleman, R. (1996). R: A language for data analysis. Journal of Computational and Graphical Statistics, 5(3), 299–314. https://doi.org/10.1080/10618600.1996.10474713

Irizarry, R. A. (2024). Introduction to data science: Data wrangling and visualization with R (2nd ed.). Chapman & Hall/CRC.

Jolliffe, I. T. (2002). Principal component analysis. Springer.

Kotlarski, I. (1967). Pareto distribution. In S. Kotz & N. L. Johnson (Eds.), Encyclopedia of statistical sciences (Vol. 7, pp. 384–388). Wiley.

Kuhn, M., & Johnson, K. (2013). Applied predictive modeling. Springer.

Legendre, A.-M. (1805). Nouvelles méthodes pour la détermination des orbites des comètes. F. Didot.

Limpert, E., Stahel, W. A., & Abbt, M. (2001). Log-normal distributions across the sciences: Keys and clues. BioScience, 51(5), 341–352. https://doi.org/10.1641/0006-3568(2001)051[0341:LNDATS]2.0.CO;2

McDonald, J. H. (2014). Handbook of biological statistics (3rd ed.). Sparky House Publishing.

McElreath, R. (2020). Statistical rethinking: A Bayesian course with examples in R and Stan (2nd ed.). CRC Press.

Moore, D. S., McCabe, G. P., & Craig, B. A. (2021). Introduction to the practice of statistics (10th ed.). W. H. Freeman & Company.

Montgomery, D. C., Peck, E. A., & Vining, G. G. (2021). Introduction to linear regression analysis (6th ed.). Wiley.

Pearl, J., & Mackenzie, D. (2018). The book of why: The new science of cause and effect. Basic Books.

Pearson, K. (1896). Mathematical contributions to the theory of evolution. III. Regression, heredity, and panmixia. Philosophical Transactions of the Royal Society of London. Series A, 187, 253–318. https://doi.org/10.1098/rsta.1896.0007

Posit Team. (2024). RStudio: Integrated development environment for R (Version 2024.04) [Computer software]. https://posit.co

R Core Team. (2024). R: A language and environment for statistical computing [Computer software]. https://www.r-project.org/

R Core Team. (2026). mtcars: Motor Trend car road tests (1974) [Data set]. In R: A language and environment for statistical computing (Version 5.6.0). https://www.r-project.org/

Robinson, D., Hayes, A., & Couch, S. (2023). broom: Convert statistical analysis objects into tidy tibbles (R package version 1.0.8) [Computer software]. CRAN. https://CRAN.R-project.org/package=broom

Ross, S. M. (2014). Introduction to probability and statistics for engineers and scientists (5th ed.). Academic Press.

Sarkar, D. (2023). gridExtra: Miscellaneous functions for “grid” graphics (R package version 2.4-1) [Computer software]. CRAN. https://CRAN.R-project.org/package=gridExtra

Sievert, C. (2023). plotly: Create interactive web graphics via plotly.js (R package version 4.11.1) [Computer software]. CRAN. https://CRAN.R-project.org/package=plotly

Stigler, S. M. (1986). The history of statistics: The measurement of uncertainty before 1900. Belknap Press.

Student. (1908). The probable error of a mean. Biometrika, 6(1), 1–25. https://doi.org/10.2307/2331554

Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.

Urdan, T. C. (2022). Statistics in plain English (5th ed.). Routledge. https://doi.org/10.4324/9781003196582

Vaughan, D. (2017). Statistics for the pharmaceutical sciences. CRC Press.

Wasserstein, R. L., & Lazar, N. A. (2016). The ASA statement on p-values: Context, process, and purpose. The American Statistician, 70(2), 129–133. https://doi.org/10.1080/00031305.2016.1154108

White, H. (1980). A heteroskedasticity-consistent covariance matrix estimator and a direct test for heteroskedasticity. Econometrica, 48(4), 817–838. https://doi.org/10.2307/1912934

Wickham, H. (2014). Tidy data. Journal of Statistical Software, 59(10), 1–23. https://doi.org/10.18637/jss.v059.i10

Wickham, H. (2019). readr: Read rectangular text data (R package version 1.4.0). https://CRAN.R-project.org/package=readr

Wickham, H. (2023). dplyr: A grammar of data manipulation (R package version 1.1.0). https://CRAN.R-project.org/package=dplyr

Wickham, H. (2024). ggplot2: Create elegant data visualisations using the grammar of graphics (R package version 3.4.4). https://CRAN.R-project.org/package=ggplot2

Wickham, H. (2024). scales: Scale functions for visualization (R package version 1.2.1). https://CRAN.R-project.org/package=scales

Wickham, H. (2025). stringr: Simple, consistent wrappers for common string operations (R package version 1.6.0). https://stringr.tidyverse.org

Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L., François, R., … Yutani, H. (2019). Welcome to the tidyverse. Journal of Open Source Software, 4(43), 1686. https://doi.org/10.21105/joss.01686

Wickham, H., François, R., Henry, L., & Müller, K. (2023). dplyr: A grammar of data manipulation (R package version 1.1.0). CRAN. https://CRAN.R-project.org/package=dplyr

Wickham, H., Vaughan, D., & Girlich, M. (2026). lubridate: Tidy messy date-times (R package version 1.3.2). https://lubridate.tidyverse.org

Wickham, H., Vaughan, D., & Girlich, M. (2026). tidyr: Tidy messy data (R package version 1.3.2). https://tidyr.tidyverse.org

Wooldridge, J. M. (2022). Introductory econometrics: A modern approach (8th ed.). Cengage Learning.

Yee, T. W. (2024). VGAM: Vector generalized linear and additive models (R package). CRAN. https://CRAN.R-project.org/package=VGAM

Zeileis, A. (2023). sandwich: Robust covariance matrix estimators (R package version 3.2-0). CRAN. https://CRAN.R-project.org/package=sandwich