References

Adomavicius, G., & Tuzhilin, A. (2005). Toward the next generation of recommender systems: A survey of the state-of-the-art and possible extensions. IEEE Transactions on Knowledge and Data Engineering, 17(6), 734–749.

Agrawal, R., & Srikant, R. (1994). Fast algorithms for mining association rules in large databases. In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB) (pp. 487–499). Morgan Kaufmann.

Agrawal, R., Imieliński, T., & Swami, A. (1993). Mining association rules between sets of items in large databases. In P. Buneman & S. Jajodia (Eds.), Proceedings of the 1993 ACM SIGMOD International Conference on Management of Data (pp. 207–216). ACM. https://doi.org/10.1145/170035.170072

Agresti, A. (2012). Categorical data analysis (3rd ed.). Wiley.

Anscombe, F. J. (1973). Graphs in statistical analysis. The American Statistician, 27(1), 17–21. https://doi.org/10.1080/00031305.1973.10478966

Bayes, T. (1763). An essay towards solving a problem in the doctrine of chances. Philosophical Transactions of the Royal Society of London, 53, 370–418. https://doi.org/10.1098/rstl.1763.0053

Bellman, R. (1961). Adaptive control processes: A guided tour. Princeton University Press.

Blattberg, R. C., Kim, B.-D., & Neslin, S. A. (2008). Database marketing: Analyzing and managing customers (International Series in Quantitative Marketing, Vol. 18). Springer. https://doi.org/10.1007/978-0-387-72579-6

Breiman, L. (1996). Bagging predictors. Machine Learning, 24(2), 123–140. https://doi.org/10.1007/BF00058655

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32. https://doi.org/10.1023/A:1010933404324

Breiman, L., Friedman, J. H., Olshen, R. A., & Stone, C. J. (1984). Classification and regression trees. Chapman & Hall/CRC.

Casella, G., & Berger, R. L. (2002). Statistical inference (2nd ed.). Duxbury.

Cleveland, W. S. (1979). Robust locally weighted regression and smoothing scatterplots. Journal of the American Statistical Association, 74(368), 829–836. https://doi.org/10.1080/01621459.1979.10481038

Cleveland, W. S., & Devlin, S. J. (1988). Locally weighted regression: An approach to regression analysis by local fitting. Journal of the American Statistical Association, 83(403), 596–610.

Cohen, J. (1960). A coefficient of agreement for nominal scales. Educational and Psychological Measurement, 20(1), 37–46. https://doi.org/10.1177/001316446002000104

Cohen, J., Cohen, P., West, S. G., & Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences. Routledge.

Cotton, R. (2017). rebus: Build regular expressions in a human readable way (R package version 0.1–3). https://CRAN.R-project.org/package=rebus

De Moivre, A. (1733). The doctrine of chances: Or, a method of calculating the probabilities of events in play (1st ed.). W. Pearson.

de Vries, A., & Ripley, B. D. (2023). ggdendro: Create dendrograms and tree diagrams using ‘ggplot2’ (R package version 0.2.0). https://CRAN.R-project.org/package=ggdendro

Draper, N. R., & Smith, H. (1998). Applied regression analysis (3rd ed.). Wiley.

Ester, M., Kriegel, H.-P., Sander, J., & Xu, X. (1996). A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD’96) (pp. 226–231). AAAI Press.

Everitt, B., Landau, S., Leese, M., & Stahl, D. (2011). Cluster analysis (5th ed.). Wiley.

Fader, P. S., & Hardie, B. G. S. (2009). Probability models for customer-base analysis. Journal of Interactive Marketing, 23(1), 61–69. https://doi.org/10.1016/j.intmar.2008.11.003

Finn, C., Abbeel, P., & Levine, S. (2017). Model-agnostic meta-learning for fast adaptation of deep networks. In Proceedings of the 34th International Conference on Machine Learning (ICML 2017) (pp. 1126–1135). PMLR. https://proceedings.mlr.press/v70/finn17a.html

Freedman, D., Pisani, R., & Purves, R. (2007). Statistics (4th ed.). W. W. Norton & Company.

Friedl, J. E. F. (2006). Mastering regular expressions (3rd ed.). O’Reilly Media.

Galton, F. (1886). Regression toward mediocrity in hereditary stature. The Journal of the Anthropological Institute of Great Britain and Ireland, 15, 246–263. https://doi.org/10.2307/2841583

Gauss, C. F. (1809). Theoria motus corporum coelestium in sectionibus conicis solem ambientium [Theory of the motion of the heavenly bodies moving about the sun in conic sections]. Perthes et Besser.

Gelman, A., Hill, J., & Vehtari, A. (2020). Regression and other stories. Cambridge University Press.

GitHub, Inc. (2024). GitHub [Computer software]. https://github.com

Gohel, A. (2026). flextable: Functions for tabular reporting (R package version 0.7.6) [Computer software]. https://cran.r-project.org/package=flextable

Goodfellow, I., Bengio, Y., & Courville, A. (2016). Deep learning. MIT Press.

Greene, W. H. (2018). Econometric analysis (8th ed.). Pearson.

Hahsler, M., Grün, B., & Hornik, K. (2005). arules: A computational environment for mining association rules and frequent item sets. Journal of Statistical Software, 14(15), 1–25. https://doi.org/10.18637/jss.v014.i15

Han, J., Kamber, M., & Pei, J. (2012). Data mining: Concepts and techniques (3rd ed.). Morgan Kaufmann.

Hastie, T., Tibshirani, R., & Friedman, J. (2009). The elements of statistical learning (2nd ed.). Springer. https://doi.org/10.1007/978-0-387-84858-7

Hechenbichler, K., & Schliep, K. P. (2004). Weighted k-nearest-neighbor techniques and ordinal classification. Discussion Paper No. 399, SFB 386. Ludwig-Maximilians University Munich.

Hocking, R. R. (1976). The analysis and selection of variables in linear regression. Biometrics, 32(1), 1–49. https://doi.org/10.2307/2529336

Hvitfeldt, E., Kuhn, M., & Posit Software, PBC. (2025). discrim: Model wrappers for discriminant analysis (R package version 1.1.0) [Computer software]. https://CRAN.R-project.org/package=discrim

Hyndman, R. J., & Athanasopoulos, G. (2021). Forecasting: Principles and practice (3rd ed.). OTexts. https://otexts.com/fpp3/

Ihaka, R., & Gentleman, R. (1996). R: A language for data analysis. Journal of Computational and Graphical Statistics, 5(3), 299–314. https://doi.org/10.1080/10618600.1996.10474713

Irizarry, R. A. (2024). Introduction to data science: Data wrangling and visualization with R (2nd ed.). Chapman & Hall/CRC.

Jain, A. K. (2010). Data clustering: 50 years beyond k-means. Pattern Recognition Letters, 31(8), 651–666.

Johnson, S. C. (1967). Hierarchical clustering schemes. Psychometrika, 32(3), 241–254. https://doi.org/10.1007/BF02289588

Jolliffe, I. T. (2002). Principal component analysis. Springer.

Kaufman, L., & Rousseeuw, P. J. (2009). Finding groups in data: An introduction to cluster analysis (2nd ed.). Wiley.

Ketchen, D. J., Jr., & Shook, C. L. (1996). The application of cluster analysis in strategic management research: An analysis and critique. Strategic Management Journal, 17(6), 441–458.

Kuhn, M., & Johnson, K. (2013). Applied predictive modeling. Springer.

Kuhn, M., & Johnson, K. (2019). Feature engineering and selection: A practical approach for predictive models. CRC Press.

Kuhn, M., & Silge, J. (2022). Tidy modeling with R: A framework for modeling in the tidyverse. O’Reilly Media.

Kuhn, M., & Silge, J. (2022). Tidymodels: A framework for modeling and machine learning using tidyverse principles. O’Reilly Media.

Kuhn, M., & Wickham, H. (2020). tidymodels: A collection of packages for modeling and machine learning using tidyverse principles (R package version 0.1.4) [Computer software]. https://CRAN.R-project.org/package=tidymodels

Kuhn, M., & Wickham, H. (2024). modeldata: Data sets used in modeling packages (R package version 1.4.0) [Computer software]. https://CRAN.R-project.org/package=modeldata

Lantz, B. (2023). Machine learning with R: Learn techniques for building and improving machine learning models, from data preparation to model tuning, evaluation, and working with big data (4th ed.). Packt Publishing.

LeCun, Y., & He, K. (2022). Deep learning. Nature, 604(7900), 921–930. https://doi.org/10.1038/s41586-022-04455-8

Legendre, A.-M. (1805). Nouvelles méthodes pour la détermination des orbites des comètes. F. Didot.

Lloyd, S. P. (1982). Least squares quantization in PCM. IEEE Transactions on Information Theory, 28(2), 129–137.

MacQueen, J. (1967). Some methods for classification and analysis of multivariate observations. In L. M. LeCam & J. Neyman (Eds.), Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability (Vol. 1, pp. 281–297). University of California Press.

Majka, M. (2024). naivebayes: High-performance implementation of the Naive Bayes algorithm (R package version 1.0.0) [Computer software]. https://CRAN.R-project.org/package=naivebayes

McCallum, A., & Nigam, K. (1998). A comparison of event models for Naive Bayes text classification. In AAAI-98 Workshop on Learning for Text Categorization (pp. 41–48). AAAI Press.

McElreath, R. (2020). Statistical rethinking: A Bayesian course with examples in R and Stan (2nd ed.). CRC Press.

Milborrow, S. (2019). rpart.plot: Plot rpart models (R package version 3.0.8) [Computer software]. https://CRAN.R-project.org/package=rpart.plot

Mitchell, T. M. (1997). Machine learning. McGraw-Hill.

Montgomery, D. C., Peck, E. A., & Vining, G. G. (2021). Introduction to linear regression analysis (6th ed.). Wiley.

Moore, D. S., McCabe, G. P., & Craig, B. A. (2021). Introduction to the practice of statistics (10th ed.). W. H. Freeman.

Moro, S., Rita, P., & Laureano, R. (2018). Hotel booking demand datasets. Data in Brief, 22, 41–49.

Murtagh, F., & Contreras, P. (2012). Algorithms for hierarchical clustering: An overview. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2(1), 86–97. https://doi.org/10.1002/widm.53

Ockham, W. of. (1990). Philosophical writings (P. Boehner, Ed.). Hackett Publishing Company. (Original work from 14th century)

Pearl, J., & Mackenzie, D. (2018). The book of why: The new science of cause and effect. Basic Books.

Pearson, K. (1896). Mathematical contributions to the theory of evolution. III. Regression, heredity, and panmixia. Philosophical Transactions of the Royal Society A, 187, 253–318. https://doi.org/10.1098/rsta.1896.0007

Pedersen, T. L. (2023). ggforce: Accelerating ggplot2 (R package version 0.4.1). https://CRAN.R-project.org/package=ggforce

Peters, R. (2019). ipred: Improved predictors (R package version 0.9-9) [Computer software]. https://CRAN.R-project.org/package=ipred

Posit Team. (2024). RStudio: Integrated development environment for R (Version 2024.04) [Computer software]. https://posit.co

R Core Team. (2024). R: A language and environment for statistical computing [Computer software]. https://www.r-project.org/

R Core Team. (2026). mtcars: Motor Trend car road tests (1974) [Data set]. In R: A language and environment for statistical computing (Version 5.6.0). https://www.r-project.org/

Robinson, D., Hayes, A., & Couch, S. (2023). broom: Convert statistical analysis objects into tidy tibbles (R package version 1.0.8) [Computer software]. https://CRAN.R-project.org/package=broom

Sarkar, D. (2023). gridExtra: Miscellaneous functions for grid graphics (R package version 2.4-1) [Computer software]. https://CRAN.R-project.org/package=gridExtra

Schafer, J. B., Konstan, J. A., & Riedl, J. (2001). E-commerce recommendation applications. Data Mining and Knowledge Discovery, 5(1–2), 115–153.

Schliep, K., Hechenbichler, K., & Lizee, A. (2025). kknn: Weighted k-nearest neighbors (R package version 1.4.1) [Computer software]. https://CRAN.R-project.org/package=kknn

Sievert, C. (2023). plotly: Create interactive web graphics via plotly.js (R package version 4.11.1) [Computer software]. https://CRAN.R-project.org/package=plotly

Sutton, R. S., & Barto, A. G. (2018). Reinforcement learning: An introduction (2nd ed.). MIT Press.

Swets, J. A. (1988). Measuring the accuracy of diagnostic systems. Science, 240(4857), 1285–1293. https://doi.org/10.1126/science.3287615

Tan, P.-N., Steinbach, M., & Kumar, V. (2019). Introduction to data mining (2nd ed.). Pearson.

Taunk, K., De, S., & Verma, S. (2019). A brief review of nearest neighbor algorithm for learning and classification. In Proceedings of the 2019 International Conference on Intelligent Computing and Control Systems. https://doi.org/10.1109/ICCS45141.2019.9065747

Therneau, T. M., Atkinson, B., & Ripley, B. (2019). rpart: Recursive partitioning and regression trees (R package version 4.1-15) [Computer software]. https://CRAN.R-project.org/package=rpart

Tierney, N. J., Cook, D., McBain, M., & Fay, C. (2023). naniar: Data structures, summaries, and visualisations for missing data in R (R package version 1.1.0). https://CRAN.R-project.org/package=naniar

Tukey, J. W. (1977). Exploratory data analysis. Addison-Wesley.

Urdan, T. C. (2022). Statistics in plain English (5th ed.). Routledge. https://doi.org/10.4324/9781003196582

Vaughan, D. (2017). Statistics for the pharmaceutical sciences. CRC Press.

Wedel, M., & Kamakura, W. A. (2000). Market segmentation: Conceptual and methodological foundations (2nd ed.). Springer.

Wickham, H. (2014). Tidy data. Journal of Statistical Software, 59(10), 1–23. https://doi.org/10.18637/jss.v059.i10

Wickham, H. (2019). readr: Read rectangular text data (R package version 1.4.0). https://CRAN.R-project.org/package=readr

Wickham, H. (2023). forcats: Tools for working with categorical variables (factors) in R (R package version 1.0.3). https://CRAN.R-project.org/package=forcats

Wickham, H. (2024). ggplot2: Create elegant data visualisations using the grammar of graphics (R package version 3.4.4). https://CRAN.R-project.org/package=ggplot2

Wickham, H. (2025). stringr: Simple, consistent wrappers for common string operations (R package version 1.6.0). https://stringr.tidyverse.org

Wickham, H., & Seidel, D. (2024). scales: Scale functions for visualization (R package version 1.2.1). https://CRAN.R-project.org/package=scales

Wickham, H., Averick, M., Bryan, J., Chang, W., McGowan, L., François, R., et al. (2019). Welcome to the tidyverse. Journal of Open Source Software, 4(43), 1686. https://doi.org/10.21105/joss.01686

Wickham, H., François, R., Henry, L., & Müller, K. (2023). dplyr: A grammar of data manipulation (R package version 1.1.0). https://CRAN.R-project.org/package=dplyr

Wickham, H., Vaughan, D., & Girlich, M. (2026a). lubridate: Tidy messy date-times (R package version 1.3.2). https://lubridate.tidyverse.org

Wickham, H., Vaughan, D., & Girlich, M. (2026b). tidyr: Tidy messy data (R package version 1.3.2). https://tidyr.tidyverse.org

Wolpert, D. H. (1996). The lack of a priori distinctions between learning algorithms. Neural Computation, 8(7), 1341–1390. https://doi.org/10.1162/neco.1996.8.7.1341

Wooldridge, J. M. (2022). Introductory econometrics: A modern approach (8th ed.). Cengage Learning.

Wright, M. N., & Ziegler, A. (2017). ranger: A fast implementation of random forests for high dimensional data in C++ and R (R package version 0.12.1). https://CRAN.R-project.org/package=ranger

Zhang, H. (2004). The optimality of Naive Bayes. In Proceedings of the Seventeenth International Florida Artificial Intelligence Research Society Conference (FLAIRS 2004) (pp. 562–567). AAAI Press.

Zhu, X., & Goldberg, A. B. (2009). Introduction to semi-supervised learning. Synthesis Lectures on Artificial Intelligence and Machine Learning, 3(1), 1–130. https://doi.org/10.2200/S00196ED1V01Y200906AIM006