STAT-5525: Statistical Learning
Description: Theory and application of supervised and unsupervised methods of statistical and machine learning. 5525: Methods of supervised statistical and machine learning for regression and classification. Overview of statistical (data) and algorithmic models. Detailed study of regression models for continuous and discrete data (linear, nonlinear, and generalized linear models). Detailed study of methods for classifying categorical outcomes (logistic and multinomial models, discriminant analysis, naïve Bayes). Tree-based methods for regression and classification. Feature selection, regularization, and dimension reduction for high-dimensional problems (Lasso, Ridge, PCR, PLS). Cross-validation and resampling for model tuning and uncertainty estimation. Statistical analyses using R or Python. 5526: Supervised and unsupervised statistical and machine learning for complex or high-dimensional data. Methods include: global and local models with smoothing (nearest-neighbor, kernel, and basis expansion techniques). Generalized (linear and additive) models. Methods for correlated (clustered) data, mixed models. Unsupervised learning for summarization, visualization, dimension reduction, imputation, and grouping; (K-means, PCA, hierarchical clustering, model-based clustering, association rules, self-organizing maps, and biclustering). Ensemble learning (bagging, model averaging, boosting, stacking). Support vector machines and neural networks. Introduction to methods and algorithms for deep learning. Model interpretability and explainability. Statistical analyses using R or Python. Pre: Graduate Standing.
Pathways: N/A
Course Hours: 3 credits
Sections Taught: 16
Average GPA: 3.81 (rounds to A-)
Strict A Rate (No A-) : 70.27%
Average Withdrawal Rate: 0.35%
Martin Skarzynski | 2023 | 100.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 3.98 | 2 |
Xinwei Deng | 2024 | 95.5% | 0.0% | 1.5% | 0.0% | 1.5% | 1.5% | 3.88 | 2 |
Anuj Karpatne | 2021 | 66.6% | 31.7% | 0.0% | 0.0% | 0.9% | 0.9% | 3.60 | 3 |
Oliver Schabenberger | 2023 | 100.0% | 0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 3.95 | 1 |
Chandan K Reddy | 2020 | 92.1% | 7.8% | 0.0% | 0.0% | 0.0% | 0.0% | 3.82 | 1 |
Christian Lucero | 2024 | 80.0% | 20.0% | 0.0% | 0.0% | 0.0% | 0.0% | 3.77 | 1 |
Thomas H Woteki | 2020 | 83.4% | 0.0% | 16.7% | 0.0% | 0.0% | 0.0% | 3.62 | 1 |
Jyotishka Datta | 2022 | 94.4% | 5.6% | 0.0% | 0.0% | 0.0% | 0.0% | 3.94 | 3 |
Robert B Gramacy | 2019 | 70.5% | 29.3% | 0.0% | 0.0% | 0.0% | 0.0% | 3.67 | 1 |
Jonathan K Alt | 2020 | 83.3% | 16.7% | 0.0% | 0.0% | 0.0% | 0.0% | 3.88 | 1 |