Virginia Tech

STAT-5526: Statistical Learning

Description: Theory and application of supervised and unsupervised methods of statistical and machine learning.

5525: Methods of supervised statistical and machine learning for regression and classification. Overview of statistical (data) and algorithmic models. Detailed study of regression models for continuous and discrete data (linear, nonlinear, and generalized linear models). Detailed study of methods for classifying categorical outcomes (logistic and multinomial models, discriminant analysis, naïve Bayes). Tree-based methods for regression and classification. Feature selection, regularization, and dimension reduction for high-dimensional problems (Lasso, Ridge, PCR, PLS). Cross-validation and resampling for model tuning and uncertainty estimation. Statistical analyses using R or Python.

5526: Supervised and unsupervised statistical and machine learning for complex or high-dimensional data. Global and local models with smoothing (nearest-neighbor, kernel, and basis expansion techniques). Generalized (linear and additive) models. Methods for correlated (clustered) data, mixed models. Unsupervised learning for summarization, visualization, dimension reduction, imputation, and grouping (K-means, PCA, hierarchical clustering, model-based clustering, association rules, self-organizing maps, and biclustering). Ensemble learning (bagging, model averaging, boosting, stacking). Support vector machines and neural networks for classification and regression. Introduction to methods and algorithms for deep learning. Model interpretability and explainability. Statistical analyses using R or Python.

Pathways: N/A

Course Hours: 3 credits

Prerequisites: ADS-5525 or STAT-5525

Required By: ADS-5814

Corequisites: N/A

Crosslist: ADS-5526

Repeatability: N/A

Sections Taught: 7

Average GPA: 3.88 (rounds to A)

Strict A Rate (No A-): 77.27%

Average Withdrawal Rate: 0.00%

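Oliver Schabenberger | 2024 |  80.0% | 20.0% | 0.0% | 0.0% | 0.0% | 0.0% | 3.80 | 1
Allison S Crandell   | 2022 |  83.4% | 16.7% | 0.0% | 0.0% | 0.0% | 0.0% | 3.83 | 1
Xinwei Deng          | 2024 | 100.0% |  0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 3.98 | 2
Thomas H Woteki      | 2020 | 100.0% |  0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 3.94 | 1
Christian Lucero     | 2023 | 100.0% |  0.0% | 0.0% | 0.0% | 0.0% | 0.0% | 4.00 | 1
Chandan K Reddy      | 2019 |  57.2% | 42.9% | 0.0% | 0.0% | 0.0% | 0.0% | 3.61 | 1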

Grade Distribution Over Time