Odysseus Logo

Virginia Tech

STAT-5525: Statistical Learning

Description: Theory and application of supervised and unsupervised methods of statistical and machine learning. 5525: Methods of supervised statistical and machine learning for regression and classification. Overview of statistical (data) and algorithmic models. Detailed study of regression models for continuous and discrete data (linear, nonlinear, and generalized linear models). Detailed study of methods for classifying categorical outcomes (logistic and multinomial models, discriminant analysis, naïve Bayes). Tree-based methods for regression and classification. Feature selection, regularization, and dimension reduction for high-dimensional problems (Lasso, Ridge, PCR, PLS). Cross-validation and resampling for model tuning and uncertainty estimation. Statistical analyses using R or Python. 5526: Supervised and unsupervised statistical and machine learning for complex or high-dimensional data. Methods include: global and local models with smoothing (nearest-neighbor, kernel, and basis expansion techniques). Generalized (linear and additive) models. Methods for correlated (clustered) data, mixed models. Unsupervised learning for summarization, visualization, dimension reduction, imputation, and grouping; (K-means, PCA, hierarchical clustering, model-based clustering, association rules, self-organizing maps, and biclustering). Ensemble learning (bagging, model averaging, boosting, stacking). Support vector machines and neural networks. Introduction to methods and algorithms for deep learning. Model interpretability and explainability. Statistical analyses using R or Python. Pre: Graduate Standing.

Pathways: N/A

Course Hours: 3 credits

Prerequisites: N/A

Required By: ADS-5526, STAT-5234, STAT-5526

Corequisites: N/A

Crosslist: ADS-5525

Repeatability: N/A

Sections Taught: 16

Average GPA: 3.81 (rounds to A-)

Strict A Rate (No A-) : 70.27%

Average Withdrawal Rate: 0.35%

Martin Skarzynski2023100.0%0.0%0.0%0.0%0.0%0.0%3.982
Xinwei Deng202495.5%0.0%1.5%0.0%1.5%1.5%3.882
Anuj Karpatne202166.6%31.7%0.0%0.0%0.9%0.9%3.603
Oliver Schabenberger2023100.0%0.0%0.0%0.0%0.0%0.0%3.951
Chandan K Reddy202092.1%7.8%0.0%0.0%0.0%0.0%3.821
Christian Lucero202480.0%20.0%0.0%0.0%0.0%0.0%3.771
Thomas H Woteki202083.4%0.0%16.7%0.0%0.0%0.0%3.621
Jyotishka Datta202294.4%5.6%0.0%0.0%0.0%0.0%3.943
Robert B Gramacy201970.5%29.3%0.0%0.0%0.0%0.0%3.671
Jonathan K Alt202083.3%16.7%0.0%0.0%0.0%0.0%3.881

Grade Distribution Over Time

1234GPA
Spring 2019Fall 2019Summer I 2020Fall 2020Summer I 2021Fall 2021Summer I 2022Spring 2023Summer I 2023Fall 2023Summer I 2024Fall 2024Term050% W