Odysseus Logo

Virginia Tech

STAT-5525: Data Analytics

Description: 5525: Basic techniques in data analytics including the preparation and manipulation of data for analysis and the creation of data files from multiple and dissimilar sources. The data mining and knowledge discovery process. Overview of data mining algorithms in classification, clustering, association analysis, probabilistic modeling, and matrix decompositions. Detailed study of classification methods including tree-based methods, Bayesian methods, logistic regression, ensemble, bagging and boosting methods, neural network methods, use of support vectors and Bayesian networks. Detailed study of clustering methods including k-means, hierarchical and self-organizing map methods. Prerequisite: Graduate Standing required. 5526: Techniques in unsupervised and visualized learning in high dimension spaces. Theoretical, probabilistic, and applied aspects of data analytics. Methods include generalized linear models in high dimensional spaces, regularization, lasso and related methods, principal component regression (pca), tree methods, and random forests. Clustering methods including k-means, hierarchical clustering, biclustering, and model-based clustering will be thoroughly examined. Distance-based learning methods include multi dimensional scaling, the self organizing map, graphical/network models, and isomap. Supervised learning will consist of discriminant analyses, supervised pca, support vector machines, and kernel methods.

Pathways: N/A

Course Hours: 3 credits

Prerequisites: N/A

Required By: ADS-5224, STAT-5234

Corequisites: N/A

Crosslist: N/A

Repeatability: N/A

Sections Taught: 24

Average GPA: 3.78 (A)

Strict A Rate (No A-) : 63.78%

Average Withdrawal Rate: 0.11%

Bodicherla A Prakash201847.3%46.3%6.4%0.0%0.0%0.0%3.382
Robert B Gramacy201970.5%29.3%0.0%0.0%0.0%0.0%3.671
Anuj Karpatne202166.6%31.7%0.0%0.0%0.9%0.9%3.603
Chandan K Reddy202085.8%14.2%0.0%0.0%0.0%0.0%3.734
Scott C Leman2017100.0%0.0%0.0%0.0%0.0%0.0%4.003
Thomas H Woteki202083.4%0.0%16.7%0.0%0.0%0.0%3.621
Jonathan K Alt202083.3%16.7%0.0%0.0%0.0%0.0%3.881
Oliver Schabenberger2023100.0%0.0%0.0%0.0%0.0%0.0%3.951
Jyotishka Datta202294.4%5.6%0.0%0.0%0.0%0.0%3.943
Xinwei Deng2023100.0%0.0%0.0%0.0%0.0%0.0%3.942
Chang Tien Lu201266.6%33.3%0.0%0.0%0.0%0.0%3.571
Martin Skarzynski2023100.0%0.0%0.0%0.0%0.0%0.0%3.982

Grade Distribution Over Time