500
Study of mathematical concepts used in data science applications. Topics include differentiation and integration of functions, optimization techniques, matrix operations, eigenvalues and eigenvectors, curve fitting, and discrete mathematics.
3
Distribution of random variables, conditional probability and independence, distributions of functions of random variables, and limiting distributions are discussed.
3
Prerequisites
DATA 50000 or prior coursework in Calculus
Overview of the field of data mining and analytics; includes large-scale file systems and Map-Reduce, measures of similarity, link analysis, frequent item sets, clustering, e-advertising as an application, recommendation systems.
3
Prerequisites
CPSC 50200 or DATA 50000, and CPSC 50100, DATA 51100, or prior programming experience, or an undergraduate degree in Computer Science
Programming structures and algorithms for large-scale statistical data processing and visualization. Students will use commonly available data analysis software packages to apply concepts and skills to large data sets and will also develop their own code using an objectoriented programming language.
3
Prerequisites
CPSC 50100 or prior programming experience
Multivariate data analysis, preliminary data analysis, exploratory factor analysis, multiple regression analysis, multiple discriminant analysis, logistic regression, MANOVA and GLM, and cluster analysis are discussed.
3
Prerequisites
DATA 50100
The theory and practice of visualizing large, complicated data sets to clarify areas of emphasis. Human factors best practices will be presented. Programming with advanced visualization frameworks and practices will be demonstrated and used in group programming projects.
3
Prerequisites
CPSC 50100, DATA 51100, or prior programming experience
The design and operation of large-scale, cloud-based systems for storing data. Topics include operating system virtualization, distributed network storage; distributed computing, cloud models (IAAS, PAAS, and SAAS), and techniques for securing cloud and virtual systems.
3
Prerequisites
CPSC 50100, DATA 51100, or prior programming experience
Algorithms for enabling artificial systems to learn from experience; supervised and unsupervised learning; clustering, reinforcement learning; control. Students will write programs that demonstrate machine learning techniques.
3
Prerequisites
CPSC 50200 or DATA 50000, and CPSC 50100, DATA 51100, or prior programming experience
The capstone experience for students pursuing the Computer Science concentration in Data Science. Students will develop a solution for a real-world problem in data mining and analytics, document their work in a scholarly report, and present their methodology and results to faculty and peers.
3
Prerequisites
A minimum of 24 hours earned in the MS Data Science program.