DATA - Data Science

DATA-20000 Introduction to Data Science

This course provides an introduction to the field of data science. It examines problems that can be solved by data scientists. The course presents the data science methodology of solving problems, including data gathering, preprocessing, modeling, evaluation, and visualization. Students will use standard data science tools and be introduced to programming concepts.
3

DATA-23500 Programming for Data Analysis

Disciplines and industries are collecting increasing amounts of data to help guide their work. This course presents programming techniques for working with large data sets. It teaches computer programming from the perspective of developing tools to analyze data.
3

Prerequisites

MATH 21500, MATH 22000, MATH 31500 OR PSYC 30300, AND CPSC 21000

DATA-30000 Visualizing and Communicating Data Knowledge

In this course, students will study effective communication of knowledge derived from data. The course also covers visualization of data for purposes of analysis and communication. Students will use standard software tools and programming libraries for visualization. The course will require writing technical reports that present the data science process and results. It also includes a discussion of ethical issues involved in data science.
3

Prerequisites

DATA 23500

DATA-40000 Big Data Systems

This course covers the study of systems for storing and processing large datasets. Covered concepts include standard architectures for Big Data, use of common software frameworks, and applications to batch and real-type systems. Students will work on projects using Big Data technologies such as Hadoop, MapReduce, Hive, Spark or NoSQL databases.
3

Prerequisites

DATA 23500 and CPSC 33000

DATA-47100 Machine Learning

This course studies programs that use experience for improving their performance at solving a variety of tasks such as classification, regression, or clustering. Topics include supervised and unsupervised learning, reinforcement learning, parametric and non-parametric methods, ensemble learning and introduction to computational learning theory. Students will learn how to evaluate the performance of machine learning methods and how to utilize the techniques in various applications.
3

Prerequisites

CPSC 21000 and MATH 21000

DATA-47200 Introduction to Data Mining

An introduction to the concepts, techniques, and systems of data warehousing and data mining, including (1) design and implementation of data warehouse and on-line analytical processing (OLAP) systems, and (2) data mining concepts, methods, systems, implementations, and applications.
3

Prerequisites

CPSC 21000 and MATH 21000

DATA-49000 Data Science Undergraduate Capstone Project

In this course, students will work in teams to develop a data-driven solution for a real-world problem using data science methods, will document their work in a scholarly report, and present their methodology and results to faculty and peers. Students will identify appropriate project topics with help of the faculty, research appropriate current methods and technologies, then apply them to find a solution. The results will be presented in a form of a technical report and an oral presentation. Additionally, this course will cover topics in professional ethics, intellectual properties, privacy and professional communication.
3

Prerequisites

DATA 30000, DATA 40000, AND DATA 47100 OR DATA 47200, plus senior status with major in Data Science.