Teaching
MS Data Science Program at Rochester Institute of Technology
DSCI-644: Software Engineering for Data Science
Spring 2025
Course Overview
This course focuses on the software engineering challenges of building scalable and highly available big data software systems. It covers software design and development methodologies, along with the technologies that address the major software aspects of a big data system: software architectures, application design patterns, different types of data models and data management, and deployment architectures.
Course Projects
Project 1: Software Development Life Cycle
Project 2: Advanced Batch Processing ETL Pipeline
Project Description (Coming Soon)
GitHub Repository (Coming Soon)
Project 3: Database Normalization and Optimization
Project Description (Coming Soon)
GitHub Repository (Coming Soon)
Project 4: Stream Processing ETL Pipeline
Project Description (Coming Soon)
GitHub Repository (Coming Soon)