AY2019/2020 Semester 2
School of Computing
National University of Singapore
Taught by He Bingsheng
Data science incorporates varying elements and builds on techniques and theories from many fields, including statistics, data engineering, data mining, visualization, data warehousing, and high-performance computing systems with the goal of extracting meaning from big data and creating data products. Data science needs advanced computing systems such as Apache Hadoop and Spark to address big data challenges. In this module, students will learn various computing systems and optimization techniques that are used in data science with emphasis on the system building and algorithmic optimizations of these techniques.
- Lecture: 2 hrs
- Tutorial: 1 hrs
- Project: 3 hrs
- Preparation: 4 hrs
- Assignments 20%
- Team Project 45%
- Final Test 35%
- System Survey 10% (for CS5425 only)