About This Course
Scales machine learning models and data analysis to a Big Data platform. Map Reduce and Spark frameworks are introduced as approaches to parallel algorithm development. Hands-on labs.
Sample course topics: Matrix and graph operations, linear regression model, similarity, mining data streams, clustering, dimensionality reduction, and Hadoop big data platform.
Sample textbook: Mining of Massive Datasets, 2nd Edition by Jure Leskovec, Anand Rajaraman, and Jeffrey David Ullman.