Distributed Statistical Computing for Big Data (2018 Fall) Prerequisite Basic knowledge in statistics Basic knowledge in computing Literature Selected books in distributed computing and big data Lecture notes Basic Hadoop/Spark Tutorial for Statisticians Lecture notes L1: Introduction to Hadoop L2: Understanding MapReduce L3: Statistical Modeling with Hadoop MapReduce L4: Programming Hive L5: Introduction to Spark L6: Spark and Machine Learning L7: Statistical Learning with Mahout