The Distributed Statistical Computing course was developed and taught by Dr. Feng Li in 2014 for a joint master’s program in statistics involving several prestigious universities: Peking University, Renmin University of China, Central University of Finance and Economics, University of Chinese Academy of Sciences, and Capital University of Economics and Business.
Dr. Feng Li has also offered this course for the Business Analytics program at Peking University since 2020.
- Basic knowledge of statistics
- Basic knowledge of computing
- Distributed statistical computing [New online book | Print version]
- Lecture notes
- Demo Hadoop/Spark configurations
- Basic Hadoop/Spark Tutorial for Statisticians
- Chinese-language teaching videos are also available at https://space.bilibili.com/509963672
Slides and lecture notes
Read with online Jupyter Notebook viewer
- Download all Jupyter Notebooks in a zip file.
- Download all data in a zip file.
Part I: Distributed Systems and Distributed Computing
Part II: Advanced Distributed Statistical Computing