Computer code and statistical cases for the book Distributed Statistical Computing for Big Data (大数据分布式计算与案例, by Feng Li)
00-teaching-slides
01-stratified-sampling
02-kmeans-movie-review
03-poisson-regression
04-k-nearest-neighbors
05-discriminant-analysis
06-logistic-regression-split
07-logistic-regression-mahout
08-text-classification
09-quadratic-classifier
10-decision-tree
11-random-forests
12-naive-bayes
13-ridge-regression
16-recommendation-systems
17-hive-example
18-hive-hadoop-streaming
19-spark-word-count
.gitignore
README.md


Distributed Statistical Computing (大数据分布式计算与案例)

Feng Li
School of Statistics and Mathematics
Central University of Finance and Economics

This is the code repository for my forthcoming book, Distributed Statistical Computing for Big Data (in Chinese).

  • 00-teaching-slides contains the teaching slides (PDF) in both English and Chinese versions, along with demo code and Jupyter notebooks.

  • xx-case-examples (the numbered directories, such as 01-stratified-sampling) contain the statistical cases in Markdown and TeX formats.