vaddya/big-data-spark

Big Data with Spark

  • useragents - User agents analytics
  • binclass - Binary classification using linear regression & random forest
  • titanic - Predicting Titanic survivors
  • wikipedia - Language popularity based on Wikipedia articles
  • stackoverflow - Distributed k-means algorithm that clusters Stack Overflow posts by their score
  • timeusage - Identifying three groups of activities and observing how people allocate their time between them
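
As a rough illustration of the clustering idea behind the stackoverflow project, here is a minimal single-machine sketch of k-means (Lloyd's algorithm) on one-dimensional scores. It is not the repository's code and does not use Spark: the function name `kmeans_1d`, the sample scores, and the starting centroids are all made up for this example. In a distributed version, the assignment step would typically run per partition and the per-cluster means would be combined afterwards.

```python
from typing import List


def kmeans_1d(scores: List[float], centroids: List[float], max_iter: int = 50) -> List[float]:
    """Lloyd's algorithm on 1-D values; returns the final centroids, sorted."""
    centroids = list(centroids)
    for _ in range(max_iter):
        # Assignment step: group each score under its nearest centroid.
        clusters = [[] for _ in centroids]
        for s in scores:
            nearest = min(range(len(centroids)), key=lambda i: abs(s - centroids[i]))
            clusters[nearest].append(s)
        # Update step: move each centroid to the mean of its cluster
        # (an empty cluster keeps its previous centroid).
        updated = [sum(c) / len(c) if c else centroids[i] for i, c in enumerate(clusters)]
        if updated == centroids:
            break  # converged: no centroid moved
        centroids = updated
    return sorted(centroids)


# Hypothetical post scores, invented for illustration only.
scores = [0, 1, 2, 10, 11, 12, 100, 110]
print(kmeans_1d(scores, centroids=[0, 10, 100]))
```

The result depends on the starting centroids, so a real run would usually pick them by sampling the data rather than hard-coding them as done here.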
