forked from CASISCAS/asyspark

Asynchronous spark machine learning with parameter server

ljzzju/asyspark

 
 

asyspark

Spark

Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Spark Streaming for stream processing.

asySpark

AsySpark is a component for Spark that makes machine learning workloads more efficient by using an asynchronous computing model, such as asynchronous stochastic gradient descent (SGD) with a parameter server.
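To illustrate the idea, here is a minimal sketch of asynchronous SGD against a parameter server, written in plain Scala with threads standing in for Spark executors. It is not AsySpark's actual API: the names `ParameterServer`, `pull`, `push`, and `AsyncSgdSketch` are hypothetical, and the model (least-squares linear regression) is chosen only to keep the example short. Each worker pulls the current weights, computes a gradient on one example, and pushes the update without waiting for the other workers, so the weights it used may already be stale.

```scala
import java.util.concurrent.{Executors, TimeUnit}

// Hypothetical in-memory parameter server; in a real deployment this
// would be a separate service reached over the network.
final class ParameterServer(dim: Int) {
  private val weights = new Array[Double](dim)

  // Return a snapshot of the current weights.
  def pull(): Array[Double] = synchronized { weights.clone() }

  // Apply one gradient step; callers never wait for each other's results.
  def push(gradient: Array[Double], learningRate: Double): Unit = synchronized {
    var i = 0
    while (i < dim) {
      weights(i) -= learningRate * gradient(i)
      i += 1
    }
  }
}

object AsyncSgdSketch {
  // Gradient of the squared error 0.5 * (w . x - y)^2 for a single example.
  def gradient(w: Array[Double], x: Array[Double], y: Double): Array[Double] = {
    val err = w.zip(x).map { case (wi, xi) => wi * xi }.sum - y
    x.map(_ * err)
  }

  def train(data: Seq[(Array[Double], Double)], dim: Int, workers: Int,
            epochs: Int, lr: Double): Array[Double] = {
    val server = new ParameterServer(dim)
    val pool = Executors.newFixedThreadPool(workers)
    // Split the data into roughly one partition per worker.
    val partSize = math.max(1, (data.size + workers - 1) / workers)
    data.grouped(partSize).foreach { part =>
      pool.execute(new Runnable {
        def run(): Unit =
          for (_ <- 1 to epochs; (x, y) <- part) {
            val w = server.pull() // may be stale by the time we push
            server.push(gradient(w, x, y), lr)
          }
      })
    }
    pool.shutdown()
    pool.awaitTermination(1, TimeUnit.MINUTES)
    server.pull()
  }

  def main(args: Array[String]): Unit = {
    // Synthetic, noise-free data generated from y = 2 * x0 - 3 * x1.
    val rng = new scala.util.Random(42)
    val data = Seq.fill(200) {
      val x = Array(rng.nextDouble(), rng.nextDouble())
      (x, 2.0 * x(0) - 3.0 * x(1))
    }
    val w = train(data, dim = 2, workers = 4, epochs = 200, lr = 0.1)
    println(w.mkString(", "))
  }
}
```

The contrast with a synchronous scheme is that there is no barrier between workers: a slow worker delays only its own updates, at the cost of some gradients being computed from slightly outdated weights.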

Tips

If you would like to contribute or collaborate with us, contact us.

Further reading

Web resources

1. Dean, NIPS'13; Li, OSDI'14: the parameter server architecture
2. Taobao parameter server architecture

Papers

1. Langford, NIPS'09; Agarwal, NIPS'11: theoretical convergence of asynchronous SGD
2. Li, WSDM'16: practical considerations for asynchronous SGD with the parameter server


Languages

  • Scala 99.6%
  • Shell 0.4%