Skip to content

pcoving/KDDCup

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

24 Commits
 
 
 
 
 
 
 
 

Repository files navigation

KDDCup 2013

Details of the competition can be found here.

To keep the repo lightweight, the dataset does not ship with the code. The .csv data can be downloaded from Kaggle (requires account) and untarred in the top-level directory.

Some benchmarks require the scikit-learn package.

Theory

Semi-supervised learning review:

The competition appears to be an instance of bipartite ranking:

Personalized PageRank with Monte Carlo looks promising:

Ideas

  • Build features with link analysis on author/paper graph, possibly with NetworkX library (doesn't seem to scale, looks like we need our own implementation)
  • How to use titles, keywords, affliction and other raw text features?

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages