Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Final project for Stats 132 at UC Berkeley; use machine learning algorithms to classify PLoS articles into subjects (accessing them via the SOLR API)
Fetching latest commit…
Cannot retrieve the latest commit at this time.
|Type||Name||Latest commit message||Commit time|
|Failed to load latest commit information.|
This is the code for my final project in Stats 132 at UC Berkeley (Practical Machine Learning) The goal is to use machine learning algorithms to classify scientific articles in to subjects using data from the publicly available PLoS search API (http://api.plos.org). The python code is used to pull down articles from PLoS and format them into vectors. It is gradually being assembled in to a decent project structure, and eventually will include the classification algorithms. The R scripts are built to facilitate investigation and contain some (poorly implemented) machine learning algorithms.