Skip to content

AlexanderShaw/scirate_fun

Repository files navigation

Project Outline:

Goal:
Predict what quantum computing papers that come out on arXiv are likely worth reading based on input features. Put it all on a heroku app where you can plug in a new paper (w/o any scitations or anything) and get a "whether you should read it" rating.

Input features:

Author info:

  • Number of scitations on recent papers
  • Number of citations on recent papers (5 years ✔️)
  • Number of total scitations / citations ✔️
  • Collaborators recent scitations
  • Collaborators recent citations
  • Collaborators number of total scitations / citations
  • Number of papers published ✔️
  • h-index ✔️

Paper info:

  • Abstract ✔️
  • Papers cited ✔️

Training features

  • Number of scitations on the paper ✔️
  • Number of citations on the paper ✔️
  • Who scited it ✔️

Output:

A recommended read score.

Workflow

  1. gather raw data
  2. engineer relevant features
  3. create training and test datasets
  4. choose models
  5. train and validate models
  6. compare models on test sets
  7. create heroku dash app
  8. put model on app

Notes

Gather raw data

Stats questions to answer

- Do scitations and citations vary for a given paper? Why? - How much of a well-scited paper changes from arxiv to publish?

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors