Goal:
Predict which new quantum computing papers on arXiv are likely worth reading, based on input features. Deploy everything as a Heroku app where you can plug in a new paper (one that has no scitations or citations yet) and get a "should you read it" rating.
Author info:
- Number of scitations on recent papers
- Number of citations on recent papers (5 years ✔️)
- Number of total scitations / citations ✔️
- Collaborators' recent scitations
- Collaborators' recent citations
- Collaborators' total number of scitations / citations
- Number of papers published ✔️
- h-index ✔️
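A minimal sketch of how the author features above could be computed from a list of an author's papers. The dict keys (`year`, `citations`, `scites`), the function names, and the calendar-year cutoff for "recent" are assumptions made for illustration, not part of any existing data source.

```python
from datetime import date


def h_index(citation_counts):
    """Largest h such that h papers have at least h citations each."""
    ranked = sorted(citation_counts, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def author_features(papers, today, recent_years=5):
    """Aggregate per-author features.

    `papers` is a list of dicts with hypothetical keys
    'year', 'citations' and 'scites'.
    """
    # Assumption: "recent" means published within the last `recent_years` calendar years.
    cutoff = today.year - recent_years
    recent = [p for p in papers if p["year"] >= cutoff]
    return {
        "n_papers": len(papers),
        "total_citations": sum(p["citations"] for p in papers),
        "total_scites": sum(p["scites"] for p in papers),
        "recent_citations": sum(p["citations"] for p in recent),
        "recent_scites": sum(p["scites"] for p in recent),
        "h_index": h_index([p["citations"] for p in papers]),
    }
```

The same function could be applied to each collaborator's paper list and the results summed or averaged to get the collaborator features.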
Paper info:
- Abstract ✔️
- Papers cited ✔️
- Number of scitations on the paper ✔️
- Number of citations on the paper ✔️
- Who scited it ✔️
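One possible way to hold the paper info, sketched under the assumption that scitation and citation counts serve as training labels and are missing for a brand-new paper (as the goal describes). The class name, field names, and the two example input features are hypothetical.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class PaperRecord:
    abstract: str
    cited_papers: List[str] = field(default_factory=list)
    # Unknown for a brand-new paper, so these default to "missing".
    scites: Optional[int] = None
    citations: Optional[int] = None
    scited_by: List[str] = field(default_factory=list)

    def input_features(self):
        """Features that are available at submission time."""
        return {
            "abstract_words": len(self.abstract.split()),
            "n_cited_papers": len(self.cited_papers),
        }

    def has_labels(self):
        """True once both scitation and citation counts have been collected."""
        return self.scites is not None and self.citations is not None
```

Keeping the label fields optional makes it explicit which records can be used for training and which can only be scored.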
A recommended read score.
- gather raw data
- engineer relevant features
- create training and test datasets
- choose models
- train and validate models
- compare models on test sets
- create Heroku Dash app
- put model on app
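The middle steps above (split the data, train candidate models, compare them on the test set) can be sketched in plain Python. Everything here is illustrative: the two toy models, the `h_index` feature name, and the mean-absolute-error metric are assumptions, and a separate validation step is left out for brevity.

```python
import random


def train_test_split(rows, test_fraction=0.2, seed=0):
    """Shuffle a copy of `rows` with a fixed seed and split off a test set."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)
    n_test = max(1, int(len(shuffled) * test_fraction))
    return shuffled[n_test:], shuffled[:n_test]


def mean_absolute_error(model, rows):
    """Average |prediction - target| over (features, target) pairs."""
    return sum(abs(model(x) - y) for x, y in rows) / len(rows)


def fit_mean_baseline(train):
    """Baseline model: always predict the mean training target."""
    mean = sum(y for _, y in train) / len(train)
    return lambda x: mean


def fit_h_index_line(train):
    """Least-squares line on a single (hypothetical) 'h_index' feature."""
    xs = [x["h_index"] for x, _ in train]
    ys = [y for _, y in train]
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    var = sum((v - mx) ** 2 for v in xs)
    slope = sum((a - mx) * (b - my) for a, b in zip(xs, ys)) / var if var else 0.0
    return lambda x: my + slope * (x["h_index"] - mx)


def compare_models(rows, fitters, seed=0):
    """Fit each candidate on the training split and score it on the test split."""
    train, test = train_test_split(rows, seed=seed)
    return {name: mean_absolute_error(fit(train), test) for name, fit in fitters.items()}
```

Each row is a `(features, target)` pair, where the target would be the recommended read score. A call such as `compare_models(rows, {"mean": fit_mean_baseline, "line": fit_h_index_line})` returns one test error per model, so the lowest value identifies the candidate to put on the app.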