This Python code ranks a list of publications retrieved from the Google Scholar search engine by number of citations.

anjukan/sort-google-scholar

 
 

This Python code ranks publication data from Google Scholar by number of citations. It is useful for finding relevant papers in a specific field.

The data acquired from Google Scholar is Title, Citations, Links and Rank. The example in the code looks for the top 100 papers related to the keyword 'non intrusive load monitoring' and ranks them by number of citations. As output, the program plots the number of citations on the Y axis and the rank of the result on the X axis. Optionally, it also exports the data to a .csv file. Please update the keyword and the other initialization variables as needed.
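The ranking and export step can be illustrated with a minimal sketch that uses only the standard library. It is not the code from this repository: the records, the helper names `rank_by_citations` and `to_csv`, and the reading of Rank as the position in the search results are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical records standing in for scraped results (made-up titles, links, counts).
# "Rank" is assumed here to be the position of the result in the search listing.
papers = [
    {"Rank": 1, "Title": "Paper A", "Citations": 120, "Links": "https://example.org/a"},
    {"Rank": 2, "Title": "Paper B", "Citations": 950, "Links": "https://example.org/b"},
    {"Rank": 3, "Title": "Paper C", "Citations": 340, "Links": "https://example.org/c"},
]


def rank_by_citations(records):
    """Return a new list sorted by citation count, highest first."""
    return sorted(records, key=lambda record: record["Citations"], reverse=True)


def to_csv(records, fieldnames=("Rank", "Title", "Citations", "Links")):
    """Serialize the records to CSV text with a header row."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=list(fieldnames))
    writer.writeheader()
    writer.writerows(records)
    return buffer.getvalue()


ranked = rank_by_citations(papers)
csv_text = to_csv(ranked)
```

Writing `csv_text` to a file gives the optional .csv export; the plot described above would use the `Rank` values on the X axis and the `Citations` values on the Y axis.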

Update: Google Robot Checking

If you are having problems with Google's robot checking, you can try using a proxy. Follow these steps:

  1. Go to the website https://free-proxy-list.net/ and pick one proxy (example: 200.162.142.178).

  2. Add a new line with the chosen proxy, for example:

proxies = {
  'http': 'http://200.162.142.178:3128',
  'https': 'http://200.162.142.178:3128',
}
  3. Pass proxies as an argument to session.get. In other words, change the line page = session.get(url) to page = session.get(url, proxies=proxies).

  4. I hope that works! For feedback, send me an email: fernando [dot] wittmann [at] gmail [dot] com
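The proxy steps above can be put together in a short sketch, assuming the script uses a `requests` session as the `session.get` call suggests. The helpers `build_proxies` and `fetch`, and the `timeout` value, are hypothetical names and choices for illustration and are not part of the original code.

```python
def build_proxies(host, port):
    """Build the proxies mapping that requests expects for a single HTTP proxy."""
    address = f"http://{host}:{port}"
    # The same proxy address is used for both plain HTTP and HTTPS traffic.
    return {"http": address, "https": address}


def fetch(url, proxies, timeout=10):
    """Request a page through the given proxies (hypothetical helper)."""
    # Imported here so build_proxies can be used even if requests is not installed.
    import requests

    session = requests.Session()
    page = session.get(url, proxies=proxies, timeout=timeout)
    page.raise_for_status()  # Raise an error on an HTTP 4xx/5xx response.
    return page


# Example proxy taken from the steps above; replace it with the one you picked.
proxies = build_proxies("200.162.142.178", 3128)
```

Calling `fetch(url, proxies)` with the script's own `url` then plays the role of `session.get(url, proxies=proxies)`. Keeping the address in one helper makes it easy to swap in another proxy if the chosen one stops responding.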
