CS109 Final Project: First Author or Perish
Order of analysis (in iPynotebook):
testMedlineXmlRead: example scripts for xml data extraction
getNlmXmlTags: extract pertinent information out of base NLM Baseline XML dataset
pubYearDetermination: determines the year range (min/max year) from xml-extracted json files
extractAuthors: file-wise extraction of author publication list (affiliation, journal, year, author order, total author)
extractAuthors_prod: production-version for batch processing of XML dataset
getEigenFactor: web-scraping script to get eigenfactor table