Create journalCategoryScore #262

Closed
paulalbert1 opened this issue Jul 28, 2018 · 1 comment

paulalbert1 commented Jul 28, 2018

Faculty in certain departments are far more likely to publish in journals associated with one or two specialties than in journals from other specialties. For example, Weill Cornell faculty in Orthopaedic Surgery are 200 times more likely to author papers in journals assigned to the "Orthopedics" subfield/category (as assigned by ScienceMetrix) than would be expected by chance. We can take advantage of this during scoring.

While there are tens of thousands of possible department-subfield combinations, only 100 of them account for 57% of all known WCM publications. To fully validate this approach, we would need to evaluate it at other institutions.

Preprocessing

As a one-time step, we need to create two new static tables in DynamoDB.

Scoring

  1. Get all ISSNs from article.

  2. Get identity.organizationalUnits for targetAuthor. Include all types including programs.

  3. Look up any department synonyms in application.properties. For example:

departmentSynonym: Healthcare Policy & Research|Public Health|Health Care Policy & Research, Radiation Oncology|Radiation for Oncology, Otolaryngology - Head and Neck Surgery|Otolaryngology|Otorhinolaryngology

For example, suppose a user has a known department of "Public Health". We would then also need to look in ScienceMetrixDepartmentCategory for departments matching "Healthcare Policy & Research" and "Health Care Policy & Research."
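A minimal sketch of the synonym lookup in step 3, in Python for illustration only. It assumes that commas separate synonym groups and pipes separate names within a group, as in the example property above. The function names `parse_department_synonyms` and `expand_departments` are hypothetical, and matching here is exact and case-sensitive, which the real implementation may not want.

```python
from __future__ import annotations


def parse_department_synonyms(raw: str) -> list[set[str]]:
    """Split the departmentSynonym value into groups of interchangeable names.

    Assumption: "," separates groups and "|" separates names in a group,
    so a department name that itself contains a comma would break this.
    """
    groups = []
    for group in raw.split(","):
        names = {name.strip() for name in group.split("|") if name.strip()}
        if names:
            groups.append(names)
    return groups


def expand_departments(departments: list[str], groups: list[set[str]]) -> set[str]:
    """Return the given departments plus every synonym of each one."""
    expanded = set(departments)
    for dept in departments:
        for names in groups:
            if dept in names:
                expanded |= names
    return expanded


raw = (
    "Healthcare Policy & Research|Public Health|Health Care Policy & Research, "
    "Radiation Oncology|Radiation for Oncology, "
    "Otolaryngology - Head and Neck Surgery|Otolaryngology|Otorhinolaryngology"
)
groups = parse_department_synonyms(raw)
expanded = expand_departments(["Public Health"], groups)
print(sorted(expanded))
```

The expanded set is what would then be used when looking up departments in ScienceMetrixDepartmentCategory.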

  4. Does the ISSN exist in the ScienceMetrix table? Check the following possible matches, in order:
  • Article.ISSNLinking = ScienceMetrix.ISSN
  • Elseif Article.ISSNLinking = ScienceMetrix.eISSN
  • Elseif Article.ISSNPrint = ScienceMetrix.ISSN
  • Elseif Article.ISSNPrint = ScienceMetrix.eISSN
  • Elseif Article.ISSNElectronic = ScienceMetrix.ISSN
  • Elseif Article.ISSNElectronic = ScienceMetrix.eISSN

  • If yes, get journalSubfieldLabel and journalSubfieldID. Go to 5.
  • If no, output the following:
journalSubfieldScienceMetrixLabel: null
journalSubfieldScienceMetrixID: null
journalSubfieldDepartment: Anesthesiology
journalSubfieldScore: null
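The ISSN cascade in step 4 could be sketched as follows. This is an illustration only: it treats the ScienceMetrix table as an in-memory list of dicts rather than a DynamoDB table, the function name `find_journal_subfield` is hypothetical, and the ISSN values in the sample data are made-up placeholders.

```python
from typing import Optional

# Order mirrors the checks listed above: (article field, ScienceMetrix field).
MATCH_ORDER = [
    ("ISSNLinking", "ISSN"),
    ("ISSNLinking", "eISSN"),
    ("ISSNPrint", "ISSN"),
    ("ISSNPrint", "eISSN"),
    ("ISSNElectronic", "ISSN"),
    ("ISSNElectronic", "eISSN"),
]


def find_journal_subfield(article: dict, science_metrix_rows: list) -> Optional[dict]:
    """Return the first ScienceMetrix row matching any article ISSN, else None."""
    for article_field, sm_field in MATCH_ORDER:
        issn = article.get(article_field)
        if not issn:
            continue  # the article has no value for this ISSN type
        for row in science_metrix_rows:
            if row.get(sm_field) == issn:
                return row
    return None


# Placeholder data: the ISSNs below are invented for the example.
rows = [
    {
        "ISSN": "0000-0001",
        "eISSN": "0000-0002",
        "journalSubfieldLabel": "Pharmacology & Pharmacy",
        "journalSubfieldID": 122,
    },
]
article = {"ISSNLinking": None, "ISSNPrint": None, "ISSNElectronic": "0000-0002"}
match = find_journal_subfield(article, rows)
print(match["journalSubfieldLabel"] if match else None)
```

Against DynamoDB, each check would presumably be a keyed query rather than a scan over a list.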
  5. Does any organizationalUnit exist for the subfield in question?
  • If yes, go to 6.
  • If no, output
journalSubfieldScienceMetrixLabel: Biochemistry & Molecular Biology
journalSubfieldScienceMetrixID: 86
journalSubfieldDepartment: Anesthesiology
journalSubfieldScore: -1 /* stored in application.properties */
  6. There's a match on subfield and department.

Multiply logOddsRatio by some factor as stored in application.properties:

journalSubfieldFactorScore: 1

Output the following:

journalSubfieldScienceMetrixLabel: Pharmacology & Pharmacy
journalSubfieldScienceMetrixID: 122
journalSubfieldDepartment: Pharmacology
journalSubfieldScore: 2.19 /* value comes from ScienceMetrixDeptCategory */

If there are multiple matches because a user has multiple departments (infrequent but may occur), choose the highest scoring one.
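Steps 4 through 6 could be combined roughly as below. This is a sketch under several assumptions: ScienceMetrixDepartmentCategory is modeled as a dict keyed by (department, subfield ID); `NO_MATCH_SCORE` and `score_journal_subfield` are hypothetical names; the department list is assumed to have been expanded with synonyms already; and when there is no match, the first listed department is reported, which the description above does not specify. The 0.4 value in the sample data is invented.

```python
JOURNAL_SUBFIELD_FACTOR_SCORE = 1  # journalSubfieldFactorScore in application.properties
NO_MATCH_SCORE = -1                # hypothetical name for the -1 stored in application.properties


def score_journal_subfield(subfield, departments, log_odds):
    """Build the journalSubfield* output for one article and target author.

    subfield: ScienceMetrix row (dict) or None if no ISSN matched.
    departments: list of department names for the target author.
    log_odds: {(department, journalSubfieldID): logOddsRatio}.
    """
    first_dept = departments[0] if departments else None
    if subfield is None:  # step 4, "no" branch
        return {
            "journalSubfieldScienceMetrixLabel": None,
            "journalSubfieldScienceMetrixID": None,
            "journalSubfieldDepartment": first_dept,
            "journalSubfieldScore": None,
        }
    subfield_id = subfield["journalSubfieldID"]
    result = {
        "journalSubfieldScienceMetrixLabel": subfield["journalSubfieldLabel"],
        "journalSubfieldScienceMetrixID": subfield_id,
    }
    candidates = [
        (log_odds[(dept, subfield_id)] * JOURNAL_SUBFIELD_FACTOR_SCORE, dept)
        for dept in departments
        if (dept, subfield_id) in log_odds
    ]
    if not candidates:  # step 5, "no" branch
        result["journalSubfieldDepartment"] = first_dept
        result["journalSubfieldScore"] = NO_MATCH_SCORE
        return result
    # Step 6: with several matching departments, keep the highest score.
    best_score, best_dept = max(candidates)
    result["journalSubfieldDepartment"] = best_dept
    result["journalSubfieldScore"] = best_score
    return result


log_odds = {("Pharmacology", 122): 2.19, ("Medicine", 122): 0.4}  # 0.4 is made up
subfield = {"journalSubfieldLabel": "Pharmacology & Pharmacy", "journalSubfieldID": 122}
result = score_journal_subfield(subfield, ["Medicine", "Pharmacology"], log_odds)
print(result)
```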

@paulalbert1 paulalbert1 changed the title Create articleCategoryScore Create journalCategoryScore Jul 28, 2018
@sarbajitdutta sarbajitdutta self-assigned this Aug 1, 2018
@jl987-Jie

Please reference this commit ea3f258 for information on how to query by either eissn or issn.
