-
Notifications
You must be signed in to change notification settings - Fork 0
OSL Internship ‐ 2024 ‐ 2nd Cycle
Ivan Ogasawara edited this page Apr 17, 2024
·
1 revision
An ElasticSearch instance for serving scientific journals metadata. Currently, it has support for biorXiv and medrXiv.
- Documentation: https://thegraphnetwork-literev.github.io/es-journals
- License: BSD 3 Clause: https://github.com/thegraphnetwork-literev/es-journals/blob/main/LICENSE
- Code of Conduct: https://github.com/thegraphnetwork-literev/es-journals/blob/main/CODE_OF_CONDUCT.md
ES-Journals relays on scripts that download articles' metadata from original sources and load it into a ElasticSearch instance.
In some sense it is very similar to https://github.com/CenterForOpenScience/SHARE, but the purpose of ES-Journals is to keep it as simple as possible and serve just the ElasticSearch instance .
Currently, it supports biorXiv and medrXiv.
- Review https://github.com/CenterForOpenScience/SHARE, and check how it get metadata from the journals.
- Add support for arXiv
- Add support for PubMed (maybe via pymedx)
- Add support for PubMedCentral (maybe via pymedx)
- Scripts for download metadata from arXiv and load its data to ES
- Scripts for download metadata from PubMed and load its data to ES
- Scripts for download metadata from PubMedCentral and load its data to ES
- Prerequisites:
- Python
- Django
- Expected Time: 350 hours
- Potential Mentor(s): Ivan Ogasawara