A set of scripts for parsing scientific articles from arXiv.

metalcorebear/arXiv-parser

arXiv Parser

(C) 2021 Mark M. Bailey, PhD

About

These scripts retrieve article metadata from arXiv through its API. The 'arxiv_scraper.py' script saves the Atom XML output from the API as a set of JSON files. The 'arxiv_parse.py' script merges those JSON files into a single JSON file with the arXiv query metadata removed. Together, the scripts are useful for collecting data for meta-analysis of large bodies of scientific work.
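
The two-step workflow described here can be illustrated with a minimal sketch. It is not the code in 'arxiv_scraper.py' or 'arxiv_parse.py', whose internals are not shown in this README. The helper names (fetch_atom, atom_to_records, merge_pages) and the choice of output fields are assumptions made for illustration; only the arXiv API endpoint, its query parameters, and the Atom namespace are taken from the public API.

```python
import json
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

# Public arXiv API endpoint and the standard Atom namespace.
ARXIV_API = "http://export.arxiv.org/api/query"
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}


def fetch_atom(search_query, start=0, max_results=10):
    """Hypothetical helper: request one page of results, return raw Atom XML bytes."""
    params = urllib.parse.urlencode(
        {"search_query": search_query, "start": start, "max_results": max_results}
    )
    with urllib.request.urlopen(f"{ARXIV_API}?{params}") as response:
        return response.read()


def _text(element, tag):
    """Return whitespace-normalized text of a child element, or '' if it is absent."""
    return " ".join(element.findtext(tag, default="", namespaces=ATOM_NS).split())


def atom_to_records(atom_xml):
    """Hypothetical helper: keep only per-article entries.

    Feed-level elements (the query title, result counts, and so on) are
    skipped, which is one way to drop the query metadata.
    """
    root = ET.fromstring(atom_xml)
    records = []
    for entry in root.findall("atom:entry", ATOM_NS):
        records.append(
            {
                # Field selection is an assumption, not the scripts' actual schema.
                "id": _text(entry, "atom:id"),
                "title": _text(entry, "atom:title"),
                "summary": _text(entry, "atom:summary"),
                "published": _text(entry, "atom:published"),
                "authors": [
                    _text(author, "atom:name")
                    for author in entry.findall("atom:author", ATOM_NS)
                ],
            }
        )
    return records


def merge_pages(pages):
    """Hypothetical helper: combine the records from several Atom pages into one list."""
    merged = []
    for page in pages:
        merged.extend(atom_to_records(page))
    return merged
```

Under these assumptions, a combined file could be written with json.dump(merge_pages(pages), f, indent=2), where pages is a list of responses returned by fetch_atom for successive start offsets.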

Future Work

At some point, maybe I will build this into a library.
