Skip to content

Some basic visualization of ArXiv Metadata, see my blog post "arXiv and the Information Explosion"

License

Notifications You must be signed in to change notification settings

edouardfouche/arxiv-information-explosion

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

arXiv and the Information Explosion

Hello, welcome to the companion repository for my blog post: arXiv and the Information Explosion.

Requirements: Python 3, Jupyter notebooks (you may simply get Anaconda ^^)

I used the following packages (other versions may work as well):

  • pandas ==0.25.3
  • numpy==1.18.1
  • scipy==1.4.1
  • matplotlib==2.0.2
  • seaborn==0.9.0

Everything is in the .ipynb file.

jupyter notebook

Get the data from Kaggle and extract arxiv-metadata-oai-snapshot.json in this base folder.

To convert the .pdf to svg (to embed in the blog article), I used pdf2svg

cd plots/
bash pdf2svg.sh

About

Some basic visualization of ArXiv Metadata, see my blog post "arXiv and the Information Explosion"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published