Skip to content
This repository has been archived by the owner on May 16, 2023. It is now read-only.
/ papers-library Public archive

Download and analyze papers from arXiv.org

Notifications You must be signed in to change notification settings

davamix/papers-library

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

32 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Papers Library (WIP)

Get the paper information from arXiv.org based on the ID, take your notes about the paper and save it into a MongoDB database.

Working on the model to analyze paper's data.

Screenshot

Quick Start

Build

docker-compose build

Run

docker-compose up

Navigate to http://127.0.0.1:5000/library

To reset the DB

docker-compose down --volumes

Tools

  • arxiv-downloader.py: Script to download pdf papers from arXiv

  • pdf-image.py: This script converts all the pdf's from the source folder into images

  • split-data.py: This script split the data from a source folder into a training set and validation set.

TODO:

  • Analyze the text and extract other sections in addition to the abstract

About

Download and analyze papers from arXiv.org

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published