IRC bot for fetching papers/pdfs on IRC using phenny
Switch branches/tags
Nothing to show
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
modules
paperbot
phenny @ 7752b56
tests
translation-server @ 4d35648
.gitignore
.gitmodules
README.md
dev-requirements.txt
requirements.txt

README.md

paperbot

Paperbot is an IRC bot that fetches academic papers. It monitors all conversation for links to scholarly content, then fetches the content and posts a public link. This seems to help enhance the quality of discussion and make us less ignorant. When a link fails to lead to a pdf with the zotero translators, paperbot will not attempt further downloads of the paper unless paperbot was specifically spoken to.

## deets

All content is scraped using zotero/translators. These are javascript scrapers that work on a large number of academic publisher sites and are actively maintained. Paperbot offloads links to zotero/translation-server, which runs the zotero scrapers headlessly in a gecko and xulrunner environment. The scrapers return metadata and a link to the pdf. Then paperbot fetches that particular pdf. Sometimes in IRC someone drops a link straight to a pdf, which paperbot is also happy to compulsively archive.

Environment variables:

  • SCIHUB_PASSWORD - must be set to SciHub cookie
  • LOGGING - optional: bot spams logs to a channel defined as "#" plus the value of LOGGING
## TODO

It would be nice to use multiple proxies to resolve a pdf request.

## active demo

say hi to paperbot on irc.freenode.net ##hplusroadmap

## license

BSD.