research
Create beautiful and semantically meaningful articles with pandoc.
S2ORC: The Semantic Scholar Open Research Corpus: https://www.aclweb.org/anthology/2020.acl-main.447/
Python library that classifies content from scientific papers with the topics of the Computer Science Ontology (CSO).
VOSviewer Online is a tool for network visualization. It is a web-based version of VOSviewer, a popular tool for constructing and visualizing bibliometric networks.
Rewrite of Citation Gecko as a React app
An open-source NLP research library, built on PyTorch.
A machine learning software for extracting information from scholarly documents
Online demo without installing at - https://buildit.so/tryit
Flask app for article abstract and listing pages
Simple node.js client for GROBID REST services
GROBID extension for identifying and normalizing physical quantities.
🗃 Open source self-hosted web archiving. Takes URLs/browser history/bookmarks/Pocket/Pinboard/etc., saves HTML, JS, PDFs, media, and more...
A proof of concept to scrape papers from journals
pix2tex: Using a ViT to convert images of equations into LaTeX code.
Community maintained fork of pdfminer - we fathom PDF
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
docTR (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.
DocumentGPT is a web application that allows you to chat over your research document using OpenAI's chat API and perform semantic search using vector databases. This tool provides a seamless interf…
A toolkit for automatically extracting semantic information from PDF files of scientific articles