grobid-parser

A tool that can process scholarly literature into structured dataset.

Setting

You need to prepare a Python environment in your computer. My execution environment python 3.10.15.
Install required Python packages via pip install -r requirements.txt. For your convinence, I recommend you install package under conda virtual environment.
We can use demo site to parse pdf file, so there is no need to start GROBID server in background.

You need to prepare a academic paper in PDF format, and upload it through web interface.
You need to setup following options in .env file.

GROBID_URL=<your-grobid-server-url>
STATIC_FOLDER=<path-to-STATIC-directory>

python app.py

Open a web browser and navigate to http://localhost:5000
Upload a PDF file through the web interface
The application will process the PDF and provide:
- Success/failure notification
- A portion of the content in the parsed results
- Option to download the parsed results as CSV

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
model		model
templates		templates
.gitignore		.gitignore
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt
vercel.json		vercel.json