pdf-parser

Machine-readable grade distributions are Good Bull. If you wind up using this data or code in some form, credit would be appreciated. Just add your name to using.md.

What is it?

This is a simple script designed to:

Download TAMU grade distribution PDFs from previous years (using requests). The university has removed the PDFs from before 2014.
Extract the data in those PDFs (using PyPDF2)
Convert that data into CSV format and save it for use in your ML/stats project, scheduling app, or whatever you might need it for.
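The three steps above can be sketched roughly as follows. This is a minimal illustration, not the code in this repository: the function names are hypothetical, the PyPDF2 calls assume version 2.0 or later (PdfReader and extract_text), and the step that turns extracted text into rows is left out because it depends on the layout of the PDFs.

```python
import csv
import io


def download_pdf(url: str) -> bytes:
    """Fetch one PDF and return its raw bytes (url is supplied by the caller)."""
    import requests  # third-party; imported lazily so the CSV helper runs without it

    response = requests.get(url, timeout=30)
    response.raise_for_status()
    return response.content


def extract_text(pdf_bytes: bytes) -> str:
    """Concatenate the text of every page in the PDF."""
    from PyPDF2 import PdfReader  # assumes PyPDF2 >= 2.0

    reader = PdfReader(io.BytesIO(pdf_bytes))
    return "\n".join(page.extract_text() or "" for page in reader.pages)


def rows_to_csv(rows, header) -> str:
    """Serialize already-parsed rows to CSV text (column names are up to the caller)."""
    buffer = io.StringIO()
    writer = csv.writer(buffer, lineterminator="\n")
    writer.writerow(header)
    writer.writerows(rows)
    return buffer.getvalue()
```

Keeping the download, extraction, and CSV steps in separate functions makes it possible to test the CSV output without network access or a real PDF.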

How do I use it?

If you'd like to use the data, a ZIP file containing all of the scraped CSV data will be published automatically as a release every month.
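Once you have downloaded a release ZIP, something like the sketch below can walk through every CSV inside it using only the standard library. The helper name iter_csv_rows is hypothetical, and the sketch assumes the files are UTF-8 CSVs with a header row; the actual file names and columns in a release may differ.

```python
import csv
import io
import zipfile


def iter_csv_rows(zip_source):
    """Yield (member_name, row_dict) for every CSV file inside the archive.

    zip_source may be a path or a binary file-like object.
    """
    with zipfile.ZipFile(zip_source) as archive:
        for name in archive.namelist():
            if not name.lower().endswith(".csv"):
                continue  # skip anything that is not a CSV
            with archive.open(name) as raw:
                # Assumption: the CSVs are UTF-8 encoded and have a header row.
                text = io.TextIOWrapper(raw, encoding="utf-8", newline="")
                for row in csv.DictReader(text):
                    yield name, row
```

Each row comes back as a dictionary keyed by the CSV header, so you can filter or aggregate without unpacking the archive to disk.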

If you want the PDFs or CSVs individually, run the script yourself (with Python 3 installed):

Create a new virtual environment (I use python3 -m venv env on Ubuntu 18.04) and activate it (source env/bin/activate)
Install the dependencies (pip install -r requirements.txt)
Run python main.py

Why'd you make it?

Texas A&M University likes to publish their grade distributions publicly for record-keeping. The university does not provide access to machine-readable versions of these files without a department head signature and special permission. I don't want to get either of those things.

How do I help?

Pull requests of all kinds are welcome! Some issues I'm trying to tackle:

Automating releases to be every 3 months or so using GitHub Actions.
Refactoring/cleaning up code
Including data from before 2014 (Marcus Salinas has 2012-2014, but I can't find anything from before 2012).
Refining data collection
- Using instructors' real names, rather than "LAST, F."
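For the instructor-name item, one possible starting point is to match each abbreviated "LAST, F." entry against a list of full names from some other source. The sketch below is only an illustration: the candidates function, the regular expression, and the assumption that full names are written "First Last" are all hypothetical, and no such name list exists in this repository.

```python
import re

# Matches strings such as "SMITH, J." and captures the last name and first initial.
ABBREVIATED = re.compile(r"^\s*(?P<last>[^,]+?)\s*,\s*(?P<initial>[A-Za-z])\.?\s*$")


def candidates(abbreviated, full_names):
    """Return the full names whose last name and first initial match."""
    match = ABBREVIATED.match(abbreviated)
    if match is None:
        return []
    last = match.group("last").casefold()
    initial = match.group("initial").casefold()
    hits = []
    for full in full_names:
        parts = full.split()
        if len(parts) < 2:
            continue  # need at least a first and a last name
        if parts[-1].casefold() == last and parts[0][0].casefold() == initial:
            hits.append(full)
    return hits
```

An abbreviated name can match several people, so the function returns every candidate rather than picking one. Multi-word last names are not handled by this simple comparison.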



SaltyQuetzals/TAMU-Grade-Distribution-CSVs
