Skip to content

Delemangi/accreditations-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Accreditations Scraper

Script for scraping the accreditations for FINKI (FCSE).

Installation

If you have Poetry, you can use poetry install, otherwise, pip install -r requirements.txt.

Running

[poetry run] python -m app [-f ...] [-c ...] [-l ...] [-y ...] [-e ...]

Arguments:

  • -c or --cycle: select the cycle of the courses you want to scrape (1, 2, 3, default is all)
  • -y or --year: select the accreditation year of the courses you want to scrape (2023, 2018, 2013, default is all)
  • -l or --limit: select the number of courses you wish to scrape (default is 0 to scrape all)
  • -e or --export: select the .csv file with the results to clean up (this interrupts the normal flow of the script, by only cleaning up an existing file, and not scraping)
  • -f or --file: set the name of the file that will be saved after scraping (default: output.csv)

The script also looks for a file called cookies.json in the root of the project. It should contain the cookies for the session of the site. Here is an example Chrome extension capable of dumping all cookies used by a page. If you do end up using the given Chrome extension, keep in that mind you should also:

  • Change the domain of all cookies to iknow.ukim.mk
  • Set the sameSite property of all cookies to "None"

if you want the scraper to work.

About

Scraper for FINKI (FCSE) accreditations

Resources

License

Stars

Watchers

Forks

Languages