iliasSpider

iliasSpider is a web scraper, written in Python, that downloads your materials from an ILIAS course (at Uni Constance).

Functionality

In its current state, the scraper simply tries to copy the ILIAS file system to your computer, creating the necessary folders along the way. Anything the spider does not recognize is ignored.
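
As a rough illustration of the folder mirroring described above, the following sketch maps a chain of remote folder names to a local path and creates the missing directories. It is not taken from the spider itself: the helper name `local_path_for` and its parameters are hypothetical, and `exist_ok=True` assumes Python 3.

```python
import os
import tempfile


def local_path_for(download_root, remote_segments, filename):
    """Hypothetical helper: map a remote folder chain to a local file path.

    Missing local folders are created on the way (Python 3 only, because
    of the exist_ok argument).
    """
    folder = os.path.join(download_root, *remote_segments)
    os.makedirs(folder, exist_ok=True)
    return os.path.join(folder, filename)


if __name__ == "__main__":
    # Use a throwaway directory so the demo does not touch real data.
    root = tempfile.mkdtemp()
    print(local_path_for(root, ["My Course", "Week 1"], "slides.pdf"))
```

The function only prepares the target location; the download itself would still be handled by the spider.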

Features over original fork

- Correct file extensions
- Only files which do not exist locally are downloaded
- Exclusion based on file formats in the config. By default, .mp4 files are excluded.
- The check whether a file should be downloaded is done before downloading :)
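
The download check from the list above could look roughly like the sketch below. This is an assumption about how such a check might be written, not the spider's actual code: `should_download` and `EXCLUDED_EXTENSIONS` are hypothetical names, and the only default taken from this README is the exclusion of .mp4 files.

```python
import os

# Assumed default, mirroring the README: .mp4 files are skipped.
EXCLUDED_EXTENSIONS = {".mp4"}


def should_download(local_path, excluded=EXCLUDED_EXTENSIONS):
    """Hypothetical check, run before any download is started.

    Returns False for excluded file formats and for files that already
    exist locally, True otherwise.
    """
    extension = os.path.splitext(local_path)[1].lower()
    if extension in excluded:
        return False
    return not os.path.exists(local_path)


if __name__ == "__main__":
    for candidate in ["lecture.mp4", "notes.pdf"]:
        print(candidate, should_download(os.path.join("downloads", candidate)))
```

Running the check first means that excluded or already present files never cause a request for the file content.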

Approach

I am using Scrapy (docs) to log in and download the files.

Scrapy is an application framework for crawling web sites and extracting structured data which can be used for a wide range of useful applications, like data mining, information processing or historical archival.

For those who are not familiar with the Scrapy folder structure: the spider can be found in ilias_spider/spiders/ilias.py.

Get things going

Setup

Install pip (if not already installed): https://pip.pypa.io/en/stable/

FOR LINUX:
- Python 2.x or 3.x https://www.python.org/downloads/
- the Python requirements, installed via pip: $ pip install -r requirements.txt

FOR WINDOWS:
- Python 2.7 (since Python 3 is not supported on Windows with Scrapy)
- follow the instructions to set up Scrapy and restart
- $ pip install keyring

Configure Spiders

An example configuration is given in iliasSpiders.py:

### Spider 1
c = Config(
    'firstname.lastname',         # your ILIAS username
    'entry url',                  # URL where the crawl starts
    '/path/to/download/folder/',  # local download folder
)
runSpider(c)

More spiders can be configured by copying this code snippet.
