Get the best courses
📚 powered by openedx offline ⬇️
An offliner to create ZIM
Openedx is one of the most popular open source MOOC platforms which revolves around the idea of xblocks. It makes e-learning more accessible by providing an easy way to create courses for teachers, universities and others. It is used by many e-learning services as such as edX as a tool to create, organize and manage MOOCs quite easily.
This project is aimed at creating a tool to make openedx based MOOCs more accessible by creating ZIM files providing the same course materials and resources offline.
Install the dependencies
Make sure that you have
curl installed on your system before running the scraper (otherwise you'll get a warning to install them).
Enroll into the MOOC
You must be enrolled into the mooc you want to offline. Ensure that you do not open the openedx instance in the browser with the same account while the scraper runs. Also, this scraper must be used only with a MOOC with a free license.
Setup the package
One can eaisly install the PyPI version but let's setup the source version. Firstly, clone this repository and install the package as given below.
pip3 install -r requirements.txt
python3 setup.py install
That's it. You can now run
openedx2zim from your terminal
openedx2zim --course-url [URL] --email [EMAIL] --name [NAME]
For the full list of arguments, see this file or run the following
openedx2zim --course-url="https://courses.edx.org/courses/course-v1:edX+edx201+1T2020/course/" --publisher="edx201" --email="email@example.com" --name="sample" --tmp-dir="output" --output="output" --debug --keep --format="mp4"
You can create ZIMs for MOOCs powered by the openedx platform (find a list of openedx powered instances here), choose between different video formats (webm/mp4), different compression rates, and even use an S3 based cache.
The answers can be extracted only for "multiple choice question" type problems with single answer correct and multiple answer correct (only if the number of options in that case is at most 5). This is due to large number of requests required to extract answers for other types of answers. For more information, refer here.