Skip to content
Tools for working with (open) OU-XML docs
Python XSLT Other
Branch: master
Clone or download

Latest commit

Fetching latest commit…
Cannot retrieve the latest commit at this time.

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.jupyter/custom
binder
ouxml
utils
.gitignore
LICENSE
MANIFEST.in
README.md
setup.py

README.md

open-ouxml-tools

Tools for working with (open) OU-XML docs.

Binder

This package provides a range of command line and API tools for:

  • grabbing OU-XML "source" versions of OpenLearn units, along with related image assets, and storing them in a SQLite3 database;
  • converting OU-XML "source" documents to markdown files.

The markdown files can then be edited as simple text documents and used as part of markdown web publishing workflows.

The Binderised version of this repo allows you to test the package and generate markdown versions of OpenLearn units from their source OU-XML. The Binderised repo also installs Jupytext, which allows the markdown files to be edited in a Jupyter notebook interface. (Files cannot be saved directly back to Github from a MyBinder environment; they need to be exported and then uploaded to a Github repo, or example. The nbarchive extension is also installed in the Binderised environment to make it easier to export generated markdown files, etc.)

Quick example:

Run in MyBinder, open this README.md from the notebook homepage and it will open in a notebook UI.

Run the code cells (or from the notebook cell menu, select Run All).

# List units by search keywords
! ouxml_units --term "history scottish"

Having listed units of interest to you, you can grab scrape the OU-XML and image content from a selected URL with the following command (note that by default, a clean copy of the database is created each time you run the followig command; I still need to tweak the code to cleanly extract units from the db containing assets associated with multiple units):

# Grab XML for an OpenLearn unit
# For some reason, this may take ages:-(
! ouxml_grab https://www.open.edu/openlearn/science-maths-technology/chemistry/the-molecular-world/content-section-1.1

Once you have downloaded the assets, you can convert the XML to markdown files in a specified output directory (it will be automatically created if it does not already exist):

# Generate markdown from OU-XML
! ouxml2md --dbname openlearn_oer.db --outdir demo

In the above example, markdown files and images for the unit will appear in the demo directory.

If you run this in MyBinder, from the notebook homepage, you can navigate to the folder the generated markdown was placed in. click on a markdown file link, and through the magic of Jupytext, edit it in a notebook UI.

OU staff may wonder whether the same approach can be used to convert OU-XML for current OU modules to markdown too. Yes it can... Get in touch...

You can’t perform that action at this time.