Returns the text lines of a PDF and their bounding boxes as JSON. Requires Python 3 and MuPDF; it is convenient to run under nix-shell -p python3 -p mupdf.
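
As a rough illustration of the idea (not the actual script), the sketch below calls MuPDF's mutool to dump structured text as XML and converts each line element into a JSON record. The function names parse_stext and extract_lines and the output fields page, bbox, and text are hypothetical. The XML layout (page, line with a bbox attribute, char with a c attribute) is an assumption about mutool's stext output and may vary between MuPDF versions.

```python
import json
import subprocess
import sys
import xml.etree.ElementTree as ET


def parse_stext(xml_text):
    """Convert structured-text XML into a list of line records.

    Assumes <page> elements containing <line bbox="x0 y0 x1 y1">
    elements whose descendant <char c="..."> elements carry the text.
    """
    root = ET.fromstring(xml_text)
    lines = []
    for page_no, page in enumerate(root.iter("page"), start=1):
        for line in page.iter("line"):
            bbox = [float(v) for v in line.get("bbox", "").split()]
            text = "".join(ch.get("c", "") for ch in line.iter("char"))
            lines.append({"page": page_no, "bbox": bbox, "text": text})
    return lines


def extract_lines(pdf_path):
    """Run mutool on the PDF and parse its structured-text output."""
    result = subprocess.run(
        ["mutool", "draw", "-F", "stext", pdf_path],
        check=True,
        capture_output=True,
        text=True,
    )
    return parse_stext(result.stdout)


if __name__ == "__main__" and len(sys.argv) > 1:
    json.dump(extract_lines(sys.argv[1]), sys.stdout, indent=2)
```

Saved under a hypothetical name such as lines.py, it could be run as: nix-shell -p python3 -p mupdf --run "python3 lines.py input.pdf". Keeping the parsing separate from the mutool call lets the conversion be checked without a PDF at hand.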