Python module for interacting with data from the OOI CamHD seafloor camera system

tjcrone/pycamhd

PyCamHD

This repository contains a Python module for interacting with data from the OOI CamHD seafloor camera system stored in the raw data archive. It can be used to obtain information about remote CamHD files or retrieve individual frames from these files without downloading them entirely. This code can also work with files on the local filesystem.

Requirements

This module currently works only with Python 3.x. It also requires NumPy, Requests, and PyAV>=0.4.0.

Installation

$ pip install pycamhd

Basic Usage

Get a single frame from a remote CamHD file as a NumPy array:

>>> import pycamhd as camhd
>>> import numpy as np
>>> filename = 'https://rawdata.oceanobservatories.org/files/RS03ASHS/PN03B/06-CAMHDA301/2016/11/13/CAMHDA301-20161113T000000Z.mov'
>>> moov_atom = camhd.get_moov_atom(filename)
>>> frame_count = camhd.get_frame_count(filename, moov_atom)
>>> print(frame_count)
>>> frame_number = 7200
>>> frame = camhd.get_frame(filename, frame_number, 'rgb24', moov_atom)

Note: Obtaining the moov_atom first and passing it to any function is optional, but doing so will greatly speed up repeated calls to most functions for the same file. When multiple frames are to be obtained from the same file, getting the moov_atom first is recommended.
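
The reuse pattern described in the note can be sketched as a small helper. This is an illustration only: get_frames is a hypothetical name that is not part of pycamhd, and the module is passed in as the camhd argument purely so the sketch can be exercised without network access. It relies only on get_moov_atom() and get_frame() as documented in this README.

```python
def get_frames(camhd, filename, frame_numbers, pix_fmt='rgb24'):
    """Fetch several frames from one file, retrieving the moov atom once.

    `camhd` is the imported pycamhd module, or any object exposing the same
    get_moov_atom() and get_frame() functions. This helper is hypothetical
    and is not provided by pycamhd itself.
    """
    # One request for the moov atom, shared by every get_frame() call below.
    moov_atom = camhd.get_moov_atom(filename)
    return [camhd.get_frame(filename, n, pix_fmt, moov_atom)
            for n in frame_numbers]


# Typical use (requires network access and pycamhd installed):
#   import pycamhd as camhd
#   frames = get_frames(camhd, filename, [0, 7200, 14400])
```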

Get information about the remote archive:

>>> (file_count, total_size) = camhd.get_stats()
>>> print(file_count)
>>> print(total_size)
>>> file_list = camhd.get_file_list()
>>> for filename in file_list:
...   print(filename)

Note: Getting information about the archive can take several minutes, depending on server response times, because every index file must be downloaded.
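
Because building the file list is slow, it may be worth saving the result locally between sessions. The following is a minimal sketch of one way to do that; cached_file_list and the cache file name are hypothetical and are not part of pycamhd, and the cache is never refreshed automatically.

```python
import json
import os


def cached_file_list(fetch, cache_path):
    """Return the file list from a local JSON cache, fetching it on a miss.

    `fetch` is any zero-argument callable returning a list of strings,
    for example pycamhd.get_file_list. This helper is hypothetical.
    """
    if os.path.exists(cache_path):
        with open(cache_path) as f:
            return json.load(f)
    file_list = list(fetch())  # the slow step: downloads every index file
    with open(cache_path, 'w') as f:
        json.dump(file_list, f)
    return file_list


# Typical use (requires network access and pycamhd installed):
#   import pycamhd as camhd
#   file_list = cached_file_list(camhd.get_file_list, 'camhd_files.json')
```

Delete the cache file to force a fresh listing when new files have been added to the archive.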

Function Reference

Retrieve a Single Frame from File

pycamhd.get_frame(filename, frame_number[, pix_fmt [, moov_atom]])
Retrieve a single frame from a remote or local file. pix_fmt should be one of the following: 'rgb24', 'bgr24', 'rgb48le', 'rgb48be', 'bgr48le', 'bgr48be', 'gray', 'gray16le', or 'gray16be'. The default pix_fmt is 'rgb24'. moov_atom should be a string containing raw packed binary data as returned by get_moov_atom(). Returns a NumPy array with a shape and datatype appropriate to the frame size and pix_fmt.
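
When choosing a pix_fmt it can help to estimate how much memory a decoded frame will occupy. The sketch below uses the channel counts and sample widths from the FFmpeg pixel format definitions; the table and function names are hypothetical, and the exact array shape returned by get_frame() is not specified here, so only the total byte count is computed.

```python
# pix_fmt -> (channels, bytes per sample), following FFmpeg's definitions.
# This table is illustrative and is not part of pycamhd.
PIX_FMT_LAYOUT = {
    'rgb24': (3, 1), 'bgr24': (3, 1),
    'rgb48le': (3, 2), 'rgb48be': (3, 2),
    'bgr48le': (3, 2), 'bgr48be': (3, 2),
    'gray': (1, 1), 'gray16le': (1, 2), 'gray16be': (1, 2),
}


def decoded_frame_nbytes(pix_fmt, width, height):
    """Return the number of bytes of pixel data in one decoded frame."""
    channels, sample_bytes = PIX_FMT_LAYOUT[pix_fmt]
    return width * height * channels * sample_bytes


# The 1920x1080 dimensions below are an example only.
print(decoded_frame_nbytes('rgb24', 1920, 1080))     # 6220800
print(decoded_frame_nbytes('gray16le', 1920, 1080))  # 4147200
```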

Get Archive Stats

pycamhd.get_stats()
Return the total number of MOV files and the total size of the MOV files (in TB) in the data archive. Returns an integer and a float.
pycamhd.get_file_list()
Return a list of all MOV files in the data archive as fully-qualified URLs. Returns a list of strings.
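
The list returned by get_file_list() can be narrowed to a time window by parsing the timestamp embedded in each file name. This sketch assumes every file name ends in the pattern seen in the Basic Usage example (CAMHDA301-20161113T000000Z.mov) and that the trailing Z denotes UTC; file_time and files_between are hypothetical helpers, not pycamhd functions.

```python
import re
from datetime import datetime, timezone

# Matches the trailing timestamp, e.g. "20161113T000000" in
# "CAMHDA301-20161113T000000Z.mov" (assumed naming pattern).
_TIMESTAMP = re.compile(r'(\d{8}T\d{6})Z\.mov$')


def file_time(url):
    """Return the UTC datetime encoded in a file URL, or None if absent."""
    match = _TIMESTAMP.search(url)
    if match is None:
        return None
    parsed = datetime.strptime(match.group(1), '%Y%m%dT%H%M%S')
    return parsed.replace(tzinfo=timezone.utc)


def files_between(file_list, start, end):
    """Return the URLs whose timestamp t satisfies start <= t < end."""
    selected = []
    for url in file_list:
        t = file_time(url)
        if t is not None and start <= t < end:
            selected.append(url)
    return selected
```

For example, files_between(file_list, datetime(2016, 11, 1, tzinfo=timezone.utc), datetime(2016, 12, 1, tzinfo=timezone.utc)) would keep only the files recorded in November 2016.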

Get File Information

pycamhd.get_atom_sizes(filename)
Return the sizes of the three top-level atoms in a remote file. Returns three integers.
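
For background, every QuickTime atom starts with a 4-byte big-endian size followed by a 4-byte type code, and a size field of 1 means an 8-byte extended size follows the type. The sketch below reads one such header from a bytes buffer. It illustrates the file format only and is not a description of how get_atom_sizes() is implemented; read_atom_header is a hypothetical name.

```python
import struct


def read_atom_header(data, offset=0):
    """Return (size, type, header_length) for the atom starting at `offset`."""
    size, atom_type = struct.unpack_from('>I4s', data, offset)
    header_length = 8
    if size == 1:
        # A size of 1 signals a 64-bit extended size after the type code.
        (size,) = struct.unpack_from('>Q', data, offset + 8)
        header_length = 16
    return size, atom_type.decode('latin-1'), header_length


# A synthetic 20-byte 'ftyp' atom: 8-byte header plus 12 bytes of payload.
example = struct.pack('>I4s', 20, b'ftyp') + b'\x00' * 12
print(read_atom_header(example))  # (20, 'ftyp', 8)
```
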
pycamhd.get_chunk_count(filename[, moov_atom])
Return the number of video chunks in a remote file. moov_atom should be a string containing raw packed binary data as returned by get_moov_atom(). Returns an integer.
pycamhd.get_chunk_offsets(filename[, moov_atom])
Return the offsets of all chunks in a remote file. Returns a list of integers.
pycamhd.get_frame_count(filename[, moov_atom])
Return the number of frames in a remote file. Returns an integer.
pycamhd.get_frame_sizes(filename[, moov_atom])
Return the sizes of all frames in a remote file. Returns a list of integers.
pycamhd.get_frame_offsets(filename[, moov_atom])
Return the offsets of all frames in a remote file. Returns a list of integers.
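
The offset and size lists can be combined to locate a frame within the file. The sketch below assumes the two lists are parallel, that frame numbers index them from zero, and that offsets are measured from the start of the file; none of these is stated explicitly above. frame_byte_span is a hypothetical helper, and it returns an exclusive stop position, so check whether get_bytes() expects an inclusive end before passing the result on.

```python
def frame_byte_span(frame_offsets, frame_sizes, frame_number):
    """Return (start, stop) for one frame, where stop is exclusive."""
    if len(frame_offsets) != len(frame_sizes):
        raise ValueError('offset and size lists differ in length')
    start = frame_offsets[frame_number]
    return start, start + frame_sizes[frame_number]


# Typical use (requires network access and pycamhd installed):
#   offsets = camhd.get_frame_offsets(filename, moov_atom)
#   sizes = camhd.get_frame_sizes(filename, moov_atom)
#   start, stop = frame_byte_span(offsets, sizes, 7200)
```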

Get File Components

pycamhd.get_moov_atom(filename)
Retrieve the moov atom from a remote file. Returns a string containing raw packed binary data.
pycamhd.get_frame_data(filename, frame_number[, moov_atom])
Retrieve the raw ProRes encoded frame data from a frame in a remote file. Returns a string containing raw packed binary data.

Decode Frame Data

pycamhd.decode_frame_data(frame_data[, pix_fmt])
Decode ProRes frame data into an image. pix_fmt should be one of the following: 'rgb24', 'bgr24', 'rgb48le', 'rgb48be', 'bgr48le', 'bgr48be', 'gray', 'gray16le', or 'gray16be'. The default pix_fmt is 'rgb24'. Returns a NumPy array with a shape and datatype appropriate to the frame size and pix_fmt.
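
Splitting retrieval from decoding is useful when the same frame is needed in more than one pixel format, since the encoded data only has to be downloaded once. A minimal sketch, in which decode_as is a hypothetical helper and the module is passed in as the camhd argument so the sketch can be tried without network access:

```python
def decode_as(camhd, filename, frame_number, pix_fmts, moov_atom=None):
    """Download one encoded frame and decode it into each requested pix_fmt.

    Returns a dict mapping each pix_fmt to its decoded image. `camhd` is the
    imported pycamhd module or an object with the same functions.
    """
    if moov_atom is None:
        moov_atom = camhd.get_moov_atom(filename)
    # Single download of the raw ProRes data for this frame.
    frame_data = camhd.get_frame_data(filename, frame_number, moov_atom)
    return {fmt: camhd.decode_frame_data(frame_data, fmt) for fmt in pix_fmts}


# Typical use (requires network access and pycamhd installed):
#   import pycamhd as camhd
#   images = decode_as(camhd, filename, 7200, ['rgb24', 'gray16le'])
```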

Low-level Functions

pycamhd.get_bytes(filename, byte_range)
Retrieve a subset of bytes from a remote file. filename should be a fully qualified URL specifying a remote CamHD QuickTime MOV file. byte_range should be a two-element list. Returns a string containing raw packed binary data.
pycamhd.get_integer(filename, byte_range)
Return a 32-bit or 64-bit big-endian integer from a remote file. byte_range should be a two-element list specifying a 4-byte or 8-byte range.
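
To show what big-endian decoding means here, the following sketch converts 4 or 8 raw bytes into an integer using only the standard library. It operates on bytes already in hand and is not the implementation of get_integer(); decode_be_integer is a hypothetical name.

```python
def decode_be_integer(raw):
    """Decode 4 or 8 raw bytes as an unsigned big-endian integer."""
    if len(raw) not in (4, 8):
        raise ValueError('expected 4 or 8 bytes, got %d' % len(raw))
    # Big-endian: the most significant byte comes first.
    return int.from_bytes(raw, byteorder='big', signed=False)


print(decode_be_integer(b'\x00\x00\x01\x00'))  # 256
```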

Misc

pycamhd.__version__
The current version number of the module.

License

MIT License Copyright (c) 2016 Timothy Crone

Contributors

Timothy Crone (tjcrone@gmail.com)
Friedrich Knuth (friedrich.knuth@gmail.com)
