ProViM (Basic Processing for Visualization and Multivariate Analysis of MSI Data) is a fast and easy to use preprocessing pipeline for Mass Spectrometry Imaging (MSI) data. The pipeline consists of multiple modules which can be chained together.
- Start Docker, navigate into the provim directory and call
docker build -t provim/pipeline .
to build the docker container. - Run the docker container with
docker run -v path-to-data:/data --rm provim/pipeline python script.py
. Usingscript.py -h
will provide all necessary information about parameters and functionality of the script.
The general workflow, with necessary and optional scipts indicated is as follows:
workflow_pybasis.py
(required)fast_convert_hdf.py
(required)matrix_preprocessing.py
(optional)
3.1.interactive_matrix_detection.py
/automated_matrix_detection.py
(optional, the interactive variant is highly recommended!)
3.2.matrix_postprocessing.py
(optional)workflow_peakpicking.py
(optional)msi_image_writer.py
(optional) While only step one and two are required and the remaining steps can be chained to specific needs, it is recommended to not interchange the order of the scripts and only skip single steps on demand.
The following scripts can also be used stand alone and therefore are easily to combine with other processing pipelines:
a. workflow_peakpicking.py
b. msi_image_writer.py
c. msi_dimension_reducer.py
If the existing file type is imzML instead of HDF5 we refer to https://github.com/Kawue/imzML-to-HDF5.
- Install Anaconda.
- Install the environment with the
environment.yml
by callingconda env create -f environment.yml
. - Activate the environment with
conda activate provim
and follow the General Workflow. - Call the scripts manually.
- Visit https://docs.docker.com/toolbox/toolbox_install_windows/. Follow the given instructions and install all required software to install the Docker Toolbox on Windows.
- Control if virtualization is enabled on your system. Task Manager -> Performance tab -> CPU -> Virtualization. If it is enabled continue with Step X.
- If virtualization is disabled, it needs to be enabled in your BIOS. Navigate into your systems BIOS and look for Virtualization Technology (VT). Enable VT, save settings and reboot. This option is most likely part of the Advanced or Security tab. This step can deviate based on your Windows and Mainboard Manufacturer.
- Open your CMD as administrator and call
docker-machine create default --virtualbox-no-vtx-check
. A restart may be required. - In your OracleVM VirtualBox selected the appropriate machine (probably the one labeled "default") -> Settings -> Network -> Adapter 1 -> Advanced -> Port Forwarding. Click on "+" to add a new rule and set Host Port to 8080 and Guest Port to 8080. Be sure to leave Host IP and Guest IP empty. Also, add another rule for the Port 5000 in the same way. A restart of your VirtualBox may be needed.
- Now you should be ready to use the Docker QuickStart Shell to call the Docker commands provided to start this tool.
Remark: Volume mount paths in the "Docker Quickstart Terminal" need to be in Linux style, i.e. C:\Users becomes /c/Users (volume letter becomes lowercase!).
For an early exploration of the HDF converted data sets we suggest du use VAIDRA (https://github.com/Kawue/vaidra) a visual analysis tool to explore data sets using dimension reduction methods.