A software library for segmenting and measuring of sedimentary particles in images. The segmentation is done with the Cellpose algorithm, designed for cell and nucleus segmentation in biomedical images. Its segmentation capability is transferred to geoscientific applications throurgh re-training of the model with images of sediment particles. Pretrained segmentation models from our data are available or custom models can be trained (see paper for details).
If you use software and/or data from here in your research, please cite the following works:
-
Mair, D., Witz, G., Do Prado, A.H., Garefalakis, P. & Schlunegger, F. (2023) Automated detecting, segmenting and measuring of grains in images of fluvial sediments: The potential for large and precise data from specialist deep learning models and transfer learning. Earth Surface Processes and Landforms, 1–18. https://doi.org/10.1002/esp.5755.
-
Stringer, C.A., Pachitariu, M., (2021). Cellpose: a generalist algorithm for cellular segmentation. Nat Methods 18, 100–106. https://doi.org/10.1038/s41592-020-01018-x.
If you use the human-in-the-loop approach for training custom models (see below), please also cite:
- Pachitariu, M. & Stringer, C. (2022): Cellpose 2.0: how to train your own model. Nat. Methods 19, 1634–1641. https://doi.org/10.1038/s41592-022-01663-4.
If you use ImageGrains to calculate percentile uncertainties please also cite:
- Mair, D., Henrique, A., Prado, D., Garefalakis, P., Lechmann, A., Whittaker, A., and Schlunegger, F. (2022): Grain size of fluvial gravel bars from close-range UAV imagery-uncertainty in segmentation-based data, Earth Surf. Dyn., 10,953-973. https://doi.org/10.5194/esurf-10-953-2022.
The easiest way to install the software is by using the conda package manager. If you do not have conda installed, please follow the instructions on the conda website. If you encounter problems during installation, have a look here. If these methods do not solve them, please open an issue.
To install the software, open an anaconda prompt / command prompt, then create a new environment with:
conda create --name imagegrains -c conda-forge python=3.9 imagecodecs
and activate it with:
conda activate imagegrains
Then install the package using
pip install imagegrains
If you want access to the cellpose GUI for retraining use:
python -m pip install cellpose[gui]
This installs by default the cellpose package (python -m pip install cellpose[gui]
).
By default, cellpose will run on the CPU. To use a GPU version on Windows or Linux, you will have to make sure you have a GPU compatible PyTorch version. For this:
-
Uninstall the PyTorch version that gets installed by default with Cellpose:
pip uninstall torch
-
Make sure your have up-to-date drivers for your NVIDIA card installed.
-
Re-install a GPU version of PyTorch via conda using a command that you can find here (this takes care of the cuda toolkit, cudnn etc. so no need to install manually anything more than the driver). The command will look like this:
conda install pytorch torchvision cudatoolkit=11.3 -c pytorch
If you work on a Mac you can try to install the Cellpose package manually with experimental M1 support. Details and more installation options of cellpose (including GPU versions for Windows and Linux) are also found here.
ImageGrains is organised in 3 main modules for Segmentation, Grain size measurements and Grain size distribution (GSD) analysis (see below). The most basic option is to run ImageGrains from the console by first activating the enviromnent:
conda activate imagegrains
Then download the pretrained models and demo data by executing:
python -m imagegrains --download_data True
To start an analysis from the console run:
python -m imagegrains --img_dir F:/REPLACE_WITH_PATH_TO_FOLDER_OF_IMAGES_(JPEG)
This will run the main application with the default settings on images in the provided location. You can use --help
to see all input options. Alternatively, you can run the jupyter notebooks in /notebooks
in their order. They offer more options and information for most workflow steps. Of course, any modules and functions can be combined in custom scripts or notebooks.
If you want to segment own images with pre-trained models, simply use the corresponding jupyter notebook notebooks/1_image_segmentation.ipynb
. To do so locally, open the console and activate the environment (conda activate imagegrains
) and start your jupyter instance (e.g., viajupyter lab
). Then, open the notebook and follow the instructions. You can use any model provied in /models
or train a custom model (see below).
To measure grain sizes, use the jupyter notebook notebooks/2_grain_sizes.ipynb
. It will load the segmented images and calculate the grain size for each grain on an image-by-image basis. Several options for outline fitting are available. The grain size is then scaled with the image resolution and stored in an output file. It is also possible to export individual grain outlines for further analysis.
To analyze the GSD, use the jupyter notebook notebooks/3_gsd_analysis.ipynb
. It will load the grain size measurements and calculate the GSD. Several for the uncertainty estimation are available. The uncertainty by default is calculated for each perecentile as 95% confidence interval. The GSD is then stored in an output file.
If you want to train your own models, you can use the jupyter notebook notebooks/4_model_training.ipynb
, you can use the Cellpose GUI (start it with python -m cellpose
) or train via console with the full funcitionality of Cellpose. To train custom models, you will first need manually annotated ground truth data ("labels"). This can be done either with the Cellpose GUI or with any dedicated annotation tool. We used the labkit plugin for ImageJ. Please note, that each grain has to have a unique class value.
- If you have problems with the pip installation, you can also install the package directly from the repository. If you have trouble building from the repository, make sure you have
git
installed and to have it in your path (this can be tricky on Windows). To install from the repository, use:
pip install git+https://github.com/dmair1989/imagegrains.git
- If you still have trouble, you can install the dependencies manually and use the jupyter notebooks in
/notebooks
to run the software and locally import the python files (i.e., by making sure the notebooks and files fromsrc/imagegrains/
are in the same folder and by changing any import statementfrom imagegrains import
toimport
). The dependencies are:
cellpose
matplotlib
scikit-image
pandas
scanpy
jupyter lab
- If you run into problems with OpenMP and libiomp5, you can try to create the environment using the
nomkl
package (more details here) before installing imagegrains with:
conda create --name imagegrains -c conda-forge python=3.8 imagecodecs nomkl