Contains a set of modules to analyze and visualize data from block1 and block2 of the experiment from September 2021.
This repository is based on Python and therefore requires conda and python-pip for installation. The following repositories are project dependencies that have to be built inside the underlying environment:
- Environment installation using Conda, including the Python environment and C++ dependencies

```bash
conda env create -n toolbox --file environment.yml
conda activate toolbox
```
- Python package installation using python-pip

```bash
# conda environment should be activated
python -m venv .venv  # create the python virtual environment
source .venv/bin/activate
pip install -r requirements.txt
```
- Fishproviz project-dependencies installation

```bash
# working directory should be equal to <path/to/fishtoolbox>
# conda environment should be activated
# python venv should be activated
cd ..
git clone git@github.com:lukastaerk/Fish-Tracking-Visualization.git
cd Fish-Tracking-Visualization
python setup.py install
```
- Motionmapper project-dependencies installation

```bash
# working directory should be equal to <path/to/fishtoolbox>
# conda environment should be activated
# python venv should be activated
cd ..
git clone git@github.com:lukastaerk/motionmapperpy.git
cd motionmapperpy
python setup.py install
```
- Start on the GPU

```bash
sbatch scripts/hpc-python.sh
```

- NOTEBOOK
  - `conda activate rapids-22.04`
  - Type `ifconfig` and get the `inet` entry for `eth0`, i.e. the IP address of the node
  - `srun --pty --partition=ex_scioi_gpu --gres=gpu:1 --time=0-02:00 bash -i` to start a new shell with a GPU
  - `ssh -L localhost:5000:localhost:5000 user.name@[IP address]` on your local machine
  - `jupyter-lab --no-browser --port=5000`
- set the `BLOCK` variable to `BLOCK1` or `BLOCK2` in `config.py`
- set the `projectPath` variable in `config.py` to the path of a new folder; this is where the data will be stored
- set up Fishproviz with the correct paths and area configurations
- export the preprocessed data with `python3 -m data_factory.processing`
- repeat for the other block
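The two `config.py` variables named in the steps above might look like this; the variable names come from the steps, the values are placeholders:

```python
# Sketch of the relevant config.py entries (values are illustrative placeholders)
BLOCK = "BLOCK1"  # switch to "BLOCK2" for the second export run
projectPath = "/path/to/new/data/folder"  # preprocessed data is written here
```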
`parameters = set_parameters()` returns the parameters that are used throughout the fishtoolbox.

- `load_trajectory_data_concat` loads the x, y coordinates, the projections (the three features), the time index, and the area
- `load_zVals_concat` loads the umap data
- `load_clusters_concat` loads the cluster labels for individuals and days; set `parameters.kmeans = 5` to specify the clustering that you want to load
There are three ways in this module to compute plasticity.
- `compute_cluster_entropy` computes the cluster entropy for each individual and day, using either the watershed regions or the kmeans clusters, depending on the function provided to load the corresponding clusters.
- `compute_coefficient_of_variation` computes the coefficient of variation for each individual and day.
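As a minimal sketch of the two measures (not the fishtoolbox implementation), assuming the cluster labels and feature values for one individual and day arrive as plain Python sequences:

```python
import math

def cluster_entropy(labels):
    """Shannon entropy of a sequence of cluster labels (one individual/day)."""
    counts = {}
    for label in labels:
        counts[label] = counts.get(label, 0) + 1
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in counts.values())

def coefficient_of_variation(values):
    """Standard deviation divided by the mean of a feature series."""
    n = len(values)
    mean = sum(values) / n
    std = math.sqrt(sum((v - mean) ** 2 for v in values) / n)
    return std / mean
```

An individual that spends equal time in every cluster maximizes the entropy; one that never leaves a cluster (or never varies a feature) scores zero on both measures.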
Records: a function to export the averaged step lengths to a csv file and melt them into a long-format table for statistical analysis (repeatability).
From means of features (step length, turning angle, wall distance) over batches of, e.g., 60 data frames, produce a long table recording block number and individual id.
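A minimal stdlib sketch of the melting step; the `wide_rows` layout (one dict of per-batch means per individual) is an assumption for illustration, not the repository's actual data structure:

```python
def melt_feature_means(wide_rows):
    """Turn wide rows of per-batch feature means into long-format records.

    wide_rows: list of dicts like {"block": 1, "id": "fish_01", "means": [...]}
    Returns one record per batch mean: (block, id, batch_index, mean_value).
    """
    long_table = []
    for row in wide_rows:
        for batch_idx, mean in enumerate(row["means"]):
            long_table.append((row["block"], row["id"], batch_idx, mean))
    return long_table
```

The long format keeps one observation per line, which is what repeatability models typically expect.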
The research question is how many samples are needed to get a good estimate of the repeatability. Given a table with means of a feature (step length) over a number of consecutive data frames, we can sample from this table a number of minutes for a number of days. Furthermore, we compare the effect of sampling the time of day only once for all days versus sampling the time of day independently for each day.
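The two sampling schemes can be sketched as follows; the `{day: [per-minute means]}` layout and the `sample_minutes` helper are assumptions for illustration, not fishtoolbox functions:

```python
import random

def sample_minutes(table, n_days, n_minutes, per_day=False, seed=0):
    """Sample minute indices from a {day: [per-minute feature means]} table.

    per_day=False: draw one set of minutes and reuse it for every sampled day.
    per_day=True:  draw a fresh set of minutes independently for each day.
    """
    rng = random.Random(seed)
    days = rng.sample(sorted(table), n_days)
    samples = {}
    if not per_day:
        minutes = rng.sample(range(len(table[days[0]])), n_minutes)
    for day in days:
        if per_day:
            minutes = rng.sample(range(len(table[day])), n_minutes)
        samples[day] = [table[day][m] for m in minutes]
    return samples
```

With `per_day=False` the same time-of-day indices recur across days, so day-to-day differences are not confounded with time-of-day differences; with `per_day=True` each day contributes independently drawn minutes.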
- `ethnogram_of_clusters`
- TODO: check the new area files, see if there are significant updates for any of them and what the difference is, and decide whether we need a refined `get_area_function(fishkey, day)`