BLoad: Enhancing Neural Network Training with Efficient Sequential Data Handling

Welcome to the official repo for BLoad: Enhancing Neural Network Training with Efficient Sequential Data Handling. We provide an example of using our strategy on top of the Action Genome: Actions as Composition of Spatio-temporal Scene Graphs.

Overview

The repo has three main components:

AG.py: (Action Genome) Dataset Class: A foundational class for loading and preprocessing image data.
AG_BLoad.py: An extension of the AG dataset class, adopting the BLoad strategy.
test_loader.py: A utility script for demonstrating dataset loading and iterating through the data in batches.

Installation

Before using the toolkit, ensure you have the following dependencies installed:

PyTorch
Pillow
NumPy
tqdm

You can install these packages using pip.

pip install torch pillow numpy tqdm

AG (Action Genome) Dataset Class

The AG class is the core dataset class for loading and preprocessing data. It supports reading image frames and corresponding annotations, applying transformations, and preparing the dataset for model training or evaluation.

AG_BLoad

AG_BLoad implements the method described in the white paper. It uses a dictionary containing the number of frames as keys and the video name as a list of values.

Features

Block Generation: Organizes videos into blocks based on frame counts for efficient loading.
Dynamic Randomization: Ensures diversity in training and evaluation by changing block composition each epoch.
Batch Processing: Custom collate function to group data into batches, handling padding and resets as necessary.

Usage

This method can be applied to several datasets which contains different length inputs, be it videos, audios, etc.

Running the Test Loader

Execute the script with the necessary arguments.

python test_loader.py --AG_path "path/to/Action Genome/dataset" --max_size_defined 800

For the Action Genome dataset, it is expected to be in the following structure:

root/
├── frames/
│   ├── video1/
│   ├── video2/
│   ├── video3/
└── annotations/
    ├── AG_HOIA_train_sgdet.pkl
    ...

This will load the dataset, create a DataLoader, and iterate through the dataset in batches, printing progress with tqdm.

Contributing

Contributions to improve the toolkit are welcome. Please follow standard GitHub practices for submitting pull requests.

Citation

If you use this strategy on your project, please cite our work:

@misc{ruschel2023bload,
      title={BLoad: Enhancing Neural Network Training with Efficient Sequential Data Handling}, 
      author={Raphael Ruschel and A. S. M. Iftekhar and B. S. Manjunath and Suya You},
      year={2023},
      eprint={2310.10879},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
util		util
.gitignore		.gitignore
AG.py		AG.py
AG_BLoad.py		AG_BLoad.py
readme.md		readme.md
test_loader.py		test_loader.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

util

util

.gitignore

.gitignore

AG.py

AG.py

AG_BLoad.py

AG_BLoad.py

readme.md

readme.md

test_loader.py

test_loader.py

Repository files navigation

BLoad: Enhancing Neural Network Training with Efficient Sequential Data Handling

Overview

Installation

AG (Action Genome) Dataset Class

AG_BLoad

Features

Usage

Running the Test Loader

Contributing

Citation

About

Releases

Packages

Languages

RRuschel/BLoad

Folders and files

Latest commit

History

Repository files navigation

BLoad: Enhancing Neural Network Training with Efficient Sequential Data Handling

Overview

Installation

AG (Action Genome) Dataset Class

AG_BLoad

Features

Usage

Running the Test Loader

Contributing

Citation

About

Resources

Stars

Watchers

Forks

Languages