You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This project builds a video classification model using CNNs for spatial feature extraction and RNNs for temporal sequence modeling. Utilizing the UCF101 dataset, it covers data preprocessing, feature extraction, model training, and evaluation, providing a comprehensive approach to action recognition in videos.
My experimentation around action recognition in videos. Contains Keras implementation for C3D network based on original paper "Learning Spatiotemporal Features with 3D Convolutional Networks", Tran et al. and it includes video processing pipelines coded using mPyPl package. Model is being benchmarked on popular UCF101 dataset and achieves result…
Video classification exercise using UCF101 data for training an early-fusion and SlowFast architecture model, both using the PyTorch Lightning framework.