Skip to content

This is a group project for the course CS3501 - Data Science & Engineering Project. This project will focus on detecting human actions under challenging lighting conditions

Notifications You must be signed in to change notification settings

jasminaaa20/Human-Action-Recognition-in-the-dark

Repository files navigation

Action Recognition in the Dark using ARID Dataset

The Action Recognition in the Dark (ARID) dataset focuses on human action recognition in challenging lighting conditions, such as low-light and darkness. This project employs ARID v1.5 to train a model to detect human actions under these lighting conditions.

Dataset Overview

The ARID dataset consists of video clips featuring 11 distinct human actions performed in various scenes, both indoor and outdoor, under varying lighting conditions. More details about the ARID v1.5 dataset can be found here.

Preprocessing Steps

1. Frame Extraction from Videos

From each video, 20 frames are extracted to form a list of frames per video, serving as the primary dataset elements.

3. Histogram Equalization

To improve visibility, reduce noise, and standardize frames, histogram equalization is applied to each frame.

4. Key-point Detection using YOLOv8-pose

Key-point information is extracted from each frame using the YOLOv8-pose key-point detection model.

Model Architecture

The model architecture utilizes a Long-term Recurrent Convolutional Networks (LRCN) approach. The detailed code for the architecture can be found above.

Model Compilation and Training

The model is compiled using categorical cross-entropy loss, optimized with Adam optimizer. The training history is saved for plotting and evaluation purposes.

Model Evaluation

The model is evaluated using the test dataset, and a confusion matrix is generated to analyze its performance:

Results

The loss and accuracy curves of the model during training can be seen below:

Loss Curve

Loss Curve

Accuracy Curve

Accuracy Curve


Conclusion

This project has successfully built a model to detect and classify human actions in challenging lighting conditions using the ARID v1.5 dataset. The LRCN approach combined with strategic preprocessing steps has allowed for improved accuracy and performance.

About

This is a group project for the course CS3501 - Data Science & Engineering Project. This project will focus on detecting human actions under challenging lighting conditions

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Languages