This project explores human action recognition using skeletal data. The focus was on training and evaluating two prominent graph convolutional networks, ST-GCN and MST-GCN, which process skeleton sequences to extract features and classify human actions. Our workflow started with extracting skeletal data from video inputs using MediaPipe; we then trained and fine-tuned the models for optimal performance. The project also provides a user-friendly interface that lets users upload a video and receive both the visualized skeletal data and the actions predicted by our trained models.
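As an illustration of the extraction step, the sketch below pulls per-frame pose landmarks from a video with MediaPipe and OpenCV. It is a minimal example under assumed conventions, not the project's actual preprocessing code; the function name, the frame cap, and the zero-filled fallback for frames with no detected person are illustrative choices.

```python
import cv2
import mediapipe as mp
import numpy as np

def extract_skeleton(video_path, max_frames=300):
    """Return an array of shape (T, 33, 3): per-frame (x, y, visibility)
    for the 33 MediaPipe pose landmarks."""
    pose = mp.solutions.pose.Pose(static_image_mode=False,
                                  min_detection_confidence=0.5)
    cap = cv2.VideoCapture(video_path)
    frames = []
    while cap.isOpened() and len(frames) < max_frames:
        ok, frame = cap.read()
        if not ok:
            break
        # MediaPipe expects RGB input; OpenCV reads frames as BGR.
        result = pose.process(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB))
        if result.pose_landmarks:
            frames.append([(lm.x, lm.y, lm.visibility)
                           for lm in result.pose_landmarks.landmark])
        else:
            # Keep the sequence length fixed when no person is detected.
            frames.append([(0.0, 0.0, 0.0)] * 33)
    cap.release()
    pose.close()
    return np.array(frames)
```

The resulting (frames, joints, channels) array can then be rearranged into the (C, T, V, M) layout that ST-GCN-style models typically expect as input.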
Skeletal data was extracted with the lightweight OpenPose pre-trained model and MediaPipe, providing a robust foundation for action recognition tasks. The GCN-based models were trained and tested on a skeleton dataset compiled from the Kinetics 400 video dataset, a large-scale, high-variety benchmark for action recognition algorithms. Due to computational limitations, we trained our models on a subset of Kinetics 400, focusing on 38 of the 400 classes. These classes represent a balanced mix of actions, including "archery," "high jump," and "playing ukulele," and this targeted selection demonstrates the robustness of the ST-GCN and MST-GCN networks across a diverse range of actions.
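To make the class-subset step concrete, the sketch below filters a skeleton label file down to the selected classes and remaps their label indices. The JSON layout (sample id mapped to a record with a "label" field), the file paths, and the class list are assumptions for illustration; only three of the 38 class names are shown.

```python
import json

# Illustrative subset; the project uses 38 Kinetics 400 classes in total.
SELECTED_CLASSES = ["archery", "high jump", "playing ukulele"]  # ... and 35 more

def filter_subset(label_json_path, out_path):
    """Keep samples whose class is in SELECTED_CLASSES and remap their
    label indices to the range 0..len(SELECTED_CLASSES)-1."""
    with open(label_json_path) as f:
        # Assumed format: {sample_id: {"label": class_name, ...}}
        labels = json.load(f)
    class_to_idx = {name: i for i, name in enumerate(sorted(SELECTED_CLASSES))}
    subset = {}
    for sample_id, info in labels.items():
        if info["label"] in class_to_idx:
            subset[sample_id] = dict(info, label_index=class_to_idx[info["label"]])
    with open(out_path, "w") as f:
        json.dump(subset, f)
    return subset
```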
Implementations of the models, the training process, and pose extraction are available in a separate repository here.
This thesis demonstrates the power of graph convolutional networks in interpreting and analyzing human skeletal movements. The comparative study of ST-GCN and MST-GCN under different hyperparameter settings offers insight into their respective capabilities and their applications in real-world scenarios.
For more information on the methodology, experimental setup, and detailed results, please consult the full thesis report (in Persian).