Skip to content

jaekyu-sim/Convolutional-Recurrent-Pose-Machine

Repository files navigation

Convolutional-Recurrent-Pose-Machine

This project is for detecting human body in any situation. I planned this project because CPM(Convolutional Pose Machine) can't detect human body when human is behind abstacle.

abstacle image2 abstacle image3

These above images are human images who is dancing with white rectangular abstacle.(opencv rectangular)

When this image is put in CPM Network, the output is not correct because CPM only can detect human body on one image.

So, I suggest network that has recurrent shape.

paper is here!

network

I applies recurrent network to CPM to detect skeleton position behind abstacle.

The output is here.

output

I put a video to test the performance of my network comparing to CPM.

Input video is consisted with 30 frame size. 1st~13rd frames are not applied abstacle, and 14th~30th frames are applied abstacle.

I'll compare performance by 15th frame images(abstacle image).

The (a) is images which doesn't apply any abstacle. I put (a) to CPM, and I got (b). The output(b) is very clear.

The (c) is images which apply abstacle to head. I put (c) to CPM, and I got (d).

The output(d) is not clear comparing to (b). There is ambigous head joint. But, the output(e) is clear comparing to (d) and it is almost same comparing to (b). Because, CPM can detect skeleton data in one images, not determining previous image. If the network determines previous image, network can detect human body skeleton data.


file description

This file is for inspecting the training data. If youtube data is put in CPM to make annotation data, there are some unnormal case, So I make Data_Inspection.ipynb to inspect dataset.

  1. Data_Inspection.ipynb

Firstly, I made CMU network which is handling image input. and I made several demo files to detect when putting image and video.

  1. Demo_CMU_Network.ipynb
  2. Demo_CMU_Network_CAM&VIDEO.ipynb
  3. Training_CMU.ipynb
  4. Training_CMU_30.ipynb

Secondly, I made Recurrent network which is handlign video input, and I made several demo files to detect when putting video.

  1. Demo_RNN.ipynb
  2. Training_rnn_network.ipynb
  3. Training_rnn_network_3d.ipynb

Thirdly, I made annotation data in Youtube data and MPII dataset.

  1. Make_Annotation_From_Image.ipynb
  2. Make_Annotation_From_Video.ipynb
  3. make_video_60_frame.ipynb

About

Pose machine with recurrent network model.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published