Public version v1

alexandrosstergiou · Sep 18, 2019 · d4ff74a
1 parent 389b55e
commit d4ff74a
Showing 5 changed files with 72 additions and 15 deletions.
diff --git a/Class_Feature_Pyramid.png b/Class_Feature_Pyramid.png
diff --git a/README.md b/README.md
@@ -1,65 +1,122 @@
 ## Class Feature Pyramids
 
+![supported versions](https://img.shields.io/badge/python-2.7%2C%203.5-green.svg)
 [![GitHub license](https://img.shields.io/github/license/GKalliatakis/DisplaceNet.svg)](https://github.com/alexandrosstergiou/Class_Feature_Visualization_Pyramid/blob/master/LICENSE)
+![supported versions](https://img.shields.io/badge/library-PyTorch-blue)
 [![Tweet](https://img.shields.io/twitter/url/http/shields.io.svg?style=social)](https://twitter.com/intent/tweet?text=Class%20Feature%20Pyramids%20for%20Video%20Explanation&url=https://github.com/alexandrosstergiou/Class_Feature_Visualization_Pyramid&hashtags=PyTorch)
 
 
 --------------------------------------------------------------------------------
 ### Introduction
 <p align="justify">We introduce <i>Class Feature Pyramids</i>, a method that traverses an entire network structure and
- incrementally discovers kernels at different network depths that are informative for a specific class. 
- Our method does not depend on the network’s architecture or the type of 3D convolutions, supporting grouped and depth-wise convolu- tions, 
+ incrementally discovers kernels at different network depths that are informative for a specific class.
+ Our method does not depend on the network’s architecture or the type of 3D convolutions, supporting grouped and depth-wise convolutions,
  convolutions in fibers, and convolutions in branches.</p>
 
 <p align="center">
-  <img src="https://github.com/alexandrosstergiou/Class_Feature_Visualization_Pyramid/blob/master/Class_Feature_Pyramid.png?raw=true" width="300" />
+   <img src="images/CFVP.gif" width="250em" height="250em" alt="CFVP-active" style="margin-top: 2%" ondragstart="return false;">
 </p>
 
 <p align="center">
 <i>1<sup>st</sup> 2019 ICCV Workshop on <br> <a href="http://xai.unist.ac.kr/workshop/2019/" >Interpreting and Explaining Visual Artificial Intelligence Models</a> &nbsp;&nbsp;&nbsp;
 </i>
 <br>
-<a href="" target="_blank">[pdf]</a> 
+<a href="" target="_blank">[pdf]</a>
 </p>
 
 
 ### Dependencies
-* xxx
-* yyy
-* zzz
+Make sure that the following packages are installed on your machine:
+* OpenCV
+* Scipy
+* PyTorch
+Alternatively, they can be installed with:
+
+```
+$ pip install opencv-python scipy torch torchvision
+```
+
+We offer an additional frame-reading method based on a frame SQL database, for cases where the frames are stored in such a format (for smaller inode requirements and faster loading times).
+
+
+Models and weights used in the paper are based on the following repositories:
+
+* [Kensho Hara's original implementation of "Can Spatiotemporal 3D CNNs Retrace the History of 2D CNNs and ImageNet?"](https://github.com/kenshohara/3D-ResNets-PyTorch)
+* [Yunpeng Chen's original implementation of "Multi-Fiber Networks for Video Recognition"](https://github.com/cypw/PyTorch-MFNet)
+* [Yana Hasson's PyTorch implementation of "Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset"](https://github.com/hassony2/kinetics_i3d_pytorch)
 
 ### Installation
-xxxx
-yyyy 
-zzzz
 
+Please make sure Git is installed on your machine:
+```sh
+$ sudo apt-get update
+$ sudo apt-get install git
+$ git clone https://github.com/alexandrosstergiou/Class_Feature_Visualization_Pyramid.git
+```
 
 
 
 
 ### Getting started
 
+<p align="center">
+<img class="static" src="images/Backstep_compressed.png" width="500em" height="150em" alt="CFVP-static" style="margin-top: 2%" ondragstart="return false;">
+</p>
+
+To use Class Feature Pyramids, call the main file with the following parser arguments:
+```
+python main.py
+--num_classes [number of classes]
+--model_weights [The filepath of the .pth weight file]
+--frame_dir [directory of the video/clip]
+--frames_start [starting frame number]
+--frames_end [ending frame number]
+--label [class to backstep for]
+--threshold [numeric threshold for the kernel-based activations]
+--backprop_depth [backwards network depth to backstep to - if 1 only the predictions layer is used]
+--visualisation_method [defines the kernels to be visualised]
+```
+
+The network to be used as well as the number of GPUs are currently manually defined in the code (lines 513 & 517). However, they will be integrated in the parser soon.
+
+Apart from creating a folder containing layer and kernel based saliency tubes, a `JSON` file is also created that contains a full list of all connections across kernels and layers for the specific class and example chosen. This can be used in conjunction with a visualising tool such as [D3](https://github.com/d3/d3).
 
 ---
 
-### Results 
+### Results
+Example results for the biking class of HMDB-51 with different networks and network depths:
+
+<p align="center">
+   <img src="images/hmdb51_bike.gif" width="250em" height="450em" alt="CFVP-active" style="margin-top: 2%" ondragstart="return false;">
+</p>
+
+
 
 
 ### Performance of Class Feature Pyramids
 
+Running times were measured on a machine with 2 x Nvidia GTX 1080 Ti GPUs and an Intel i7-8700 CPU.
+
+| Network | GFLOPS | Back-step time (msec) | # layers | theta |
+|:-------------:|:---------:|:-----:|:----:|:----:|
+|Multi-FiberNet | 22.70 | 24.43 | 3 | 0.6 |
+|I3D | 55.79 | 23.21 | 1 + mixed5c | 0.65 |
+|ResNet50-3D | 80.32 | 21.39 | 3 | 0.55 |
+|ResNet101-3D| 110.98| 39.48 | 3 | 0.6 |
+|ResNet152-3D | 148.91| 31.06 | 3 | 0.6 |
+|ResNeXt101-3D | 76.96| 70.49 | 3 | 0.6 |
+
 
 ---
 
-### Citing DisplaceNet
+### Citing Class Feature Pyramids
 If you use our code in your research or wish to refer to the baseline results, please use the following BibTeX entry:
 
     @InProceedings{
-    
+
     }
 
 <p align="center">
   :octocat:  <br>
   <i>We use GitHub issues to track public bugs. Report a bug by <https://github.com/alexandrosstergiou/Class_Feature_Visualization_Pyramid/issues">opening a new issue.</a></i><br>
 </p>
-
-
diff --git a/images/Backstep_compressed.png b/images/Backstep_compressed.png
diff --git a/images/CFVP.gif b/images/CFVP.gif
diff --git a/images/hmdb51_bike.gif b/images/hmdb51_bike.gif
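
The "Getting started" section added to README.md lists the arguments passed to `main.py` but not their types or defaults. As a minimal sketch, the flags could be wired up with `argparse` as below. The flag names come from the README; the types, the defaults, the `build_parser` and `validate` helpers, and the validation rules are assumptions made for illustration and may differ from the repository's actual `main.py`.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the flags listed in the README."""
    parser = argparse.ArgumentParser(
        description="Class Feature Pyramids (illustrative sketch)")
    parser.add_argument("--num_classes", type=int, required=True,
                        help="number of classes")
    parser.add_argument("--model_weights", type=str, required=True,
                        help="filepath of the .pth weight file")
    parser.add_argument("--frame_dir", type=str, required=True,
                        help="directory of the video/clip")
    parser.add_argument("--frames_start", type=int, default=0,
                        help="starting frame number (default is an assumption)")
    parser.add_argument("--frames_end", type=int, required=True,
                        help="ending frame number")
    # Assumption: the class is given as an integer index.
    parser.add_argument("--label", type=int, required=True,
                        help="class to backstep for")
    parser.add_argument("--threshold", type=float, required=True,
                        help="numeric threshold for kernel-based activations")
    # The README states that a depth of 1 uses only the predictions layer.
    parser.add_argument("--backprop_depth", type=int, default=1,
                        help="backwards network depth to backstep to")
    # Assumption: free-form string; the accepted values are not documented.
    parser.add_argument("--visualisation_method", type=str, default=None,
                        help="defines the kernels to be visualised")
    return parser


def validate(args):
    """Sanity checks that are assumptions, not taken from the repository."""
    if args.frames_end < args.frames_start:
        raise ValueError("--frames_end must not be smaller than --frames_start")
    if args.backprop_depth < 1:
        raise ValueError("--backprop_depth must be at least 1")
    if not 0 <= args.label < args.num_classes:
        raise ValueError("--label must lie in [0, num_classes)")
    return args
```

Passing an explicit list, for example `build_parser().parse_args(["--num_classes", "51", "--model_weights", "weights.pth", "--frame_dir", "frames/", "--frames_end", "16", "--label", "3", "--threshold", "0.6"])`, makes it possible to exercise the parser without touching the command line; the paths and values in that call are placeholders.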