-
Notifications
You must be signed in to change notification settings - Fork 1.2k
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
[Feature] UCF101-24 preparation (#219)
* init docs * update * update changelog * Update changelog.md Co-authored-by: lizz <innerlee@users.noreply.github.com>
- Loading branch information
1 parent
ac41cc0
commit a182997
Showing
3 changed files
with
73 additions
and
1 deletion.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,70 @@ | ||
# Preparing UCF101-24 | ||
|
||
For basic dataset information, you can refer to the dataset [website](http://www.thumos.info/download.html). | ||
Before we start, please make sure that the directory is located at `$MMACTION2/tools/data/ucf101_24/`. | ||
|
||
## Download and Extract | ||
|
||
You can download the RGB frames, optical flow and ground truth annotations from [google drive](https://drive.google.com/drive/folders/1BvGywlAGrACEqRyfYbz3wzlVV3cDFkct). | ||
The data are provided from [MOC](https://github.com/MCG-NJU/MOC-Detector/blob/master/readme/Dataset.md), which is adapted from [act-detector](https://github.com/vkalogeiton/caffe/tree/act-detector) and [corrected-UCF101-Annots](https://github.com/gurkirt/corrected-UCF101-Annots). | ||
|
||
**Note**: The annotation of this UCF101-24 is from [here](https://github.com/gurkirt/corrected-UCF101-Annots), which is more correct. | ||
|
||
After downloading the `UCF101_v2.tar.gz` file and put it in `$MMACTION2/tools/data/ucf101_24/`, you can run the following command to extract. | ||
|
||
```shell script | ||
tar -zxvf UCF101_v2.tar.gz | ||
``` | ||
|
||
## Check Directory Structure | ||
|
||
After extracting, you will get the `rgb-images` directory, `brox-images` directory and `UCF101v2-GT.pkl` for UCF101-24. | ||
|
||
In the context of the whole project (for UCF101-24 only), the folder structure will look like: | ||
|
||
``` | ||
mmaction2 | ||
├── mmaction | ||
├── tools | ||
├── configs | ||
├── data | ||
│ ├── ucf101_24 | ||
│ | ├── brox-images | ||
│ | | ├── Basketball | ||
│ | | | ├── v_Basketball_g01_c01 | ||
│ | | | | ├── 00001.jpg | ||
│ | | | | ├── 00002.jpg | ||
│ | | | | ├── ... | ||
│ | | | | ├── 00140.jpg | ||
│ | | | | ├── 00141.jpg | ||
│ | | ├── ... | ||
│ | | ├── WalkingWithDog | ||
│ | | | ├── v_WalkingWithDog_g01_c01 | ||
│ | | | ├── ... | ||
│ | | | ├── v_WalkingWithDog_g25_c04 | ||
│ | ├── rgb-images | ||
│ | | ├── Basketball | ||
│ | | | ├── v_Basketball_g01_c01 | ||
│ | | | | ├── 00001.jpg | ||
│ | | | | ├── 00002.jpg | ||
│ | | | | ├── ... | ||
│ | | | | ├── 00140.jpg | ||
│ | | | | ├── 00141.jpg | ||
│ | | ├── ... | ||
│ | | ├── WalkingWithDog | ||
│ | | | ├── v_WalkingWithDog_g01_c01 | ||
│ | | | ├── ... | ||
│ | | | ├── v_WalkingWithDog_g25_c04 | ||
│ | ├── UCF101v2-GT.pkl | ||
``` | ||
|
||
**Note**: The `UCF101v2-GT.pkl` exists as a cache, it contains 6 items as follows: | ||
1. `labels` (list): List of the 24 labels. | ||
2. `gttubes` (dict): Dictionary that contains the ground truth tubes for each video. | ||
A **gttube** is dictionary that associates with each index of label and a list of tubes. | ||
A **tube** is a numpy array with `nframes` rows and 5 columns, each col is in format like `<frame index> <x1> <y1> <x2> <y2>`. | ||
3. `nframes` (dict): Dictionary that contains the number of frames for each video, like `'HorseRiding/v_HorseRiding_g05_c02': 151`. | ||
4. `train_videos` (list): A list with `nsplits=1` elements, each one containing the list of training videos. | ||
5. `test_videos` (list): A list with `nsplits=1` elements, each one containing the list of testing videos. | ||
6. `resolution` (dict): Dictionary that outputs a tuple (h,w) of the resolution for each video, like `'FloorGymnastics/v_FloorGymnastics_g09_c03': (240, 320)`. |