Implementation of CVPR 2018 paper Crowd Counting via Adversarial Cross-Scale
We used the following enviroment:
- Python 3
- PyTorch
- OpenCV
- Numpy
- MatPlotLib
- Ubuntu 16.04
You can also run the code using the docker image of ufoym/deepo.
We make available UCF-CC-50 and Shanghai Tech datasets here, download and unzip it into the root of the repo. Directories should have the following hierarchy:
ROOT_OF_REPO
data
ucf_cc_50
UCF_CC_50
images
labels
ShanghaiTech
part_A
train_data
images
ground-truth
test_data
images
ground-truth
part_B
train_data
images
ground-truth
test_data
images
ground-truth
The code was developed such that data augmentation is computed before every other step and the results are stored in the hard drive. Thus, the first time you run the code it will take quite a long time. Augmented data is stored with the following hierarchy:
ROOT_OF_REPO
data
ucf_cc_50
people_thr_0_gt_mode_same
ShanghaiTech
part_A
people_thr_0_gt_mode_same
part_B
people_thr_0_gt_mode_same
To train using UCF-CC-50 (with all folds) and save the results log in log/ACSCP
you can run:
python3 train.py -d ucf-cc-50 --gt-mode same --people-thr 0 --train-batch 24 --save-dir log/ACSCP
In case you want to run a specific fold or part you can use flag --units
, check the Makefile
for more examples.
The training log is stored in log_train.txt
inside the corresponding log/fold/part directory.
After training you can re-load the trained weights (using flag --resume
) and use them for testing:
python3 train.py -d ucf-cc-50 --save-dir log/multi-stream --resume log/ACSCP/ucf-cc-50_people_thr_0_gt_mode_same --evaluate-only
The testing log is stored in log_test.txt
inside the corresponding log/fold/part directory. You can also generate the plots of the predictions using flag --save-plots
, results are stored in the directory plot-results-test
inside the corresponding log/fold/part directory.
- Results for UCF_CC_50 with this code are MAE 281,73 MSE 415,56 (--people-thr 20). Reported results by the authors are MAE 291.0 MSE 404.6.
- Validation for other dataset may be done in the future.
- Batch normalization is not used because of inestable learning.
- Ground thruth is generated using a gaussian of fixed size.
- Number of channels of autoencoder in the middle layer is changed to 3, instead of 4.
- Network receives images of 1 channel, instead of 3.
- You can use the flag
--overlap-test
to overlap the sliding windows used for testing (as implemented by the authors).