Dataset of Human Eye Fixation over Crowd Videos


Details of the dataset and its analysis can be found in our arXiv preprint here.

Brief introduction

CrowdFix includes 434 videos of diverse crowd scenes, with a total of 37,493 frames and 1,249 seconds. The content covers different crowd activities under three distinct categories: Sparse, Dense Free-Flowing and Dense Congested. All videos are at 720p resolution and a 30 Hz frame rate.

An EyeTribe eye tracker was used to monitor eye movements. 26 participants (10 males and 16 females), aged 17 to 40, took part in the eye-tracking experiment. All were non-experts with normal or corrected-to-normal vision. During the experiment, the distance between the subject and the monitor was fixed at 60 cm. Before viewing the videos, each subject performed a 9-point calibration of the eye tracker. After calibration, the subjects free-viewed the videos, which were displayed in an MTV style. In total, fixations of all 26 subjects on the 434 videos were collected for the eye-tracking database.
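As a quick consistency check on the figures quoted above, the total duration and the average clip length can be derived from the frame count and frame rate. This is only a sketch: it assumes every clip runs at exactly 30 frames per second, and the variable names are illustrative.

```python
# Figures quoted in the README.
NUM_VIDEOS = 434
NUM_FRAMES = 37_493
FRAME_RATE_HZ = 30  # assumed constant across all clips

# Total duration follows from frames divided by frame rate.
total_seconds = NUM_FRAMES / FRAME_RATE_HZ

# Averages per clip.
mean_frames_per_video = NUM_FRAMES / NUM_VIDEOS
mean_seconds_per_video = total_seconds / NUM_VIDEOS

print(f"total duration: {total_seconds:.1f} s")
print(f"mean clip length: {mean_frames_per_video:.1f} frames, "
      f"{mean_seconds_per_video:.2f} s")
```

Under that assumption the total truncates to the stated 1,249 seconds, and the average clip is a little under three seconds long.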

Some stills from the dataset videos are shown in examples. A sample of the resulting eye-fixation heatmaps over a few videos is shown in sample.avi.

How to use

The dataset will be provided on request. Please contact /

  ┬ videos: 434 clips annotated with their category, 5GB 
  ├ Run without modifying folder hierarchy to generate the frames with the same name sequence as the corresponding ground truth binary fixation and saliency maps.
  ├ Binary Fixation Maps
  └ Saliency Maps
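Since the extracted frames are meant to share their name sequence with the ground-truth maps, it can be worth confirming that alignment before training or evaluation. The following is a minimal sketch, not part of the dataset tooling: the function name and the folder arguments are hypothetical, and it assumes frames and maps for one video each sit in their own flat folder and are matched by file name without extension.

```python
from pathlib import Path


def compare_name_sequences(frames_dir, maps_dir):
    """Compare file stems (names without extension) in two folders.

    Returns a pair of sorted lists:
      - stems present among the frames but missing among the maps
      - stems present among the maps but missing among the frames
    """
    frame_stems = {p.stem for p in Path(frames_dir).iterdir() if p.is_file()}
    map_stems = {p.stem for p in Path(maps_dir).iterdir() if p.is_file()}
    return sorted(frame_stems - map_stems), sorted(map_stems - frame_stems)
```

Two empty lists mean every frame has a matching map and vice versa. The same check can be run once against the binary fixation maps and once against the saliency maps.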

'CategoryInfo.xlsx' lists the crowd category of each video. It contains the following columns:

  • 'column 1': Video number, as given in the uploaded video folder.
  • 'column 2': Crowd category type (SP for Sparse, DF for Dense Free-Flowing and DC for Dense Congested).
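The two columns described above can be turned into per-category video lists with a few lines of Python. This is a sketch under assumptions: the helper names are hypothetical, the loader assumes pandas (with an Excel engine such as openpyxl) is installed, and it assumes the first two columns of the first sheet hold the video number and the category code. Whether the sheet has a header row is not known, so rows without a recognised code are simply skipped.

```python
# Category codes as described in the README.
CATEGORY_NAMES = {
    "SP": "Sparse",
    "DF": "Dense Free-Flowing",
    "DC": "Dense Congested",
}


def load_category_rows(path="CategoryInfo.xlsx"):
    """Read (video number, category code) pairs from the spreadsheet.

    Assumption: the first two columns hold the video number and the code.
    """
    import pandas as pd  # imported lazily so group_by_category works without it

    df = pd.read_excel(path, header=None, usecols=[0, 1])
    return list(df.itertuples(index=False, name=None))


def group_by_category(rows):
    """Map each known category code to the video numbers that carry it.

    Rows whose second field is not a known code (for example a header row)
    are skipped.
    """
    groups = {code: [] for code in CATEGORY_NAMES}
    for video, code in rows:
        code = str(code).strip().upper()
        if code in groups:
            groups[code].append(video)
    return groups
```

For example, `group_by_category(load_category_rows())["DC"]` would give the video numbers labelled Dense Congested, which is convenient for per-category evaluation.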


You are welcome to use this database freely; please cite it with the following BibTeX entry:

    title={CrowdFix: An Eyetracking Dataset of Real Life Crowd Videos},
    author={Memoona Tahira and Sobas Mehboob and Anis U. Rahman and Omar Arif},