We found that MSR-VTT dataset contains a lot of noisy annotations. After analyzing the data carefully, we put some efforts on cleaning the annotations. We retrained some models on the cleaned dataset and found experimental results improved compared to the previous models.
- Python 3
- Jupyter Notebook
- TensorFlow 1.13
clean_process
is the folder for cleaning MSR-VTT dataset.
msrvtt_model
is the folder for training a new model.