You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
First, thanks for the great codebase - everything is neatly organized in the source files - a nice deviation compared to what AI codebases from papers usually look like :)
Some questions:
Train and Predict
Run the training and prediction script. It will, first, train the captioning model and, then, evaluate the predictions of the best model in the learned proposal setting.
From your comments on training, it is clear that the captioning module is trained first (on GT proposals?). However, it is not very clear when the proposal module is trained. Is the training end-to-end as in Zhuo et al. [59] where both modules are trained in unison (where the captioning module is able to influence the event proposal mechanism)? Can you explain this sequence clearly (maybe for the sake for everyone by updating the readme?). Thanks!
The text was updated successfully, but these errors were encountered:
We train only the captioning module (yes, on GT proposals)
We use the predictions (./data/bafcg_val_100_proposal_result.csv – we contacted the authors for it) from the model proposed in Wang et. al 2018 which wasn't finetuned anyhow
Therefore, the sequence of training:
Training the captioning module on GT proposals (it will evaluate the model on GT automatically)
Evaluate the captioning module with the proposals from Wang et. al 2018
Hi Vladimir,
First, thanks for the great codebase - everything is neatly organized in the source files - a nice deviation compared to what AI codebases from papers usually look like :)
Some questions:
From your comments on training, it is clear that the captioning module is trained first (on GT proposals?). However, it is not very clear when the proposal module is trained. Is the training end-to-end as in Zhuo et al. [59] where both modules are trained in unison (where the captioning module is able to influence the event proposal mechanism)? Can you explain this sequence clearly (maybe for the sake for everyone by updating the readme?). Thanks!
The text was updated successfully, but these errors were encountered: