Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We鈥檒l occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrated Gradients for Frame-Level Classification Tasks #1282

Open
david-gimeno opened this issue May 16, 2024 · 0 comments
Open

Integrated Gradients for Frame-Level Classification Tasks #1282

david-gimeno opened this issue May 16, 2024 · 0 comments

Comments

@david-gimeno
Copy link

david-gimeno commented May 16, 2024

馃殌 Feature

Integrated Gradients not only for single classification task, but also for those that are based on frame-level classification, such as speech recognition, active speaker detection, etc.

Additional context

In the case of automatic lipreading, we have a tensor of shape (T, H, W, C), where T, H, W, and C refer to the time (no. of frames), height of the image, width of the image and channels of the image, respectively. I would like to apply Integrated Gradients so the targets of the model can be a sequence, i.e., an integer for each frame of the sequence. Each integer would represent the corresponding text token composing the final transcription.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant