Branching off from issue #39, which @aflah02 opened a few weeks ago:
Is the KerasNLP team interested in implementing adversarial attacks? We could start off with simple attacks on classification models.
I understand if this is a bit broad, and the team may want to integrate it into the repository later, especially because we may need some augmentation APIs first. For example, some adversarial attacks may want to perturb only those words which the model assigns a higher importance score. For the perturbation step, we can leverage the augmentation APIs.
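To make the idea concrete, here is a minimal sketch of importance-guided perturbation. Everything in it is hypothetical: `score_fn` stands in for a trained classifier's confidence in the true label, and `replace_fn` stands in for whatever augmentation API eventually handles word replacement. Word importance is estimated by simple leave-one-out deletion, which is one common choice (TextAttack supports several others).

```python
# Hypothetical sketch, not a KerasNLP API.
# score_fn(words) -> float: a stand-in for the model's confidence in the true class.
# replace_fn(word) -> str: a stand-in for an augmentation API (e.g. synonym swap).

def word_importance(words, score_fn):
    """Leave-one-out importance: how much the score drops when a word is removed."""
    base = score_fn(words)
    return [base - score_fn(words[:i] + words[i + 1:]) for i in range(len(words))]

def perturb_top_k(words, score_fn, replace_fn, k=1):
    """Replace only the k most important words, leaving the rest untouched."""
    scores = word_importance(words, score_fn)
    top = set(sorted(range(len(words)), key=lambda i: scores[i], reverse=True)[:k])
    return [replace_fn(w) if i in top else w for i, w in enumerate(words)]
```

With a toy `score_fn` that just counts sentiment-bearing words, `perturb_top_k(["this", "movie", "is", "good"], score_fn, replace_fn, k=1)` would target "good" and leave the filler words alone. A real attack would iterate this until the model's prediction flips.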
@abheesht17 Thanks for opening this feature request!
Yes, having an adversarial attack system would be nice for model evaluation. Our current problem is that we do not have pretrained models available. When you start working on this, would you mind sharing a Colab so that we can do some early reviews of the interface? Thanks!
A good resource is https://github.com/QData/TextAttack.