Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adversarial Attacks and Testing the Robustness of Models #95

Closed
abheesht17 opened this issue Apr 7, 2022 · 3 comments
Closed

Adversarial Attacks and Testing the Robustness of Models #95

abheesht17 opened this issue Apr 7, 2022 · 3 comments

Comments

@abheesht17
Copy link
Collaborator

abheesht17 commented Apr 7, 2022

Branching off from the issue which @aflah02 opened a few weeks ago, #39:

Is the KerasNLP team interested in implementing adversarial attacks? We could start off with simple attacks on classification models.

I understand if this is a bit broad, and the team may want to integrate it later to the repository, especially because we may need some augmentation APIs. For example, some adversarial attacks may want to perturb only those words which are assigned a higher importance score by the model. For perturbation, we can leverage the augmentation APIs.

A good resource is https://github.com/QData/TextAttack.

@chenmoneygithub
Copy link
Contributor

@abheesht17 Thanks for opening this feature request!

Yes, having an adversarial attack system would be nice for model evaluation. Our current problem is that we do not have pretrained model available. When you start working on this, would you mind sharing a colab so that we can do some early reviews on the interface? Thanks!

@abheesht17
Copy link
Collaborator Author

Sure, @chenmoneygithub! Will do. Waiting for some augmentation methods to be implemented before starting adversarial attacks.

@mattdangerw
Copy link
Member

This probably does not quite fit with our current priorities.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants