Skip to content

SEEDGuard/SEEDAuditor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

14 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

SEEDAuditor

Welcome to SEEDAuditor, a pivotal component of the SEEDGuard.AI initiative. This project is dedicated to enhancing the security and integrity of data for AI models and corresponding datasets in software engineering. Basically this repository is for Auditing software engineering data.

Project Overview

SEEDAuditor is an open-source effort under the broader umbrella of SEEDGuard.AI, aimed at revolutionizing AI for Software Engineering with a keen focus on data security. Our mission is to safeguard AI models against data poisoning and backdoor threats, ensuring the development of trustworthy AI systems.

Key Features

  • Robust Security: Implementation of robust defenses against poison attacks and backdooring threats to datasets.
  • Scalable Infrastructure: Development of a scalable system infrastructure to support the growing needs of the AI for SE/Code domain.

Goals

  1. Enhancing System Fault Tolerance: Focusing on data security to protect datasets from poison attacks and ensure the integrity of the data.
  2. Optimizing Model Performance: Implementing retraining protocols to enhance the resilience of SEEDGuard against adversarial attacks.
  3. User-Friendly Functionality: Ensuring that the project's infrastructure and APIs are aligned with the objectives of SEEDGuard.AI, facilitating easy access and interaction for researchers and developers.

Getting Started

reference for docker commands : docker build -t spectralsignature:1.1 . docker run -it -v test_path --input_dir input_path --output_dir output_path --method method_name

Related Works

Paper Id Year-Id Title Venue Replication Package If Integrated? Approved By Author? Current Contributors
1 2023-1 Backdooring Neural Code Search ACL link pvinoda
2 2023-2 MalWuKong: Towards Fast, Accurate, and Multilingual Detection of Malicious Code Poisoning in OSS Supply Chains. ASE
3 2023 Multi-target Backdoor Attacks for Code Pre-trained Models ACL link
4 2022 CoProtector: Protect Open-Source Code against Unauthorized Training Usage with Data Poisoning WWW link
5 2022 You See What I Want You to See: Poisoning Vulnerabilities in Neural Code Search FSE link
6 2020 You Autocomplete Me: Poisoning Vulnerabilities in Neural Code Completion USENIX
7 2023 Stealthy Backdoor Attack for Code Models TSE link
8 2021 DeepPayload: Black-box Backdoor Attack on Deep Learning Models through Neural Payload Injection ICSE
9 2021 AdvDoor: adversarial backdoor attack of deep learning system ISSTA

Contributing

SEEDAuditor thrives on community contributions. Whether you're interested in enhancing its security features, expanding the API, or improving the current functionality, your contributions are welcome. Please refer to our contribution guideline at CONTRIBUTING.md for more information on how to contribute. Also refer to our Docker template if you are coming up with new Methods for the task.

Miscallaneous File Links

rb_function.txt {THE INPUT FILE }: https://drive.google.com/file/d/1PZmH6NthgX9hXM5DXynJSMdO9gJ1oWlh/view?usp=drive_link

Contents to be placed in the output directory before running: https://drive.google.com/drive/folders/1425f_JM0iX7QBUdRf1j-bzOdqpOWpXEj

About

Auditing software engineering data.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published