ai-security

Star

Here are 44 public repositories matching this topic...

Giskard-AI / giskard

Sponsor

Star

🐢 Open-Source Evaluation & Testing for ML & LLM systems

Updated Nov 18, 2024
Python

normster / llm_rules

Star

RuLES: a benchmark for evaluating rule-following in language models

ai-safety ai-security gpt-4

Updated Sep 30, 2024
Python

LetterLiGo / SafeGen_CCS2024

Star

[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

text-to-image ai-safety ai-security generative-ai thrustworthy-ai

Updated Oct 13, 2024
Python

The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks deep- poisoning-attacks

Updated May 9, 2023
Python

RjDuan / AdvDrop

Star

Code for "Adversarial attack by dropping information." (ICCV 2021)

pytorch adversarial-examples adversarial-attacks ai-security

Updated Jan 13, 2022
Python

jay-johnson / train-ai-with-django-swagger-jwt

Star

Train AI (Keras + Tensorflow) to defend apps with Django REST Framework + Celery + Swagger + JWT - deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

Hacking-Notes / VulnScan

Star

Performing website vulnerability scanning using OpenAI technologie

hacking-tool vulnerability-scanners vulnerability-scanning ai-security chatgpt

Updated Apr 19, 2024
Python

mitre-atlas / atlas-data

Star

ATLAS tactics, techniques, and case studies data

security machine-learning mitre-attack ai-security mitre-atlas

Updated Oct 2, 2024
Python

elliothe / CVPR_2019_PNI

Star

pytorch implementation of Parametric Noise Injection for adversarial defense

ai-security adversarial-defense

Updated Oct 23, 2019
Python

HKU-TASR / Imperio

Star

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.

ai-security backdoor-attacks llm

Updated Apr 17, 2024
Python

LetterLiGo / Inaudible-Adversarial-Perturbation-Vrifle

Star

[NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time

artificial-intelligence iot-security ai-security

Updated Sep 28, 2024
Python

wssun / TiSE-CodeLM-Security

Star

This repository provide the studies on the security of language models for code (CodeLMs).

security language-model adversarial-attacks ai-security code-intelligence backdoor-attacks adversarial-defense backdoor-defense ai4se lm4se lm4code

Updated Oct 25, 2024
Python

AnthenaMatrix / Prompt-Injection-Testing-Tool

Star

The Prompt Injection Testing Tool is a Python script designed to assess the security of your AI system's prompt handling against a predefined list of user prompts commonly used for injection attacks. This tool utilizes the OpenAI GPT-3.5 model to generate responses to system-user prompt pairs and outputs the results to a CSV file for analysis.

ai prompt openai ai-security openai-api prompt-learning prompt-engineering prompting prompt-injection prompt-injection-tool ai-cyber-security

Updated Mar 21, 2024
Python

AI-Initiative-KAUST / VideoRLCS

Star

Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)

reinforcement-learning computer-vision deep-learning explainable-ai ai-security iccv2023

Updated Aug 19, 2023
Python

zhangzp9970 / MIA

Star

Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

machine-learning research deep-learning ai-security model-inversion-attacks

Updated Oct 6, 2023
Python

modzy / sdk-python

Star

Python library for Modzy Machine Learning Operations (MLOps) Platform

python docker machine-learning microservices deployment api-client model-deployment model-serving serving explainable-ai production-machine-learning ai-security mlops kuberenetes drift-detection machine-learning-operations

Updated Sep 8, 2023
Python

AnthenaMatrix / Image-Prompt-Injection

Star

Image Prompt Injection is a Python script that demonstrates how to embed a secret prompt within an image using steganography techniques. This hidden prompt can be later extracted by an AI system for analysis, enabling covert communication with AI models through images.

ai cybersecurity ai-security prompt-engineering aisecurity prompt-injection prompt-injection-tool

Updated Mar 20, 2024
Python

reds-lab / Meta-Sift

Star

The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on poisoned dataset.

ai-security backdoor-attacks data-poisoning dataset-security

Updated Apr 27, 2023
Python

moonwatcher-ai / moonwatcher

Star

Evaluation & testing framework for computer vision models

computer-vision ai-safety ethical-artificial-intelligence ai-security mlops ml-safety ml-validation trustworthy-ai ml-testing

Updated Jun 20, 2024
Python

jay-johnson / antinex-datasets

Star

Datasets for training deep neural networks to defend software applications

Updated Jun 4, 2018
Python

Improve this page

Add a description, image, and links to the ai-security topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the ai-security topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ai-security

Here are 44 public repositories matching this topic...

Giskard-AI / giskard

normster / llm_rules

LetterLiGo / SafeGen_CCS2024

reds-lab / Narcissus

RjDuan / AdvDrop

jay-johnson / train-ai-with-django-swagger-jwt

Hacking-Notes / VulnScan

mitre-atlas / atlas-data

elliothe / CVPR_2019_PNI

HKU-TASR / Imperio

LetterLiGo / Inaudible-Adversarial-Perturbation-Vrifle

wssun / TiSE-CodeLM-Security

AnthenaMatrix / Prompt-Injection-Testing-Tool

AI-Initiative-KAUST / VideoRLCS

zhangzp9970 / MIA

modzy / sdk-python

AnthenaMatrix / Image-Prompt-Injection

reds-lab / Meta-Sift

moonwatcher-ai / moonwatcher

jay-johnson / antinex-datasets

Improve this page

Add this topic to your repo