ai-security

Star

Here are 33 public repositories matching this topic...

Giskard-AI / giskard

Sponsor

Star

🐢 Open-Source Evaluation & Testing for LLMs and ML models

Updated Jun 30, 2024
Python

normster / llm_rules

Star

RuLES: a benchmark for evaluating rule-following in language models

ai-safety ai-security gpt-4

Updated Jun 21, 2024
Python

The official implementation of the CCS'23 paper, Narcissus clean-label backdoor attack -- only takes THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks deep- poisoning-attacks

Updated May 9, 2023
Python

RjDuan / AdvDrop

Star

Code for "Adversarial attack by dropping information." (ICCV 2021)

pytorch adversarial-examples adversarial-attacks ai-security

Updated Jan 13, 2022
Python

jay-johnson / train-ai-with-django-swagger-jwt

Star

Train AI (Keras + Tensorflow) to defend apps with Django REST Framework + Celery + Swagger + JWT - deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

Hacking-Notes / VulnScan

Star

Performing website vulnerability scanning using OpenAI technologie

hacking-tool vulnerability-scanners vulnerability-scanning ai-security chatgpt

Updated Apr 19, 2024
Python

elliothe / CVPR_2019_PNI

Star

pytorch implementation of Parametric Noise Injection for adversarial defense

ai-security adversarial-defense

Updated Oct 23, 2019
Python

HKU-TASR / Imperio

Star

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.

ai-security backdoor-attacks llm

Updated Apr 17, 2024
Python

mitre-atlas / atlas-data

Star

ATLAS tactics, techniques, and case studies data

security machine-learning mitre-attack ai-security mitre-atlas

Updated Apr 29, 2024
Python

AI-Initiative-KAUST / VideoRLCS

Star

Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)

reinforcement-learning computer-vision deep-learning explainable-ai ai-security iccv2023

Updated Aug 19, 2023
Python

modzy / sdk-python

Star

Python library for Modzy Machine Learning Operations (MLOps) Platform

python docker machine-learning microservices deployment api-client model-deployment model-serving serving explainable-ai production-machine-learning ai-security mlops kuberenetes drift-detection machine-learning-operations

Updated Sep 8, 2023
Python

zhangzp9970 / MIA

Star

Unofficial pytorch implementation of paper: Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures

machine-learning research deep-learning ai-security model-inversion-attacks

Updated Oct 6, 2023
Python

AnthenaMatrix / Prompt-Injection-Testing-Tool

Star

The Prompt Injection Testing Tool is a Python script designed to assess the security of your AI system's prompt handling against a predefined list of user prompts commonly used for injection attacks. This tool utilizes the OpenAI GPT-3.5 model to generate responses to system-user prompt pairs and outputs the results to a CSV file for analysis.

ai prompt openai ai-security openai-api prompt-learning prompt-engineering prompting prompt-injection prompt-injection-tool ai-cyber-security

Updated Mar 21, 2024
Python

AnthenaMatrix / Image-Prompt-Injection

Star

Image Prompt Injection is a Python script that demonstrates how to embed a secret prompt within an image using steganography techniques. This hidden prompt can be later extracted by an AI system for analysis, enabling covert communication with AI models through images.

ai cybersecurity ai-security prompt-engineering aisecurity prompt-injection prompt-injection-tool

Updated Mar 20, 2024
Python

reds-lab / Meta-Sift

Star

The official implementation of USENIX Security'23 paper "Meta-Sift" -- Ten minutes or less to find a 1000-size or larger clean subset on poisoned dataset.

ai-security backdoor-attacks data-poisoning dataset-security