ai-security

Here are 41 public repositories matching this topic...

Giskard-AI / giskard

🐢 Open-Source Evaluation & Testing for ML models & LLMs

Updated Oct 18, 2024
Python

jay-johnson / train-ai-with-django-swagger-jwt

Train AI (Keras + TensorFlow) to defend apps with Django REST Framework + Celery + Swagger + JWT; deploys to Kubernetes and OpenShift Container Platform

machine-learning jwt deep-neural-networks ai openshift tensorflow rest-api django-rest-framework swagger drf keras celery network-analysis network-security celery-tasks machine-learning-security ai-security anti-nex

Updated Nov 2, 2018
Python

LetterLiGo / SafeGen_CCS2024

[CCS'24] SafeGen: Mitigating Unsafe Content Generation in Text-to-Image Models

text-to-image ai-safety ai-security generative-ai thrustworthy-ai

Updated Oct 13, 2024
Python

RjDuan / AdvDrop

Code for "Adversarial attack by dropping information." (ICCV 2021)

pytorch adversarial-examples adversarial-attacks ai-security

Updated Jan 13, 2022
Python

elliothe / CVPR_2019_PNI

PyTorch implementation of Parametric Noise Injection for adversarial defense

ai-security adversarial-defense

Updated Oct 23, 2019
Python

normster / llm_rules

RuLES: a benchmark for evaluating rule-following in language models

ai-safety ai-security gpt-4

Updated Sep 30, 2024
Python

AnthenaMatrix / Image-Prompt-Injection

Image Prompt Injection is a Python script that demonstrates how to embed a secret prompt within an image using steganography techniques. The hidden prompt can later be extracted by an AI system for analysis, enabling covert communication with AI models through images.

ai cybersecurity ai-security prompt-engineering aisecurity prompt-injection prompt-injection-tool

Updated Mar 20, 2024
Python

reds-lab / Narcissus

The official implementation of the CCS'23 paper on the Narcissus clean-label backdoor attack, which takes only THREE images to poison a face recognition dataset in a clean-label way and achieves a 99.89% attack success rate.

adversarial-machine-learning adversarial-attacks ai-security backdoor-attacks deep- poisoning-attacks

Updated May 9, 2023
Python

mitre-atlas / atlas-data

ATLAS tactics, techniques, and case studies data

security machine-learning mitre-attack ai-security mitre-atlas

Updated Oct 2, 2024
Python

zhangzp9970 / MIA

Unofficial PyTorch implementation of the paper "Model Inversion Attacks that Exploit Confidence Information and Basic Countermeasures"

machine-learning research deep-learning ai-security model-inversion-attacks

Updated Oct 6, 2023
Python

AnthenaMatrix / Prompt-Injection-Testing-Tool

The Prompt Injection Testing Tool is a Python script designed to assess the security of your AI system's prompt handling against a predefined list of user prompts commonly used for injection attacks. This tool utilizes the OpenAI GPT-3.5 model to generate responses to system-user prompt pairs and outputs the results to a CSV file for analysis.

ai prompt openai ai-security openai-api prompt-learning prompt-engineering prompting prompt-injection prompt-injection-tool ai-cyber-security

Updated Mar 21, 2024
Python

reds-lab / Meta-Sift

The official implementation of the USENIX Security'23 paper "Meta-Sift": ten minutes or less to find a clean subset of size 1000 or larger in a poisoned dataset.

ai-security backdoor-attacks data-poisoning dataset-security

Updated Apr 27, 2023
Python

Hacking-Notes / VulnScan

Performing website vulnerability scanning using OpenAI technologies

hacking-tool vulnerability-scanners vulnerability-scanning ai-security chatgpt

Updated Apr 19, 2024
Python

HKU-TASR / Imperio

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.

ai-security backdoor-attacks llm

Updated Apr 17, 2024
Python

AI-Initiative-KAUST / VideoRLCS

Learning to Identify Critical States for Reinforcement Learning from Videos (Accepted to ICCV'23)

reinforcement-learning computer-vision deep-learning explainable-ai ai-security iccv2023

Updated Aug 19, 2023
Python

modzy / sdk-python

Python library for Modzy Machine Learning Operations (MLOps) Platform

python docker machine-learning microservices deployment api-client model-deployment model-serving serving explainable-ai production-machine-learning ai-security mlops kuberenetes drift-detection machine-learning-operations

Updated Sep 8, 2023
Python

pagiux / maleficnet

Neural networks, but malefic! 😈

security deep-neural-networks deep-learning malware-research ai-security

Updated Oct 5, 2024
Python

AiShieldsOrg / AiShieldsWeb

AiShields is an open-source Artificial Intelligence Data Input and Output Sanitizer

ai application-security appsec sensitive-data-security data-security ai-security aisec applicationsecurity llm prompt-engineering aisecurity llm-security llmsecurity llmsec prompt-injection-remediation model-denial-of-service-remediation insecure-output-handling-remediation overreliance-remediation prompt-engineering-security artificial-intelligence-security

Updated Jun 5, 2024
Python

moonwatcher-ai / moonwatcher

Evaluation & testing framework for computer vision models

computer-vision ai-safety ethical-artificial-intelligence ai-security mlops ml-safety ml-validation trustworthy-ai ml-testing

Updated Jun 20, 2024
Python

AashiqRamachandran / app-catcher

Discover and inventory the SaaS applications used across your organization by intelligently analyzing incoming Gmail emails, providing valuable insights into your SaaS landscape.

security ai saas ai-security saas-security

Updated May 21, 2024
Python
