# llm-security

Here are 3 public repositories matching this topic...

levitation-opensource / Manipulative-Expression-Recognition

MER is software that identifies and highlights manipulative communication in text from human conversations and AI-generated responses. It benchmarks language models for manipulative expressions, supporting the development of transparent and safe AI. It also helps victims of manipulation by detecting manipulative patterns in human communication.

benchmarking sentiment-analysis manipulation transparency fraud-prevention human-computer-interaction human-robot-interaction expression-recognition sentiment-classification fraud-detection psychometrics misinformation conversation-analysis conversation-analytics llm prompt-engineering prompt-injection llm-security llm-training llm-test

Updated Aug 3, 2024
HTML

llm-platform-security / chatgpt-plugin-eval

LLM Platform Security: Applying a Systematic Evaluation Framework to OpenAI's ChatGPT Plugins

openai llm chatgpt chatgpt-plugins llm-security llm-privacy llm-platform llm-platform-security

Updated Jul 29, 2024
HTML

yevh / TaaC-AI

AI-driven Threat modeling-as-a-Code (TaaC-AI)

ai threat application-security gpt threat-modeling secure-development devsecops threat-modeling-tool threat-models threat-modeling-from-code taac gpt-3 gpt-4 llm-security mistral-7b claude-3

Updated Jun 7, 2024
HTML
