[AAAI 2023] An official PyTorch implementation of the paper 'READ: Aggregating Reconstruction Error into Out-of-distribution Detection' (a minimal reconstruction-error sketch follows this list)
Unofficial implementation of paper "Flexibly Fair Representation Learning by Disentanglement"
Python package to accelerate research on generalized out-of-distribution (OOD) detection.
Scripts to process the reference framework into an object
Initiating a paradigm shift in reporting and helping make ML advances more considerate of sustainability and trustworthiness.
[AAAI'23 Paper] A machine learning defense for auditors of black-box automated decision-making systems.
An open-source web platform for assessing Responsible and Trustworthy AI maturity levels.
Optimization-based deep learning models can provide explainability with output guarantees and certificates of trustworthiness.
ObscurePrompt: Jailbreaking Large Language Models via Obscure Input
Neural Additive Models - Visualization Tool in PyTorch/Plotly-Dash
This repository presents a novel approach, developed by Prof. Patrick Cheridito and me, for computing conditional expectations with numerical guarantees.
[TIV, 2022] Robust Lane Change Decision Making for Autonomous Vehicles: An Observation Adversarial Reinforcement Learning Approach
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models. ECCV 2024
Code for the Paper "A Functional Data Perspective and Baseline on Multi-Layer Out-of-Distribution Detection"
Venomancer: Towards Imperceptible and Target-on-Demand Backdoor Attacks in Federated Learning
COMBAT: Alternated Training for Effective Clean-Label Backdoor Attack (AAAI 2024)
Code for paper "FreezeAsGuard: Mitigating Illegal Adaptation of Diffusion Models via Selective Tensor Freezing"
A custom framework designed to analyze and assess Large Language Models (LLMs) for trustworthiness, with a specific focus on detecting excessive agency. This project aims to determine whether a language model is assuming more capabilities or authority than it should.
Breaking the Trilemma of Privacy, Utility, Efficiency via Controllable Machine Unlearning
Multi-omics Trustworthy Integration Framework (MoTIF)
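For context on the headline READ repository, below is a minimal sketch of the general reconstruction-error idea behind such methods: score each input by how poorly an autoencoder reconstructs it, and flag high-error inputs as out-of-distribution. The `TinyAutoencoder`, `ood_score` helper, and threshold value are assumptions made for illustration, not READ's actual architecture or scoring rule; see the repository for the paper's method.

```python
import torch
import torch.nn as nn

class TinyAutoencoder(nn.Module):
    """Hypothetical autoencoder for illustration; READ's architecture differs."""
    def __init__(self, dim=784, hidden=64):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(dim, hidden), nn.ReLU())
        self.decoder = nn.Linear(hidden, dim)

    def forward(self, x):
        return self.decoder(self.encoder(x))

@torch.no_grad()
def ood_score(model, x):
    # Per-sample mean squared reconstruction error: higher suggests OOD.
    recon = model(x)
    return ((recon - x) ** 2).mean(dim=1)

# Usage: calibrate a threshold on held-out in-distribution data, then flag
# inputs whose reconstruction error exceeds it.
model = TinyAutoencoder().eval()   # assume weights were trained on ID data
x = torch.randn(8, 784)            # stand-in batch of flattened inputs
scores = ood_score(model, x)
threshold = 1.5                    # hypothetical value; fit on validation data
print(scores, scores > threshold)
```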