PhD candidate · trying to figure out whether multimodal LLMs are safe to deploy, and how badly they break when they aren't.
-
Peking University
- Beijing, China
Popular repositories Loading
-
mllm-jailbreak-bench
mllm-jailbreak-bench PublicReproducible benchmark for adversarial attacks on multimodal large language models
-
trust-eval-mm
trust-eval-mm PublicMulti-dimensional trustworthiness evaluation for multimodal LLMs
Python 15
-
chainreason
chainreason PublicForked from joshawome/chainreason
A benchmark for evaluating LLM reasoning on Ethereum and DeFi tasks
Python
-
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.
