PhD student at UC San Diego, developing scalable ways to understand ML and make it safer. Previously @mit.
-
University of California San Diego
Pinned Loading
-
Trustworthy-ML-Lab/Label-free-CBM
Trustworthy-ML-Lab/Label-free-CBM PublicA new framework to transform any neural networks into an interpretable concept-bottleneck-model (CBM) without needing labeled concept data
-
train_mnist_fast
train_mnist_fast PublicHow to train a CNN to 99% accuracy on MNIST in less than a second on a laptop
-
Trustworthy-ML-Lab/CLIP-dissect
Trustworthy-ML-Lab/CLIP-dissect PublicAn automatic and efficient tool to describe functionalities of individual neurons in DNNs
-
radial_rl_v2
radial_rl_v2 PublicThis repository contains the official code for our NeurIPS 2021 publication "Robust Deep Reinforcement Learning through Adversarial Loss"
-
Trustworthy-ML-Lab/Linear-Explanations
Trustworthy-ML-Lab/Linear-Explanations PublicJupyter Notebook 7
-
Trustworthy-ML-Lab/Efficient-LLM-automated-interpretability
Trustworthy-ML-Lab/Efficient-LLM-automated-interpretability PublicPython 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.