This repository collects all relevant resources about interpretability in LLMs
MICCAI 2022 (Oral): Interpretable Graph Neural Networks for Connectome-Based Brain Disorder Analysis
Discover and Cure: Concept-aware Mitigation of Spurious Correlation (ICML 2023)
[KDD'22] Source codes of "Graph Rationalization with Environment-based Augmentations"
Official code for the CVPR 2022 (oral) paper "OrphicX: A Causality-Inspired Latent Variable Model for Interpreting Graph Neural Networks."
[ICCV 2023] Learning Support and Trivial Prototypes for Interpretable Image Classification
Codebase for the paper "The Remarkable Robustness of LLMs: Stages of Inference?"
Explainable AI: From Simple Rules to Complex Generative Models
TraceFL is a novel mechanism for Federated Learning that achieves interpretability by tracking neuron provenance. It identifies clients responsible for global model predictions, achieving 99% accuracy across diverse datasets (e.g., medical imaging) and neural networks (e.g., GPT).
Build a neural net from scratch, without Keras or PyTorch, using only NumPy for the numerical computation and pandas for data loading.
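A minimal NumPy-only sketch of that idea (manual forward and backward passes on a toy XOR task; the variable names and hyperparameters are illustrative, not taken from the repository):

```python
import numpy as np

# Toy data: learn XOR with a tiny two-layer MLP.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

rng = np.random.default_rng(0)
W1, b1 = rng.normal(size=(2, 8)), np.zeros(8)   # input -> hidden
W2, b2 = rng.normal(size=(8, 1)), np.zeros(1)   # hidden -> output

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

lr = 0.5
for _ in range(5000):
    # Forward pass
    h = np.tanh(X @ W1 + b1)
    p = sigmoid(h @ W2 + b2)

    # Backward pass: manual chain rule for a binary cross-entropy loss
    dp = (p - y) / len(X)
    dW2, db2 = h.T @ dp, dp.sum(axis=0)
    dh = dp @ W2.T * (1 - h ** 2)               # tanh derivative
    dW1, db1 = X.T @ dh, dh.sum(axis=0)

    # Plain gradient-descent update
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2

print(np.round(p, 2))  # predictions approach [0, 1, 1, 0]
```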
Visualization methods for interpreting CNNs and Vision Transformers trained in a supervised or self-supervised way. The methods are based on CAM or on the Transformer attention mechanism, and the results are evaluated both qualitatively and quantitatively.
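A Grad-CAM-style sketch of how the CAM family of methods works; the model, target layer, and input below are assumptions for illustration, not this repository's code:

```python
import torch
import torch.nn.functional as F
from torchvision.models import resnet18

model = resnet18(weights="IMAGENET1K_V1").eval()
activations, gradients = {}, {}

def fwd_hook(module, inp, out):
    activations["feat"] = out.detach()

def bwd_hook(module, grad_in, grad_out):
    gradients["feat"] = grad_out[0].detach()

# Hook the last convolutional block so we can read its feature maps and gradients.
model.layer4.register_forward_hook(fwd_hook)
model.layer4.register_full_backward_hook(bwd_hook)

x = torch.randn(1, 3, 224, 224)            # stand-in for a preprocessed image
scores = model(x)
scores[0, scores.argmax()].backward()      # gradient of the top-class score

# Weight each feature map by its average gradient, combine, and keep positive evidence.
weights = gradients["feat"].mean(dim=(2, 3), keepdim=True)
cam = F.relu((weights * activations["feat"]).sum(dim=1, keepdim=True))
cam = F.interpolate(cam, size=x.shape[-2:], mode="bilinear", align_corners=False)
cam = (cam - cam.min()) / (cam.max() - cam.min() + 1e-8)   # normalize to [0, 1] heatmap
```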
Explainable Speaker Recognition
Interpretability: Methods for Identification and Retrieval of Concepts in CNN Networks
Implementation of the gradient-based t-SNE attribution method described in our GLBIO oral presentation: "Towards Computing Attributions for Dimensionality Reduction Techniques"
My PhD thesis at NUS, made public so that future graduate students may benefit.
Interpretable Anomaly Severity Detection on UAV Flight Log Messages
Work on combining a logit model with an information granulation method for better interpretability
Semi-supervised Concept Bottleneck Models (SSCBM)
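A minimal concept-bottleneck sketch to illustrate the general idea (not the SSCBM codebase; the semi-supervised aspect is reduced here to masking the concept loss for examples that lack concept annotations):

```python
import torch
import torch.nn as nn

class ConceptBottleneck(nn.Module):
    def __init__(self, in_dim=64, n_concepts=10, n_classes=5):
        super().__init__()
        self.encoder = nn.Sequential(nn.Linear(in_dim, 128), nn.ReLU(),
                                     nn.Linear(128, n_concepts))  # x -> concept logits
        self.head = nn.Linear(n_concepts, n_classes)              # concepts -> label logits

    def forward(self, x):
        c_logits = self.encoder(x)
        y_logits = self.head(torch.sigmoid(c_logits))  # label prediction goes through the bottleneck
        return c_logits, y_logits

model = ConceptBottleneck()
x = torch.randn(8, 64)                          # toy batch
c_true = torch.randint(0, 2, (8, 10)).float()   # binary concept annotations
c_mask = (torch.rand(8) < 0.5).float()          # only part of the batch has concept labels
y_true = torch.randint(0, 5, (8,))

c_logits, y_logits = model(x)
concept_loss = (nn.functional.binary_cross_entropy_with_logits(
    c_logits, c_true, reduction="none").mean(dim=1) * c_mask).mean()
label_loss = nn.functional.cross_entropy(y_logits, y_true)
loss = label_loss + 1.0 * concept_loss          # joint CBM objective
loss.backward()
```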