ReFT: Representation Finetuning for Language Models
Updated May 18, 2024 - Python
Responsible AI Toolbox is a suite of tools providing model and data exploration and assessment user interfaces and libraries that enable a better understanding of AI systems. These interfaces and libraries empower developers and stakeholders of AI systems to develop and monitor AI more responsibly, and take better data-driven actions.
Fit interpretable models. Explain blackbox machine learning.
A curated list of awesome open source libraries to deploy, monitor, version, and scale your machine learning models
Local Universal Rule-based Explanations
A curated list of awesome responsible machine learning resources.
[ICML'24] Official PyTorch Implementation of TimeX++
Code accompanying a review article on interpretability and XAI. Includes examples for both simple (sparse regression) and sophisticated (concept bottlenecks) approaches, using notebooks that can be run in a few minutes.
Code for the paper "Are Large Language Models Post Hoc Explainers?"
A game theoretic approach to explain the output of any machine learning model.
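The game-theoretic approach this description refers to is the Shapley value: a feature's attribution is its average marginal contribution over all coalitions of the other features. A minimal sketch of the exact (exponential-time) computation, using a hypothetical toy payoff function in place of a real model — practical libraries approximate this sum rather than enumerate it:

```python
from itertools import combinations
from math import factorial

def shapley_values(features, value_fn):
    """Exact Shapley values by enumerating every coalition.
    Exponential in the number of features; fine for a toy example."""
    n = len(features)
    phi = {i: 0.0 for i in features}
    for i in features:
        rest = [j for j in features if j != i]
        for size in range(n):
            # Weight of coalitions of this size in the Shapley sum
            weight = factorial(size) * factorial(n - size - 1) / factorial(n)
            for S in combinations(rest, size):
                coalition = set(S)
                phi[i] += weight * (value_fn(coalition | {i}) - value_fn(coalition))
    return phi

# Hypothetical toy "model": additive payoff plus one interaction bonus
x = {0: 1.0, 1: 2.0, 2: 3.0}

def v(S):
    out = sum(x[j] for j in S)
    if {0, 1} <= S:
        out += 4.0  # interaction, shared equally between features 0 and 1
    return out

phi = shapley_values(x, v)
# Efficiency property: attributions sum to v(all features)
assert abs(sum(phi.values()) - v(set(x))) < 1e-9
```

By symmetry, the 4.0 interaction bonus is split evenly between features 0 and 1 (phi = {0: 3.0, 1: 4.0, 2: 3.0}), which is the behavior that makes Shapley attributions attractive for explaining model outputs.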
Official implementation of the paper "Guided Attention for Interpretable Motion Captioning"
Robust multimodal brain registration via keypoints
🔅 Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
Competition of Mechanisms: Tracing How Language Models Handle Facts and Counterfactuals
Running interpretability experiments with application to weak-to-strong generalization
A collection of anomaly detection methods (iid/point-based, graph, and time series), including active learning for anomaly detection/discovery, Bayesian rule-mining, and descriptions for diversity/explanation/interpretability. Analysis of incorporating label feedback with ensemble and tree-based detectors. Includes adversarial attacks with Graph Convol…
Creating a PyTorch LSTM to classify movies by genre and visualizing the model's reasoning process
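Visualizing an LSTM's reasoning usually means inspecting its gate activations over time. A minimal sketch of a single LSTM cell step in pure Python (scalar input and state, hypothetical weights), showing the four gates such a visualization would trace — a real classifier like the one above would use vectorized framework layers instead:

```python
import math

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def lstm_step(x, h_prev, c_prev, W):
    """One LSTM cell step. W maps gate name -> (w_x, w_h, b);
    all quantities are scalars for readability."""
    def gate(name, squash):
        w_x, w_h, b = W[name]
        return squash(w_x * x + w_h * h_prev + b)

    i = gate("input", sigmoid)   # how much of the candidate to write
    f = gate("forget", sigmoid)  # how much old cell state to keep
    o = gate("output", sigmoid)  # how much cell state to expose
    g = gate("cell", math.tanh)  # candidate cell update

    c = f * c_prev + i * g       # new cell state
    h = o * math.tanh(c)         # new hidden state (fed to the classifier head)
    return h, c

# Hypothetical weights, just to run one step from a zero state
W = {k: (0.5, 0.25, 0.0) for k in ("input", "forget", "output", "cell")}
h, c = lstm_step(x=1.0, h_prev=0.0, c_prev=0.0, W=W)
```

Plotting `i`, `f`, and `o` per timestep over a movie synopsis is one way to surface which tokens the model retains or discards when assigning a genre.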
The website for NDIF, the National Deep Inference Fabric
Interpretability for sequence generation models 🐛 🔍
TrustyAI Explainability Toolkit