sparse-autoencoders

Star

Here are 24 public repositories matching this topic...

vgel / repeng

Star

A library for making RepE control vectors

machine-learning transformers language-model sparse-autoencoders sae sparse-autoencoder saes representation-engineering

Updated Jan 8, 2025
Jupyter Notebook

ysh329 / Chinese-UFLDL-Tutorial

Star

[UNMAINTAINED] 非监督特征学习与深度学习中文教程，该版本翻译自新版 UFLDL Tutorial 。建议新人们去学习斯坦福的CS231n课程，该门课程在网易云课堂上也有一个配有中文字幕的版本。

exercise convolutional-neural-networks unsupervised-learning supervised-neural-network sparse-autoencoders taught-learning

Updated Mar 13, 2018

OpenMOSS / Language-Model-SAEs

Star

For OpenMOSS Mechanistic Interpretability Team's Sparse Autoencoder (SAE) research.

sparse-autoencoders interpretability sparse-dictionary mechanistic-interpretability

Updated Jul 12, 2025
Python

LahiruJayasinghe / DeepDOA

Star

Finding Direction of arrival (DOA) of small UAVs using Sparse Denoising Autoencoders and Deep Neural Networks.

autoencoder denoising-autoencoders sparse-autoencoders unmanned-aerial-vehicle direction-of-arrival

Updated Oct 23, 2018
Python

dmis-lab / Monet

Star

[ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers

sparse-autoencoders iclr interpretability mixture-of-experts large-language-models iclr2025

Updated Jun 23, 2025
Python

abhisheksambyal / Autoencoders-using-Pytorch-Medical-Imaging

Star

Medical Imaging, Denoising Autoencoder, Sparse Denoising Autoencoder (SDAE) End-to-end and Layer Wise Pretraining

autoencoders denoising-autoencoders sparse-autoencoders autoencoder-mnist autoencoders-fashionmnist autoencoder-segmentation autoencoder-pytorch autoencoder-classification

Updated Apr 2, 2019
Jupyter Notebook

neuroexplicit-saar / Discover-then-Name

Star

Code for the paper: Discover-then-Name: Task-Agnostic Concept Bottlenecks via Automated Concept Discovery. ECCV 2024.

sparse-autoencoders concept-extraction concept-bottleneck-models eccv2024

Updated Nov 3, 2024
Python

Abhipanda4 / Sparse-Autoencoders

Star

Sparse Autoencoders using FashionMNIST dataset

pytorch autoencoders sparse-autoencoders fashion-mnist

Updated May 19, 2018
Python

rmovva / HypotheSAEs

Star

Hypothesizing interpretable relationships in text datasets using sparse autoencoders.

nlp topic-modeling sparse-autoencoders computational-social-science interpretability ai-for-science

Updated Jul 11, 2025
Jupyter Notebook

meteahishali / SRL-SOA

Star

Hyperspectral Band Selection using Self-Representation Learning with Sparse 1D-Operational Autoencoder (SRL-SOA)

machine-learning sparse-autoencoders band-selection hyperspectral-images 1d-operational-layers

Updated Apr 5, 2025
Python

dynamical-inference / patchsae

Star

Implementation of PatchSAE as presented in "Sparse autoencoders reveal selective remapping of visual concepts during adaptation"

pytorch sparse-autoencoders sae explainable-ai xai

Updated May 6, 2025
Jupyter Notebook

Butanium / tiny-activation-dashboard

Star

A tiny easily hackable implementation of a feature dashboard.

sparse-autoencoders sparse-autoencoder feature-visualization feature-dashboard

Updated Jul 3, 2025
Jupyter Notebook

Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the paper "Evaluating Open-Source Sparse Autoencoders on Disentangling Factual Knowledge in GPT-2 Small"

sparse-autoencoders sae sparse-autoencoder

Updated Jan 26, 2025
Python

gkimer / thesis-ICI

Star

Diagnóstico de falla de rodamiento utilizando descomposición modal empírica y deep learning

matlab dnn autoencoders emd sparse-autoencoders bearing-fault-diagnosis

Updated Jun 27, 2018
MATLAB

255BITS / sae-evolver

Star

Use evolution with sparse autoencoders

python evolutionary-algorithms sparse-autoencoders

Updated Jan 29, 2025
Python

MikolajSzawerda / music-sae

Star

Sparse Autoencoders (SAEs) for unsupervised music representation learning.

music machine-learning sparse-autoencoders rave yue musicgen

Updated Jun 4, 2025
Jupyter Notebook

ashioyajotham / exploring_saes

Star

Implementation and analysis of Sparse Autoencoders for neural network interpretability research. Features interactive visualization dashboard and W&B integration.

sparse-autoencoders interpretability activation-functions neuron-activity wandb transformerlens mech-interp