Skip to content
View gjoelbye's full-sized avatar

Block or report gjoelbye

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
gjoelbye/README.md

Hej — I’m Anders 👋🇩🇰

I’m a Dane and a PhD fellow at DTU (DDSA PhD Fellow), currently visiting Stanford’s STAIR Lab. I work at the intersection of trustworthy ML, causality, and neuroscience/healthcare—with a slightly more theory / math tilt than average, but always grounded in real data.

Currently I’m…

  • Visiting researcher at Stanford’s STAIR Lab focusing on trustworthy AI.
  • Interested in AI alignment and what’s happening inside neural networks—how representations form, align, and sometimes fail in ways that matter for robustness and interpretability. This also motivates my recent work on spectral PLS (including missing-data–induced phase transitions) and on mental rotation as a probe.
  • Developing PatternLocal from our NeurIPS paper “Minimizing False-Positive Attributions in Explanations of Non-Linear Models”—aimed at reducing spurious attributions while keeping local explanations faithful.
  • Working on the new Danish AI supercomputer Gefion to use Neural Architecture Search (NAS) to find the optimal EEG foundation model.

Background & affiliations

Selected publications & projects

Pinned Loading

  1. SPEED SPEED Public

    Python 22 5

  2. GoogleBrainCaptureHackathon GoogleBrainCaptureHackathon Public

    Jupyter Notebook 4 11

  3. bayesian-state-space-models bayesian-state-space-models Public

    Jupyter Notebook 1

  4. PatternLocal PatternLocal Public

    PatternLocal is a novel XAI method that refines local linearization approaches to reduce false-positive feature attributions in non-linear explanations.

    Python 1