Skip to content
@apartresearch

apartresearch

Artificial intelligence will change the world. Our mission is to ensure this happens safely and to the benefit of everyone.

Apart facilitates new research in AI safety, towards reducing societal-scale risks from the technology.

We combine a community focus with a drive for high-quality security research.


Read more about our work:

  • Our Research β€” Foundational research for safe and beneficial advanced AI
  • Apart Lab β€” Our research fellowship program for aspiring researchers in AI safety
  • Apart Sprints β€” Weekend-long research sprints and hackathons for AI security and governance

Twitter Badge LinkedIn Badge YouTube Badge Discord Badge Alignment Jam RSS Badge

Pinned

  1. interpretability-starter interpretability-starter Public

    🧠 Starter templates for doing interpretability research

    51 1

  2. Neuron2Graph Neuron2Graph Public

    Tools for exploring Transformer neuron behaviour, including input pruning and diversification.

    Jupyter Notebook 12 5

  3. deepdecipher deepdecipher Public

    🦠 DeepDecipher: An open source API to MLP neurons

    Rust 8

  4. specificityplus specificityplus Public

    πŸ‘©β€πŸ’» Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"

    Python 17 3

  5. Integer_Addition Integer_Addition Public

    ✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks

    Jupyter Notebook 10

  6. readingwhatwecan readingwhatwecan Public

    πŸ“šπŸ“šπŸ“šπŸ“šπŸ“šπŸ“šπŸ“šπŸ“šπŸ“š Reading everything

    CSS 12 3

Repositories

Showing 10 of 33 repositories

Top languages

Loading…

Most used topics

Loading…