
-
Sapienza University of Rome
- Rome, Italy
- https://www.santilli.xyz/
- @teelinsan
- in/andreasantilli
Starred repositories
Generic template to bootstrap your Python project.
🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton
Efficient Triton Kernels for LLM Training
Collection of all the papers talking about/relevant to the topic of privacy-preserving LLMs
Whispering Experts: Neural Interventions for Toxicity Mitigation in Language Models, ICML 2024
This repository collects all relevant resources about interpretability in LLMs
Video+code lecture on building nanoGPT from scratch
Sparsify transformers with SAEs and transcoders
Tools for understanding how transformer predictions are built layer-by-layer
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
GPU programming related news and material links
Materials for "IT5: Large-scale Text-to-text Pretraining for Italian Language Understanding and Generation" 🇮🇹
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Code associated with the paper **Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding**
This repository hosts materials from the CLiC-IT 2023 tutorial
A Python package for analyzing and transforming neural latent spaces.
Robust recipes to align language models with human and AI preferences
A simple tool to update bib entries with their official information (e.g., DBLP or the ACL anthology).
arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates