Implementation of Alphafold 3 in Pytorch
Implementation of MagViT2 Tokenizer in Pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
Implementation of LongRoPE, from the paper "LongRoPE: Extending LLM Context Window Beyond 2 Million Tokens"
Implementation of MambaFormer in Pytorch and Zeta, from the paper "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks"
Implementation of Agent Attention in Pytorch
Implementation of MeshGPT, SOTA Mesh generation using Attention, in Pytorch
An implementation of local windowed attention for language modeling (a minimal sketch of the idea follows this list)
Implementation of Diffusion Policy, Toyota Research's supposed breakthrough in leveraging DDPMs for learning policies for real-world Robotics
Implementation of a single layer of the MMDiT, proposed in Stable Diffusion 3, in Pytorch
Zeta implementation of "Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers"
Pytorch implementation of the sparse attention from the paper "Generating Long Sequences with Sparse Transformers" (a sketch of its strided mask follows this list)
Integrating Mamba/SSMs with Transformer for Enhanced Long Context and High-Quality Sequence Modeling
PyTorch Implementation of Jamba: "Jamba: A Hybrid Transformer-Mamba Language Model"
My implementation of KosmosG, from "KOSMOS-G: Generating Images in Context with Multimodal Large Language Models"
Implementation of the conditionally routed attention in the CoLT5 architecture, in Pytorch
Implementation of Band Split Roformer, SOTA attention network for music source separation, out of ByteDance AI Labs
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter"
Unofficial implementation of iTransformer, SOTA time series forecasting using attention networks, out of Tsinghua / Ant Group
Implementation of Q-Transformer, Scalable Offline Reinforcement Learning via Autoregressive Q-Functions, out of Google DeepMind
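
Since several entries above revolve around restricted attention patterns, here is a minimal sketch of local windowed attention, referenced from the local-attention entry in the list. Everything in it (the function name `local_windowed_attention`, the single-head shapes, and the assumption that the sequence length divides evenly into windows) is illustrative, not taken from any repository listed here.

```python
import torch

def local_windowed_attention(q, k, v, window_size):
    # q, k, v: (batch, seq_len, dim); assumes seq_len % window_size == 0
    b, n, d = q.shape
    w = window_size
    # split the sequence into non-overlapping windows:
    # (batch, num_windows, window_size, dim)
    q, k, v = (t.reshape(b, n // w, w, d) for t in (q, k, v))
    # scaled dot-product attention, computed independently per window
    scores = q @ k.transpose(-1, -2) * d ** -0.5  # (b, nw, w, w)
    attn = scores.softmax(dim=-1)
    out = attn @ v                                # (b, nw, w, d)
    return out.reshape(b, n, d)

x = torch.randn(2, 64, 32)
print(local_windowed_attention(x, x, x, window_size=16).shape)  # (2, 64, 32)
```

Restricting attention to windows of size w drops the cost per layer from O(n²) to O(n·w), which is the common thread running through the local, sparse, and routed variants above.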
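Likewise, for the Sparse Transformers entry: a hedged sketch of the kind of causal mask the paper's strided pattern produces, combining a local band with a periodic column of "summary" positions. The helper name and stride value are assumptions for illustration; the paper's exact strided and fixed patterns differ in detail.

```python
import torch

def strided_sparse_mask(seq_len, stride):
    # boolean mask, True where a query may attend to a key; causal,
    # loosely following the strided pattern in
    # "Generating Long Sequences with Sparse Transformers"
    i = torch.arange(seq_len).unsqueeze(1)  # query positions (rows)
    j = torch.arange(seq_len).unsqueeze(0)  # key positions (columns)
    causal = j <= i
    local = (i - j) < stride                # the previous `stride` tokens
    summary = (j % stride) == stride - 1    # every stride-th column
    return causal & (local | summary)

mask = strided_sparse_mask(8, stride=4)
# apply it by setting disallowed logits to -inf before the softmax:
# scores.masked_fill(~mask, float("-inf"))
```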