Orchestrate swarms of agents from any framework, such as OpenAI and LangChain, for business operation automation. Join our community: https://discord.gg/DbjBMJTSWD
A simple but complete full-attention transformer with a set of promising experimental features from various papers
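For reference, the core operation in any full-attention transformer is scaled dot-product attention, softmax(QKᵀ / √d) V. A minimal PyTorch sketch (the function name and tensor shapes are illustrative, not taken from the repository above):

```python
import math
import torch
import torch.nn.functional as F

def scaled_dot_product_attention(q, k, v, causal=False):
    """Scaled dot-product attention: softmax(QK^T / sqrt(d)) V.

    q, k, v: tensors of shape (batch, heads, seq_len, head_dim).
    """
    d = q.size(-1)
    scores = q @ k.transpose(-2, -1) / math.sqrt(d)  # (batch, heads, seq, seq)
    if causal:
        # Mask out future positions so each token attends only to the past.
        seq = scores.size(-1)
        mask = torch.triu(torch.ones(seq, seq, dtype=torch.bool), diagonal=1)
        scores = scores.masked_fill(mask, float("-inf"))
    weights = F.softmax(scores, dim=-1)
    return weights @ v
```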
Xllama🦙 is an extensible, advanced language model framework, inspired by the original Llama model.
QuillGPT is a PyTorch implementation of the GPT decoder block based on the architecture from the "Attention Is All You Need" paper by Vaswani et al. The repository also contains two pre-trained models (Shakespearean GPT and Harpoon GPT), a Streamlit playground, a containerized FastAPI microservice, and training and inference scripts and notebooks.
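For orientation, a GPT-style decoder block of the kind described above wraps causal self-attention and a position-wise feed-forward network in residual connections with layer normalization. A minimal sketch assuming a pre-norm layout (module names and hyperparameters are illustrative, not QuillGPT's actual API):

```python
import torch
import torch.nn as nn

class DecoderBlock(nn.Module):
    """One GPT-style decoder block: causal self-attention + feed-forward."""

    def __init__(self, d_model=256, n_heads=4, d_ff=1024, dropout=0.1):
        super().__init__()
        self.ln1 = nn.LayerNorm(d_model)
        self.attn = nn.MultiheadAttention(d_model, n_heads,
                                          dropout=dropout, batch_first=True)
        self.ln2 = nn.LayerNorm(d_model)
        self.ff = nn.Sequential(
            nn.Linear(d_model, d_ff),
            nn.GELU(),
            nn.Linear(d_ff, d_model),
            nn.Dropout(dropout),
        )

    def forward(self, x):
        seq = x.size(1)
        # Causal mask: True above the diagonal blocks attention to the future.
        mask = torch.triu(torch.ones(seq, seq, dtype=torch.bool,
                                     device=x.device), diagonal=1)
        h = self.ln1(x)
        attn_out, _ = self.attn(h, h, h, attn_mask=mask, need_weights=False)
        x = x + attn_out
        x = x + self.ff(self.ln2(x))
        return x
```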
A PyTorch-based sequence-to-sequence framework with a focus on Neural Machine Translation
This repository features a custom-built decoder-only language model (LLM) with a total of 37 million parameters 🔥, trained to ask questions from a given context
An attention-based approach to converting Indian Sign Language to text using simulated hand gesture data
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
Visualizing the attention of vision-language models
Experimental project building a custom LSTM and an LSTM with an attention layer, for comparative analysis on FTS forecasting (June 2024)
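One common way to add an attention layer to an LSTM forecaster, as in the comparison above, is to pool all hidden states with learned attention weights rather than keeping only the last state. A minimal sketch under that assumption (all names and sizes are illustrative):

```python
import torch
import torch.nn as nn

class LSTMWithAttention(nn.Module):
    """LSTM forecaster that attends over all hidden states."""

    def __init__(self, n_features=1, hidden=64):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.query = nn.Linear(hidden, 1)   # scores each time step
        self.head = nn.Linear(hidden, 1)    # one-step-ahead forecast

    def forward(self, x):
        # x: (batch, seq_len, n_features)
        out, _ = self.lstm(x)                   # (batch, seq, hidden)
        scores = self.query(out)                # (batch, seq, 1)
        weights = torch.softmax(scores, dim=1)  # attention over time steps
        context = (weights * out).sum(dim=1)    # (batch, hidden)
        return self.head(context)               # (batch, 1)
```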
Learn Generative AI with PyTorch (Manning Publications, 2024)
[ICML 2024] Outlier-Efficient Hopfield Layers for Large Transformer-Based Models
Scripts and trained models from our paper: M. Ntrougkas, N. Gkalelis, V. Mezaris, "T-TAME: Trainable Attention Mechanism for Explaining Convolutional Networks and Vision Transformers", IEEE Access, 2024. DOI:10.1109/ACCESS.2024.3405788.
Code accompanying the paper "Attention-Based CNN-BiLSTM for Sleep States Classification of Spatiotemporal Wide-Field Calcium Imaging Data"
Faster alternative to Metal Performance Shaders
My reimplementations of some transformer-based models (LLMs and LVMs).
DanteGPT
GPT-based protein language model for PTM site prediction
A Python package housing a collection of deep-learning multi-modal data fusion pipelines! From data loading to training to evaluation, fusilli's got you covered 🌸
A novel implementation fusing ViT with Mamba into a fast, agile, high-performance multi-modal model. Powered by Zeta, the simplest AI framework ever.