vision-transformer

Here are 792 public repositories matching this topic...

Seq2SeqSharp is a fast, flexible, tensor-based deep neural network framework written in .NET (C#). Its features include automatic differentiation, multiple network types (Transformer, LSTM, BiLSTM, and so on), multi-GPU support, cross-platform support (Windows, Linux, x86, x64, ARM), and multimodal models for text and images.

  • Updated Jul 20, 2024
  • C#

The AI Enabled Sign Language System is a Streamlit app that detects, classifies, and translates Indian Sign Language (ISL) using custom-trained YOLOv8 and Vision Transformer (ViT) models. It supports real-time image capture, multi-language text translation, and text-to-speech conversion, enhancing accessibility and communication for ISL users.

  • Updated Jul 20, 2024
  • Jupyter Notebook
computer-vision-challenge
