Efficient Triton Kernels for LLM Training
-
Updated
Jul 5, 2025 - Python
Efficient Triton Kernels for LLM Training
Explore LLM model deployment based on AXera's AI chips
Gemma2(9B), Llama3-8B-Finetune-and-RAG, code base for sample, implemented in Kaggle platform
RAG-based Telegram assistant bot for freshmen
Craft fortunes using Ollama
Boost RAG performance with question decomposer
Gemma2 2B model that fine tuned with an e-commerce data.
A complete guide to NLP and ML for text processing, covering rule-based models, RNNs, CNNs, Transformers, entity detection, sentiment analysis, LLM fine-tuning, RAG, and prompt engineering with tools like Langchain and Ollama.
This project focuses on efficient machine translation for nine Indic languages using the fine-tuned Gemma2-2B LLM and adapter switching, reducing computational overhead. It also leverages agentic methods and the Groq API for quality assurance and accurate translation analysis between source and target segments.
Программа для поиска Telegram-групп и каналов с использованием GPT 🔍. Позволяет искать сообщества по ключевым словам 🔑. A program for searching Telegram groups and channels using GPT 🔍. Allows you to search for communities by keyword 🔑.
AI Discord Bot (GEMM-X) is an intelligent assistant for Discord, leveraging AI technologies from multiple providers to generate images, create music, produce speech, and more. It supports custom personality settings and advanced user/server configurations.
Add a description, image, and links to the gemma2 topic page so that developers can more easily learn about it.
To associate your repository with the gemma2 topic, visit your repo's landing page and select "manage topics."