gpu-llm

Here are 7 public repositories matching this topic...

hkevin01 / kimi-linear

An optimized implementation of the Kimi Linear architecture, a hybrid linear attention mechanism reported to outperform traditional full attention.

gpu-computing status-active lang-python ai-ml-portfolio status-stable scope-small framework-numpy framework-matplotlib framework-pandas framework-huggingface framework-pytorch gpu-deep-learning gpu-llm

Updated May 19, 2026
Python

hkevin01 / Llama-GPU

A project that adds GPU acceleration for LLaMA models on local computers and AWS, for efficient inference and training.

gpu-computing status-active lang-python generative-ai gpu-inference ai-ml-portfolio scope-medium framework-numpy framework-matplotlib framework-pandas framework-huggingface framework-pytorch framework-scikit-learn framework-cuda ai-ml-portfolio-llm gpu-scientific gpu-deep-learning gpu-llm

Updated May 19, 2026
Python

hkevin01 / AdaAttn

AdaAttn is a GPU-native attention mechanism that dynamically adapts both numerical precision and matrix rank at runtime. By aligning linear algebra operations with modern GPU hardware characteristics, it reduces memory bandwidth and computational overhead in large language models without sacrificing model quality.

Updated May 19, 2026
Python

hkevin01 / secure-llm-assistant

Air-gapped, on-prem LLM assistant for software engineering teams. No external network calls. Full audit trail. RBAC + OIDC/LDAP auth.

status-active lang-python generative-ai ai-ml-portfolio scope-small ai-ml-portfolio-llm gdl-anomaly-detect gpu-deep-learning gpu-llm

Updated May 19, 2026
Python

hkevin01 / nvidia-nemo

A comprehensive implementation of NVIDIA NeMo Guardrails for AI safety and responsible AI development.

gpu-computing status-active lang-python ai-ml-portfolio status-stable scope-micro framework-numpy framework-pandas framework-huggingface framework-pytorch gpu-deep-learning gpu-llm

Updated Nov 20, 2025
Python

hkevin01 / pddl-instruct-lcot

This project implements PDDL-INSTRUCT with Logical Chain-of-Thought (LCoT), a novel approach to improve Large Language Model (LLM) performance on automated planning tasks. The system enhances planning capabilities through:

gpu-computing status-active lang-python nlp-tools generative-ai ai-ml-portfolio status-stable scope-micro framework-numpy framework-matplotlib framework-pandas framework-huggingface framework-pytorch framework-scikit-learn ai-ml-portfolio-llm gpu-deep-learning gpu-llm gpu-nlp

Updated May 19, 2026
Python

hkevin01 / csneps-robotics-inference

CSNePS Knowledge Graph Service is a production-ready enterprise system that bridges symbolic AI reasoning with modern ontology engineering. The system combines CSNePS (Cognitive Systems for Natural language Processing and Structured information) - a powerful semantic network reasoning engine - with comprehensive OWL ontology support, advanced graph

status-active lang-java status-finished nlp-tools generative-ai gpu-inference ai-ml-portfolio scope-small ai-ml-portfolio-llm gpu-deep-learning gpu-llm gpu-nlp

Updated May 19, 2026
Java
