Find descriptions and details about various AI models, including their capabilities, use cases, and specifications.
Description:
Distilled LLaMA by DeepSeek, fast and optimized for real-world tasks.
Model File: ai/deepseek-r1-distill-llama.md
URLs:
- https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B
- https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B
Description:
Google's latest Gemma, small yet strong for chat and generation.
Model File: ai/gemma3.md
URLs:
Description:
Meta's LLaMA 3.1: Chat-focused, benchmark-strong, multilingual-ready.
Model File: ai/llama3.1.md
URLs:
Description:
Solid LLaMA 3 update, reliable for coding, chat, and Q&A tasks.
Model File: ai/llama3.2.md
URLs:
- https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct
- https://huggingface.co/meta-llama/Llama-3.2-1B-Instruct
Description:
Newest LLaMA 3 release with improved reasoning and generation quality.
Model File: ai/llama3.3.md
Description:
A fast and powerful 7B parameter model excelling in reasoning, code, and math.
Model File: ai/mistral.md
URLs:
Description:
Mistral-Nemo-Instruct-2407 is an instruct fine-tuned large language model developed by Mistral AI and NVIDIA.
Model File: ai/mistral-nemo.md
URLs:
Description:
A state-of-the-art English language embedding model developed by Mixedbread AI.
Model File: ai/mxbai-embed-large.md
URLs:
Description:
Microsoft's compact model, surprisingly capable at reasoning and code.
Model File: ai/phi4.md
URLs:
Description:
Versatile Qwen update with better language skills and wider support.
Model File: ai/qwen2.5.md
URLs:
Description:
Experimental Qwen variant: lean, fast, and a bit mysterious.
Model File: ai/qwq.md
URLs:
Description:
A compact language model, designed to run efficiently on-device while performing a wide range of language tasks.
Model File: ai/smollm2.md
URLs:
- https://huggingface.co/HuggingFaceTB/SmolLM2-360M-Instruct
- https://huggingface.co/HuggingFaceTB/SmolLM2-135M-Instruct
The model-cards-cli tool provides commands to inspect and update model information:
# Basic inspection
make inspect REPOSITORY=ai/smollm2
# Inspect specific tag
make inspect REPOSITORY=ai/smollm2 TAG=360M-Q4_K_M
# Show all metadata
make inspect REPOSITORY=ai/smollm2 OPTIONS="--all"
# Update all models
make run
# Update specific model
make run-single MODEL=smollm2.md
# Upload a single overview to Docker Hub
make -C tools/model-cards-cli upload-overview FILE=ai/llama3.1.md REPO=ai/llama3 USERNAME=your_username TOKEN=your_pat_here
# Upload all overviews in the ai/ folder to their corresponding repositories
./tools/upload-all-overviews.sh your_username your_pat_here
- REPOSITORY: (Required) The repository to inspect (e.g., ai/smollm2)
- TAG: (Optional) Specific tag to inspect (e.g., 360M-Q4_K_M)
- OPTIONS: (Optional) Additional options:
  - --all: Show all metadata fields
  - --log-level: Set log level (debug, info, warn, error)
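
The inspect parameters above can be combined in a small wrapper. The sketch below is a hypothetical helper (`build_inspect_cmd` is not part of model-cards-cli) that checks the required `REPOSITORY` value and prints the matching `make inspect` command line without running it. It assumes the variables behave as described in this section.

```shell
#!/bin/sh
# Hypothetical helper, not part of model-cards-cli: assembles a `make inspect`
# command line from the parameters described above and prints it (dry run).
# Usage: build_inspect_cmd REPOSITORY [TAG] [OPTIONS]
build_inspect_cmd() {
    repository="${1:-}"
    tag="${2:-}"
    options="${3:-}"

    # REPOSITORY is the only required parameter.
    if [ -z "$repository" ]; then
        echo "error: REPOSITORY is required (e.g., ai/smollm2)" >&2
        return 1
    fi

    cmd="make inspect REPOSITORY=$repository"
    # TAG and OPTIONS are optional; append them only when provided.
    if [ -n "$tag" ]; then
        cmd="$cmd TAG=$tag"
    fi
    if [ -n "$options" ]; then
        cmd="$cmd OPTIONS=\"$options\""
    fi
    printf '%s\n' "$cmd"
}

# Example: print the command for one tag with all metadata fields.
build_inspect_cmd ai/smollm2 360M-Q4_K_M --all
```

Printing the command instead of executing it keeps the sketch safe to try outside the repository; piping the output to `sh` from the repository root would run it.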
- MODEL: (Required for run-single) Specific model file to update (e.g., ai/smollm2.md)
- --log-level: Set log level (debug, info, warn, error)
- FILE: (Required) Path to the markdown file containing the overview content
- REPO: (Required) Repository to upload the overview to (format: namespace/repository)
- USERNAME: (Required) Docker Hub username
- TOKEN: (Required) Personal Access Token (PAT)
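
As a rough illustration of how a bulk upload could be driven from these parameters, the sketch below prints one `upload-overview` command per overview file. It is not the contents of `tools/upload-all-overviews.sh`. The helpers `repo_for_file` and `print_upload_cmds` are hypothetical, and so is the assumption that `ai/<name>.md` maps to the repository `ai/<name>`. The single-file example in this README pairs `ai/llama3.1.md` with `ai/llama3`, so the real mapping may differ.

```shell
#!/bin/sh
# Hypothetical sketch, not the real tools/upload-all-overviews.sh.
# Assumption: an overview file ai/<name>.md belongs to repository ai/<name>.

# Derive the repository name from an overview path by stripping ".md".
repo_for_file() {
    printf '%s\n' "${1%.md}"
}

# Print (dry run) the upload command for each overview file passed after
# the username. The token is shown as a placeholder so it never lands in logs.
print_upload_cmds() {
    username="$1"
    shift
    for file in "$@"; do
        printf 'make -C tools/model-cards-cli upload-overview FILE=%s REPO=%s USERNAME=%s TOKEN=<redacted>\n' \
            "$file" "$(repo_for_file "$file")" "$username"
    done
}

# Example: show the commands for two overview files.
print_upload_cmds your_username ai/smollm2.md ai/qwq.md
```

Keeping the token out of the printed command is a deliberate choice in this sketch: a PAT passed on the command line can end up in shell history or CI logs.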