Performed by: Angelo Luis C. Cu

<div align="center">

  <a href="https://ultralytics.com/yolo" target="_blank">
    <img width="1024" src="https://raw.githubusercontent.com/ultralytics/assets/main/yolov8/banner-yolov8.png"></a>

  [中文](https://docs.ultralytics.com/zh/) | [한국어](https://docs.ultralytics.com/ko/) | [日本語](https://docs.ultralytics.com/ja/) | [Русский](https://docs.ultralytics.com/ru/) | [Deutsch](https://docs.ultralytics.com/de/) | [Français](https://docs.ultralytics.com/fr/) | [Español](https://docs.ultralytics.com/es/) | [Português](https://docs.ultralytics.com/pt/) | [Türkçe](https://docs.ultralytics.com/tr/) | [Tiếng Việt](https://docs.ultralytics.com/vi/) | [العربية](https://docs.ultralytics.com/ar/)

  <a href="https://github.com/ultralytics/ultralytics/actions/workflows/ci.yml"><img src="https://github.com/ultralytics/ultralytics/actions/workflows/ci.yml/badge.svg" alt="Ultralytics CI"></a>
  <a href="https://console.paperspace.com/github/ultralytics/ultralytics"><img src="https://assets.paperspace.io/img/gradient-badge.svg" alt="Run on Gradient"/></a>
  <a href="https://colab.research.google.com/github/ultralytics/ultralytics/blob/main/examples/tutorial.ipynb"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"></a>
  <a href="https://www.kaggle.com/models/ultralytics/yolo11"><img src="https://kaggle.com/static/images/open-in-kaggle.svg" alt="Open In Kaggle"></a>

  <a href="https://ultralytics.com/discord"><img alt="Discord" src="https://img.shields.io/discord/1089800235347353640?logo=discord&logoColor=white&label=Discord&color=blue"></a>
  <a href="https://community.ultralytics.com"><img alt="Ultralytics Forums" src="https://img.shields.io/discourse/users?server=https%3A%2F%2Fcommunity.ultralytics.com&logo=discourse&label=Forums&color=blue"></a>
  <a href="https://reddit.com/r/ultralytics"><img alt="Ultralytics Reddit" src="https://img.shields.io/reddit/subreddit-subscribers/ultralytics?style=flat&logo=reddit&logoColor=white&label=Reddit&color=blue"></a>

Welcome to the Ultralytics YOLO11 🚀 notebook! <a href="https://github.com/ultralytics/ultralytics">YOLO11</a> is the latest version of the YOLO (You Only Look Once) AI models developed by <a href="https://ultralytics.com">Ultralytics</a>. This notebook serves as the starting point for exploring the various resources available to help you get started with YOLO11 and understand its features and capabilities.

YOLO11 models are fast, accurate, and easy to use, making them ideal for various object detection and image segmentation tasks. They can be trained on large datasets and run on diverse hardware platforms, from CPUs to GPUs.

We hope that the resources in this notebook will help you get the most out of YOLO11. Please browse the YOLO11 <a href="https://docs.ultralytics.com/">Docs</a> for details, raise an issue on <a href="https://github.com/ultralytics/ultralytics">GitHub</a> for support, and join our <a href="https://ultralytics.com/discord">Discord</a> community for questions and discussions!

  <a href="https://www.youtube.com/watch?v=ZN3nRZT7b24" target="_blank">
    <img src="https://img.youtube.com/vi/ZN3nRZT7b24/maxresdefault.jpg" alt="Ultralytics Video" width="720" style="border-radius: 10px; box-shadow: 0 4px 8px rgba(0, 0, 0, 0.2);"></a>
  
  <p style="font-size: 16px; font-family: Arial, sans-serif; color: #555;">
    <strong>Watch: </strong> How to Train
  <a href="https://github.com/ultralytics/ultralytics">Ultralytics</a>
  <a href="https://docs.ultralytics.com/models/yolo11/">YOLO11</a> Model on Custom Dataset using Google Colab Notebook 🚀</p>
</div>

# Setup

Install `ultralytics` and its [dependencies](https://github.com/ultralytics/ultralytics/blob/main/pyproject.toml) with pip, then check the software and hardware.

[![PyPI - Version](https://img.shields.io/pypi/v/ultralytics?logo=pypi&logoColor=white)](https://pypi.org/project/ultralytics/) [![Downloads](https://static.pepy.tech/badge/ultralytics)](https://www.pepy.tech/projects/ultralytics) [![PyPI - Python Version](https://img.shields.io/pypi/pyversions/ultralytics?logo=python&logoColor=gold)](https://pypi.org/project/ultralytics/)

In [1]:
%pip install ultralytics
import ultralytics
ultralytics.checks()

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
Setup complete  (12 CPUs, 27.9 GB RAM, 894.7/3726.0 GB disk)


# 1. Predict

YOLO11 may be used directly in the Command Line Interface (CLI) with a `yolo` command for a variety of tasks and modes, and it accepts additional arguments, e.g. `imgsz=640`. See a full list of available `yolo` [arguments](https://docs.ultralytics.com/usage/cfg/) and other details in the [YOLO11 Predict Docs](https://docs.ultralytics.com/modes/predict/).
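
The `key=value` argument style described above can also be assembled programmatically. The following is a minimal sketch, not part of the Ultralytics API: `build_yolo_command` is a hypothetical helper that only builds the argument list, which could then be handed to `subprocess.run` on a machine where the `yolo` command is installed.

```python
import shlex


def build_yolo_command(mode, **overrides):
    """Return an argv list such as ['yolo', 'predict', 'model=yolo11n.pt', ...].

    Hypothetical helper: it only formats arguments and does not call Ultralytics.
    """
    argv = ["yolo", mode]
    for key, value in overrides.items():
        argv.append(f"{key}={value}")  # the CLI takes overrides as key=value pairs
    return argv


argv = build_yolo_command(
    "predict",
    model="yolo11n.pt",
    source="https://ultralytics.com/images/zidane.jpg",
    imgsz=640,
)
print(shlex.join(argv))  # shell-style rendering of the same command
```

Passing a list instead of a single string avoids shell quoting differences between Windows and UNIX shells.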


In [2]:
# Run inference on an image with YOLO11n
!yolo predict model=yolo11n.pt source='https://ultralytics.com/images/zidane.jpg'

Ultralytics 8.3.107 🚀 Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,616,248 parameters, 0 gradients, 6.5 GFLOPs

Found https://ultralytics.com/images/zidane.jpg locally at zidane.jpg
image 1/1 e:\Programming\VSCodeProjects\zidane.jpg: 384x640 2 persons, 1 tie, 58.1ms
Speed: 4.2ms preprocess, 58.1ms inference, 119.0ms postprocess per image at shape (1, 3, 384, 640)
Results saved to [1mruns\detect\predict2[0m
💡 Learn more at https://docs.ultralytics.com/modes/predict
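
The `Speed:` line in the output above reports preprocess, inference, and postprocess times separately. As a rough sketch, the stages can be parsed and summed to estimate end-to-end latency per image; `parse_speed` is a hypothetical helper, and the regular expression assumes the log format shown in this run.

```python
import re

# Copied from the predict output above.
SPEED_LINE = (
    "Speed: 4.2ms preprocess, 58.1ms inference, 119.0ms postprocess "
    "per image at shape (1, 3, 384, 640)"
)


def parse_speed(line):
    """Extract per-stage timings in milliseconds from a 'Speed:' log line."""
    return {stage: float(ms) for ms, stage in re.findall(r"([\d.]+)ms (\w+)", line)}


timings = parse_speed(SPEED_LINE)
total_ms = sum(timings.values())
print(timings)
print(f"end-to-end: {total_ms:.1f} ms per image, about {1000 / total_ms:.1f} images/s")
```

A single-image measurement like this one is only indicative; timings over many images give a steadier estimate.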


&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;
<img align="left" src="https://user-images.githubusercontent.com/26833433/212889447-69e5bdf1-5800-4e29-835e-2ed2336dede2.jpg" width="600">

# 2. Val
Validate a model's accuracy on the [COCO](https://docs.ultralytics.com/datasets/detect/coco/) dataset's `val` or `test` splits. The latest YOLO11 [models](https://github.com/ultralytics/ultralytics#models) are downloaded automatically the first time they are used. See [YOLO11 Val Docs](https://docs.ultralytics.com/modes/val/) for more information.

In [7]:
!python -m bash_kernel.install

Installing IPython kernel spec


In [11]:
# Download COCO val

import torch
torch.hub.download_url_to_file('https://ultralytics.com/assets/coco2017val.zip', 'tmp.zip')  # download (780M - 5000 images)
!unzip -q tmp.zip -d datasets && rm tmp.zip  # unzip

# `unzip` and `rm` are UNIX commands and are not available on my Windows machine, so I unzipped the archive manually

100%|██████████| 780M/780M [00:35<00:00, 23.2MB/s] 
'unzip' is not recognized as an internal or external command,
operable program or batch file.
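
Since `unzip` is not available on Windows, the extraction step can be done with Python's standard `zipfile` module instead, which works on any platform. This is a minimal sketch; `extract_zip` is a hypothetical helper name, and the `tmp.zip` and `datasets` paths in the usage comment are the ones used in the cell above.

```python
import zipfile
from pathlib import Path


def extract_zip(zip_path, dest_dir, delete_archive=True):
    """Extract zip_path into dest_dir, mirroring `unzip -q ... -d ... && rm ...`."""
    zip_path, dest_dir = Path(zip_path), Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    with zipfile.ZipFile(zip_path) as archive:
        archive.extractall(dest_dir)
    if delete_archive:
        zip_path.unlink()  # remove the archive once extraction succeeded
    return dest_dir


# Usage, assuming tmp.zip was downloaded by the cell above:
# extract_zip("tmp.zip", "datasets")
```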


In [12]:
# Validate YOLO11n on COCO8 val
!yolo val model=yolo11n.pt data=coco8.yaml

Ultralytics 8.3.107 🚀 Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,616,248 parameters, 0 gradients, 6.5 GFLOPs

Dataset 'coco8.yaml' images not found ⚠️, missing path 'E:\Programming\VSCodeProjects\datasets\coco8\images\val'
Downloading https://ultralytics.com/assets/coco8.zip to 'E:\Programming\VSCodeProjects\datasets\coco8.zip'...
Dataset download success ✅ (2.2s), saved to E:\Programming\VSCodeProjects\datasets

Downloading https://ultralytics.com/assets/Arial.ttf to 'C:\Users\cu\AppData\Roaming\Ultralytics\Arial.ttf'...
val: New cache created: E:\Programming\VSCodeProjects\datasets\coco8\labels\val.cache
                   all          4         17       0.57       0.85      0.847      0.632
                person          3         10      0.557        0.6      0.585      0.272
                   dog          1          1      0.548          1      0.995      0.697
                 horse      


  0%|          | 0.00/433k [00:00<?, ?B/s]
 89%|████████▊ | 384k/433k [00:00<00:00, 1.69MB/s]
100%|██████████| 433k/433k [00:00<00:00, 1.82MB/s]

Unzipping E:\Programming\VSCodeProjects\datasets\coco8.zip to E:\Programming\VSCodeProjects\datasets\coco8...:   0%|          | 0/25 [00:00<?, ?file/s]
Unzipping E:\Programming\VSCodeProjects\datasets\coco8.zip to E:\Programming\VSCodeProjects\datasets\coco8...: 100%|██████████| 25/25 [00:00<00:00, 2610.48file/s]

  0%|          | 0.00/755k [00:00<?, ?B/s]
100%|██████████| 755k/755k [00:00<00:00, 20.6MB/s]

[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val...:   0%|          | 0/4 [00:00<?, ?it/s]
[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<00:00, 159.35it/s]

                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95):   0%|          | 0/1 [00:00<?, ?it/s]
                 Cl

# 3. Train

<p align=""><a href="https://ultralytics.com/hub"><img width="1000" src="https://github.com/ultralytics/assets/raw/main/yolov8/banner-integrations.png"/></a></p>

Train YOLO11 on [Detect](https://docs.ultralytics.com/tasks/detect/), [Segment](https://docs.ultralytics.com/tasks/segment/), [Classify](https://docs.ultralytics.com/tasks/classify/) and [Pose](https://docs.ultralytics.com/tasks/pose/) datasets. See [YOLO11 Train Docs](https://docs.ultralytics.com/modes/train/) for more information.

In [3]:
#@title Select YOLO11 🚀 logger {run: 'auto'}
logger = 'Comet' #@param ['Comet', 'TensorBoard']

if logger == 'Comet':
  %pip install -q comet_ml
  import comet_ml; comet_ml.login()
elif logger == 'TensorBoard':
  %load_ext tensorboard
  %tensorboard --logdir .

Note: you may need to restart the kernel to use updated packages.
Please paste your Comet API key from https://www.comet.com/api/my/settings/
(api key may not show as you type)


[1;38;5;39mCOMET INFO:[0m Valid Comet API Key saved in C:\Users\cu\.comet.config (set COMET_CONFIG to change where it is saved).


In [1]:
# Train YOLO11n on COCO8 for 3 epochs
!yolo train model=yolo11n.pt data=coco8.yaml epochs=3 imgsz=640

Ultralytics 8.3.107 🚀 Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=detect, mode=train, model=yolo11n.pt, data=coco8.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True, li

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/97e1ccb5b8c6461ab0830fc4bc40d878

COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

[34m[1mtrain: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\train...:   0%|          | 0/4 [00:00<?, ?it/s]
[34m[1mtrain: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\train... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<00:00, 263.93it/s]

[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]
[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]

  0%|          | 0/1 [00:00<?, ?it/s]
        1/3     0.674G      1.098     

# 4. Export

Export a YOLO11 model to any supported format below with the `format` argument, e.g. `format=onnx`. See [YOLO11 Export Docs](https://docs.ultralytics.com/modes/export/) for more information.

- 💡 ProTip: Export to [ONNX](https://docs.ultralytics.com/integrations/onnx/) or [OpenVINO](https://docs.ultralytics.com/integrations/openvino/) for up to 3x CPU speedup.
- 💡 ProTip: Export to [TensorRT](https://docs.ultralytics.com/integrations/tensorrt/) for up to 5x GPU speedup.

| Format                                                                   | `format` Argument | Model                     | Metadata | Arguments                                                                                |
|--------------------------------------------------------------------------|-------------------|---------------------------|----------|------------------------------------------------------------------------------------------|
| [PyTorch](https://pytorch.org/)                                          | -                 | `yolo11n.pt`              | ✅        | -                                                                                        |
| [TorchScript](https://docs.ultralytics.com/integrations/torchscript)     | `torchscript`     | `yolo11n.torchscript`     | ✅        | `imgsz`, `optimize`, `batch`, `device`                                                   |
| [ONNX](https://docs.ultralytics.com/integrations/onnx)                   | `onnx`            | `yolo11n.onnx`            | ✅        | `imgsz`, `half`, `dynamic`, `simplify`, `opset`, `batch`, `device`                       |
| [OpenVINO](https://docs.ultralytics.com/integrations/openvino)           | `openvino`        | `yolo11n_openvino_model/` | ✅        | `imgsz`, `half`, `dynamic`, `int8`, `batch`, `data`, `fraction`, `device`                |
| [TensorRT](https://docs.ultralytics.com/integrations/tensorrt)           | `engine`          | `yolo11n.engine`          | ✅        | `imgsz`, `half`, `dynamic`, `simplify`, `workspace`, `int8`, `batch`, `data`, `fraction`, `device` |
| [CoreML](https://docs.ultralytics.com/integrations/coreml)               | `coreml`          | `yolo11n.mlpackage`       | ✅        | `imgsz`, `half`, `int8`, `nms`, `batch`, `device`                                        |
| [TF SavedModel](https://docs.ultralytics.com/integrations/tf-savedmodel) | `saved_model`     | `yolo11n_saved_model/`    | ✅        | `imgsz`, `keras`, `int8`, `batch`, `device`                                              |
| [TF GraphDef](https://docs.ultralytics.com/integrations/tf-graphdef)     | `pb`              | `yolo11n.pb`              | ❌        | `imgsz`, `batch`, `device`                                                               |
| [TF Lite](https://docs.ultralytics.com/integrations/tflite)              | `tflite`          | `yolo11n.tflite`          | ✅        | `imgsz`, `half`, `int8`, `batch`, `data`, `fraction`, `device`                           |
| [TF Edge TPU](https://docs.ultralytics.com/integrations/edge-tpu)        | `edgetpu`         | `yolo11n_edgetpu.tflite`  | ✅        | `imgsz`, `device`                                                                        |
| [TF.js](https://docs.ultralytics.com/integrations/tfjs)                  | `tfjs`            | `yolo11n_web_model/`      | ✅        | `imgsz`, `half`, `int8`, `batch`, `device`                                               |
| [PaddlePaddle](https://docs.ultralytics.com/integrations/paddlepaddle)   | `paddle`          | `yolo11n_paddle_model/`   | ✅        | `imgsz`, `batch`, `device`                                                               |
| [MNN](https://docs.ultralytics.com/integrations/mnn)                     | `mnn`             | `yolo11n.mnn`             | ✅        | `imgsz`, `batch`, `int8`, `half`, `device`                                               |
| [NCNN](https://docs.ultralytics.com/integrations/ncnn)                   | `ncnn`            | `yolo11n_ncnn_model/`     | ✅        | `imgsz`, `half`, `batch`, `device`                                                       |
| [IMX500](https://docs.ultralytics.com/integrations/sony-imx500)          | `imx`             | `yolov8n_imx_model/`      | ✅        | `imgsz`, `int8`, `data`, `fraction`, `device`                                            |
| [RKNN](https://docs.ultralytics.com/integrations/rockchip-rknn)          | `rknn`            | `yolo11n_rknn_model/`     | ✅        | `imgsz`, `batch`, `name`, `device`                                                       |
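
The table above pairs each `format` argument with the name of the exported file or folder for `yolo11n.pt`. As a small sketch, that column can be turned into a lookup that predicts where an export should land. The patterns are copied from the table; generalizing them to other model names is an assumption, `expected_export_name` is a hypothetical helper, and the IMX500 row is left out because the table lists a `yolov8n` name for it.

```python
from pathlib import Path

# Output-name patterns copied from the table above; "{stem}" stands for the model name.
EXPORT_OUTPUTS = {
    "torchscript": "{stem}.torchscript",
    "onnx": "{stem}.onnx",
    "openvino": "{stem}_openvino_model/",
    "engine": "{stem}.engine",
    "coreml": "{stem}.mlpackage",
    "saved_model": "{stem}_saved_model/",
    "pb": "{stem}.pb",
    "tflite": "{stem}.tflite",
    "edgetpu": "{stem}_edgetpu.tflite",
    "tfjs": "{stem}_web_model/",
    "paddle": "{stem}_paddle_model/",
    "mnn": "{stem}.mnn",
    "ncnn": "{stem}_ncnn_model/",
    "rknn": "{stem}_rknn_model/",
}


def expected_export_name(weights, fmt):
    """Return the export name the table lists for this format (assumed pattern)."""
    if fmt not in EXPORT_OUTPUTS:
        raise ValueError(f"unknown format {fmt!r}; choose from {sorted(EXPORT_OUTPUTS)}")
    return EXPORT_OUTPUTS[fmt].format(stem=Path(weights).stem)


print(expected_export_name("yolo11n.pt", "torchscript"))
print(expected_export_name("yolo11n.pt", "openvino"))
```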

In [2]:
!yolo export model=yolo11n.pt format=torchscript

Ultralytics 8.3.107 🚀 Python-3.12.9 torch-2.6.0+cu118 CPU (AMD Ryzen 5 5600G with Radeon Graphics)
YOLO11n summary (fused): 100 layers, 2,616,248 parameters, 0 gradients, 6.5 GFLOPs

[34m[1mPyTorch:[0m starting from 'yolo11n.pt' with input shape (1, 3, 640, 640) BCHW and output shape(s) (1, 84, 8400) (5.4 MB)

[34m[1mTorchScript:[0m starting export with torch 2.6.0+cu118...
[34m[1mTorchScript:[0m export success ✅ 1.9s, saved as 'yolo11n.torchscript' (10.5 MB)

Export complete (3.9s)
Results saved to [1mE:\Programming\VSCodeProjects[0m
Predict:         yolo predict task=detect model=yolo11n.torchscript imgsz=640  
Validate:        yolo val task=detect model=yolo11n.torchscript imgsz=640 data=/usr/src/ultralytics/ultralytics/cfg/datasets/coco.yaml  
Visualize:       https://netron.app
💡 Learn more at https://docs.ultralytics.com/modes/export


# 5. Python Usage

YOLO11 was reimagined using Python-first principles for the most seamless Python YOLO experience yet. YOLO11 models can be loaded from a trained checkpoint or created from scratch. The model object's methods are then used to train, validate, predict, and export. See detailed Python usage examples in the [YOLO11 Python Docs](https://docs.ultralytics.com/usage/python/).

In [1]:
from ultralytics import YOLO

# Load a model
model = YOLO('yolo11n.yaml')  # build a new model from scratch
model = YOLO('yolo11n.pt')  # load a pretrained model (recommended for training)

# Use the model
results = model.train(data='coco8.yaml', epochs=3)  # train the model
results = model.val()  # evaluate model performance on the validation set
results = model('https://ultralytics.com/images/bus.jpg')  # predict on an image
results = model.export(format='onnx')  # export the model to ONNX format

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=detect, mode=train, model=yolo11n.pt, data=coco8.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train2, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True, li

[1;38;5;39mCOMET INFO:[0m Experiment is live on comet.com https://www.comet.com/qalccu/general/fe2c375520f842269bbfaa91654ed205

[1;38;5;39mCOMET INFO:[0m Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.


[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs\detect\train2', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
[34m[1mAMP: [0mrunning Automatic Mixed Precision (AMP) checks...
[34m[1mAMP: [0mchecks passed 


[34m[1mtrain: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\train.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]
[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]


Plotting labels to runs\detect\train2\labels.jpg... 
[34m[1moptimizer:[0m 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... 
[34m[1moptimizer:[0m AdamW(lr=0.000119, momentum=0.9) with parameter groups 81 weight(decay=0.0), 88 weight(decay=0.0005), 87 bias(decay=0.0)
[34m[1mTensorBoard: [0mmodel graph visualization added 
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to [1mruns\detect\train2[0m
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        1/3     0.674G      1.098       2.75      1.479         21        640: 100%|██████████| 1/1 [00:00<00:00,  1.00it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.90it/s]

                   all          4         17      0.566       0.85      0.878      0.634






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        2/3     0.707G      1.167      2.779      1.475         36        640: 100%|██████████| 1/1 [00:00<00:00,  6.74it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 17.49it/s]

                   all          4         17      0.556       0.85      0.886      0.635






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        3/3     0.707G      1.091      2.495      1.214         20        640: 100%|██████████| 1/1 [00:00<00:00,  7.22it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 17.20it/s]

                   all          4         17      0.558       0.85      0.851      0.615






3 epochs completed in 0.002 hours.
Optimizer stripped from runs\detect\train2\weights\last.pt, 5.5MB
Optimizer stripped from runs\detect\train2\weights\best.pt, 5.5MB

Validating runs\detect\train2\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,616,248 parameters, 0 gradients, 6.5 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 22.66it/s]


                   all          4         17      0.557       0.85      0.887      0.635
                person          3         10      0.547        0.6      0.599      0.269
                   dog          1          1       0.54          1      0.995      0.697
                 horse          1          2      0.496          1      0.995      0.674
              elephant          1          2      0.358        0.5      0.745      0.281
              umbrella          1          1      0.562          1      0.995      0.995
          potted plant          1          1      0.839          1      0.995      0.895
Speed: 0.4ms preprocess, 4.2ms inference, 0.0ms loss, 1.3ms postprocess per image
Results saved to [1mruns\detect\train2[0m


[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m Comet.ml Experiment Summary
[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m   Data:
[1;38;5;39mCOMET INFO:[0m     display_summary_level : 1
[1;38;5;39mCOMET INFO:[0m     name                  : tough_hippopotamus_6382
[1;38;5;39mCOMET INFO:[0m     url                   : https://www.comet.com/qalccu/general/fe2c375520f842269bbfaa91654ed205
[1;38;5;39mCOMET INFO:[0m   Metrics [count] (min, max):
[1;38;5;39mCOMET INFO:[0m     lr/pg0 [7]               : (0.0, 8.092e-07)
[1;38;5;39mCOMET INFO:[0m     lr/pg1 [7]               : (0.0, 8.092e-07)
[1;38;5;39mCOMET INFO:[0m     lr/pg2 [7]               : (0.0, 8.092e-07)
[1;38;5;39mCOMET INFO:[0m     metrics/mAP50(B) [8]     : (0.8508, 0.8873046020849226)
[1;38;5;39mCOMET INFO:[0m     metric

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,616,248 parameters, 0 gradients, 6.5 GFLOPs


[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:05<00:00,  5.37s/it]


                   all          4         17      0.556       0.85      0.887       0.62
                person          3         10      0.546        0.6      0.599      0.279
                   dog          1          1      0.539          1      0.995      0.697
                 horse          1          2      0.495          1      0.995      0.674
              elephant          1          2      0.357        0.5      0.745      0.281
              umbrella          1          1      0.561          1      0.995      0.895
          potted plant          1          1      0.836          1      0.995      0.895
Speed: 0.7ms preprocess, 26.2ms inference, 0.0ms loss, 1.4ms postprocess per image
Results saved to [1mruns\detect\train22[0m

Downloading https://ultralytics.com/images/bus.jpg to 'bus.jpg'...


100%|██████████| 134k/134k [00:00<00:00, 9.14MB/s]

image 1/1 e:\Programming\VSCodeProjects\bus.jpg: 640x480 4 persons, 1 bus, 53.6ms
Speed: 2.7ms preprocess, 53.6ms inference, 1.5ms postprocess per image at shape (1, 3, 640, 480)
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CPU (AMD Ryzen 5 5600G with Radeon Graphics)






[34m[1mPyTorch:[0m starting from 'runs\detect\train2\weights\best.pt' with input shape (1, 3, 640, 640) BCHW and output shape(s) (1, 84, 8400) (5.3 MB)
[31m[1mrequirements:[0m Ultralytics requirements ['onnx>=1.12.0', 'onnxslim', 'onnxruntime-gpu'] not found, attempting AutoUpdate...
Collecting onnx>=1.12.0
  Downloading onnx-1.17.0-cp312-cp312-win_amd64.whl.metadata (16 kB)
Collecting onnxslim
  Downloading onnxslim-0.1.50-py3-none-any.whl.metadata (4.8 kB)
Collecting onnxruntime-gpu
  Downloading onnxruntime_gpu-1.21.0-cp312-cp312-win_amd64.whl.metadata (5.2 kB)
Collecting coloredlogs (from onnxruntime-gpu)
  Downloading coloredlogs-15.0.1-py2.py3-none-any.whl.metadata (12 kB)
Collecting humanfriendly>=9.1 (from coloredlogs->onnxruntime-gpu)
  Downloading humanfriendly-10.0-py2.py3-none-any.whl.metadata (9.2 kB)
Collecting pyreadline3 (from humanfriendly>=9.1->coloredlogs->onnxruntime-gpu)
  Downloading pyreadline3-3.5.4-py3-none-any.whl.metadata (4.7 kB)
Downloading onnx-1.17
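
In the per-class table printed after `Validating runs\detect\train2\weights\best.pt...` in the output above, the `all` row appears to be the unweighted mean of the per-class rows rather than an average weighted by instance count. The sketch below checks that reading against the printed numbers; the values are copied from that table, and the three-decimal rounding in the log is the only tolerance assumed.

```python
from statistics import mean

# (precision, recall, mAP50, mAP50-95) per class, copied from the validation table above.
PER_CLASS = {
    "person": (0.547, 0.6, 0.599, 0.269),
    "dog": (0.54, 1.0, 0.995, 0.697),
    "horse": (0.496, 1.0, 0.995, 0.674),
    "elephant": (0.358, 0.5, 0.745, 0.281),
    "umbrella": (0.562, 1.0, 0.995, 0.995),
    "potted plant": (0.839, 1.0, 0.995, 0.895),
}
REPORTED_ALL = (0.557, 0.85, 0.887, 0.635)  # the "all" row from the same table
COLUMNS = ("P", "R", "mAP50", "mAP50-95")

# Transpose the rows into columns, then average each metric across classes.
averages = [mean(column) for column in zip(*PER_CLASS.values())]

for name, average, reported in zip(COLUMNS, averages, REPORTED_ALL):
    print(f"{name:9s} mean={average:.4f} reported={reported}")
```

Because each class counts equally, the `person` class with 10 instances carries the same weight as `dog` with a single instance.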

In [None]:
results = model('https://ultralytics.com/images/bus.jpg')  # predict on an image
for result in results:
    result.show()  # plot results

# The KeyboardInterrupt below comes from interrupting the cell manually, which closes the show() output window.


Found https://ultralytics.com/images/bus.jpg locally at bus.jpg
image 1/1 e:\Programming\VSCodeProjects\bus.jpg: 640x480 4 persons, 1 bus, 11.6ms
Speed: 93.9ms preprocess, 11.6ms inference, 1.4ms postprocess per image at shape (1, 3, 640, 480)


KeyboardInterrupt: 

# 6. Tasks

YOLO11 can train, val, predict and export models for the most common tasks in vision AI: [Detect](https://docs.ultralytics.com/tasks/detect/), [Segment](https://docs.ultralytics.com/tasks/segment/), [Classify](https://docs.ultralytics.com/tasks/classify/) and [Pose](https://docs.ultralytics.com/tasks/pose/). See [YOLO11 Tasks Docs](https://docs.ultralytics.com/tasks/) for more information.

<br><img width="1024" src="https://raw.githubusercontent.com/ultralytics/assets/main/im/banner-tasks.png">


## 1. Detection

YOLO11 _detection_ models have no suffix and are the default YOLO11 models, e.g. `yolo11n.pt`, and are pretrained on COCO. See [Detection Docs](https://docs.ultralytics.com/tasks/detect/) for full details.
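
The naming rule above (no suffix for detection) extends to the other tasks through a task suffix on the weights file. The sketch below assumes the `-seg`, `-cls`, and `-pose` suffixes used by the published Ultralytics weights, which this notebook does not list itself; `weights_name` is a hypothetical helper, not a library function.

```python
# Task suffixes assumed from the published Ultralytics weight names;
# only the empty detection suffix is stated in this notebook.
TASK_SUFFIXES = {"detect": "", "segment": "-seg", "classify": "-cls", "pose": "-pose"}


def weights_name(task, size="n"):
    """Build a pretrained-weights filename such as 'yolo11n-seg.pt'."""
    if task not in TASK_SUFFIXES:
        raise ValueError(f"unknown task {task!r}; choose from {sorted(TASK_SUFFIXES)}")
    return f"yolo11{size}{TASK_SUFFIXES[task]}.pt"


for task in TASK_SUFFIXES:
    print(f"{task:9s} -> {weights_name(task)}")
```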


In [7]:
# Load YOLO11n, train it on COCO8 for 3 epochs and predict an image with it
from ultralytics import YOLO

model = YOLO('yolo11n.pt')  # load a pretrained YOLO detection model
model.train(data='coco8.yaml', epochs=3)  # train the model
results = model('https://ultralytics.com/images/bus.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=detect, mode=train, model=yolo11n.pt, data=coco8.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train3, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True, li

[1;38;5;39mCOMET INFO:[0m Experiment is live on comet.com https://www.comet.com/qalccu/general/74366858011a4ae4a3c46bd7094ec4d3

[1;38;5;39mCOMET INFO:[0m Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.


[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs\detect\train3', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
[34m[1mAMP: [0mrunning Automatic Mixed Precision (AMP) checks...
[34m[1mAMP: [0mchecks passed 


[34m[1mtrain: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\train.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]
[34m[1mval: [0mScanning E:\Programming\VSCodeProjects\datasets\coco8\labels\val.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]


Plotting labels to runs\detect\train3\labels.jpg... 
[34m[1moptimizer:[0m 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... 
[34m[1moptimizer:[0m AdamW(lr=0.000119, momentum=0.9) with parameter groups 81 weight(decay=0.0), 88 weight(decay=0.0005), 87 bias(decay=0.0)
[34m[1mTensorBoard: [0mmodel graph visualization added 
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to [1mruns\detect\train3[0m
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        1/3     0.709G      1.098       2.75      1.479         21        640: 100%|██████████| 1/1 [00:00<00:00,  3.82it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 14.50it/s]

                   all          4         17      0.566       0.85      0.878      0.634






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        2/3      0.73G      1.167      2.779      1.475         36        640: 100%|██████████| 1/1 [00:00<00:00,  5.76it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 13.10it/s]

                   all          4         17      0.556       0.85      0.886      0.635






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        3/3      0.73G      1.091      2.495      1.214         20        640: 100%|██████████| 1/1 [00:00<00:00,  5.83it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 13.58it/s]

                   all          4         17      0.558       0.85      0.851      0.615






3 epochs completed in 0.003 hours.
Optimizer stripped from runs\detect\train3\weights\last.pt, 5.5MB
Optimizer stripped from runs\detect\train3\weights\best.pt, 5.5MB

Validating runs\detect\train3\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,616,248 parameters, 0 gradients, 6.5 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 19.26it/s]


                   all          4         17      0.557       0.85      0.887      0.635
                person          3         10      0.547        0.6      0.599      0.269
                   dog          1          1       0.54          1      0.995      0.697
                 horse          1          2      0.496          1      0.995      0.674
              elephant          1          2      0.358        0.5      0.745      0.281
              umbrella          1          1      0.562          1      0.995      0.995
          potted plant          1          1      0.839          1      0.995      0.895
Speed: 0.4ms preprocess, 4.2ms inference, 0.0ms loss, 1.8ms postprocess per image
Results saved to runs\detect\train3

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : conservation_girder_742
COMET INFO:     url                   : https://www.comet.com/qalccu/general/74366858011a4ae4a3c46bd7094ec4d3
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]               : (0.0, 8.092e-07)
COMET INFO:     lr/pg1 [7]               : (0.0, 8.092e-07)
COMET INFO:     lr/pg2 [7]               : (0.0, 8.092e-07)
COMET INFO:     metrics/mAP50(B) [8]     : (0.8508, 0.8873046020849226)
COMET INFO:     metric


Found https://ultralytics.com/images/bus.jpg locally at bus.jpg
image 1/1 e:\Programming\VSCodeProjects\bus.jpg: 640x480 4 persons, 1 bus, 19.1ms
Speed: 3.4ms preprocess, 19.1ms inference, 2.2ms postprocess per image at shape (1, 3, 640, 480)
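
The `all` row of the detection validation table above looks like the unweighted (macro) mean of the six per-class rows. The sketch below checks that reading against the logged numbers. The per-class values are copied from the table; the averaging rule is an inference from those numbers, not something taken from the Ultralytics source, and `macro_average` is a hypothetical helper name.

```python
# Per-class validation metrics copied from the detection log above:
# class -> (precision, recall, mAP50, mAP50-95)
per_class = {
    "person": (0.547, 0.6, 0.599, 0.269),
    "dog": (0.54, 1.0, 0.995, 0.697),
    "horse": (0.496, 1.0, 0.995, 0.674),
    "elephant": (0.358, 0.5, 0.745, 0.281),
    "umbrella": (0.562, 1.0, 0.995, 0.995),
    "potted plant": (0.839, 1.0, 0.995, 0.895),
}


def macro_average(rows):
    """Unweighted mean of each metric column across classes (hypothetical helper)."""
    columns = zip(*rows.values())
    return tuple(sum(column) / len(rows) for column in columns)


all_row = macro_average(per_class)
print([round(value, 3) for value in all_row])
```

The rounded result agrees with the logged `all` row (0.557, 0.85, 0.887, 0.635). Because the average is unweighted, a class with a single instance (`dog`, `umbrella`) counts as much as `person` with ten, so on a dataset as small as COCO8 the headline numbers are dominated by a handful of boxes.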


## 2. Segmentation

YOLO11 _segmentation_ models use the `-seg` suffix (e.g. `yolo11n-seg.pt`) and are pretrained on COCO. See [Segmentation Docs](https://docs.ultralytics.com/tasks/segment/) for full details.


In [8]:
# Load YOLO11n-seg, train it on COCO8-seg for 3 epochs and predict an image with it
from ultralytics import YOLO

model = YOLO('yolo11n-seg.pt')  # load a pretrained YOLO segmentation model
model.train(data='coco8-seg.yaml', epochs=3)  # train the model
results2 = model('https://ultralytics.com/images/bus.jpg')  # predict on an image

results2[0].show()  # plot results

Downloading https://github.com/ultralytics/assets/releases/download/v8.3.0/yolo11n-seg.pt to 'yolo11n-seg.pt'...


100%|██████████| 5.90M/5.90M [00:00<00:00, 21.3MB/s]

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=segment, mode=train, model=yolo11n-seg.pt, data=coco8-seg.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True, line_width=None, format=torchscript, keras=False, optimize=False, int8=False, dynamic=Fal

100%|██████████| 439k/439k [00:00<00:00, 20.2MB/s]
Unzipping E:\Programming\VSCodeProjects\datasets\coco8-seg.zip to E:\Programming\VSCodeProjects\datasets\coco8-seg...: 100%|██████████| 25/25 [00:00<00:00, 2266.36file/s]

Dataset download success  (1.9s), saved to E:\Programming\VSCodeProjects\datasets


                   from  n    params  module                                       arguments                     
  0                  -1  1       464  ultralytics.nn.modules.conv.Conv             [3, 16, 3, 2]                 
  1                  -1  1      4672  ultralytics.nn.modules.conv.Conv             [16, 32, 3, 2]                
  2                  -1  1      6640  ultralytics.nn.modules.block.C3k2            [32, 64, 1, False, 0.25]      
  3                  -1  1     36992  ultralytics.nn.modules.conv.Conv             [64, 64, 3, 2]                
  4                  -1  1     26080  ultralytics.nn.modules.block.C3k2            [64, 128, 1, False, 0.25]     
  5                  -1  1    147712  ultralytics.nn.modules.conv.Conv             [128, 128, 3, 2]              
  6                  -1  1     87040  ultralytics.nn.modules.block.C3k2            [128, 128, 1, True]        




YOLO11n-seg summary: 203 layers, 2,876,848 parameters, 2,876,832 gradients, 10.5 GFLOPs

Transferred 561/561 items from pretrained weights

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/21860b54119f4aef9631bbe748cef1e8
COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\segment\train', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed 

train: Scanning E:\Programming\VSCodeProjects\datasets\coco8-seg\labels\train... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<00:00, 234.59it/s]
train: New cache created: E:\Programming\VSCodeProjects\datasets\coco8-seg\labels\train.cache
val: Scanning E:\Programming\VSCodeProjects\datasets\coco8-seg\labels\val... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<00:00, 199.37it/s]
val: New cache created: E:\Programming\VSCodeProjects\datasets\coco8-seg\labels\val.cache

Plotting labels to runs\segment\train\labels.jpg... 
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... 
optimizer: AdamW(lr=0.000119, momentum=0.9) with parameter groups 90 weight(decay=0.0), 101 weight(decay=0.0005), 100 bias(decay=0.0)
TensorBoard: model graph visualization added 
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\segment\train
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


        1/3     0.906G     0.9704      2.375      2.628      1.288         22        640: 100%|██████████| 1/1 [00:00<00:00,  4.22it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.09it/s]

                   all          4         17      0.781      0.894      0.939      0.662       0.72      0.833      0.822      0.564






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


        2/3      1.07G      1.038       3.27      2.585      1.274         37        640: 100%|██████████| 1/1 [00:00<00:00,  5.63it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 10.96it/s]

                   all          4         17      0.778      0.906      0.939      0.664      0.715      0.844      0.822      0.566






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


        3/3      1.08G     0.8966      2.807      1.727      1.108         21        640: 100%|██████████| 1/1 [00:00<00:00,  6.19it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 10.04it/s]

                   all          4         17      0.778      0.908       0.94      0.672      0.714      0.845      0.822      0.565






3 epochs completed in 0.002 hours.
Optimizer stripped from runs\segment\train\weights\last.pt, 6.1MB
Optimizer stripped from runs\segment\train\weights\best.pt, 6.1MB

Validating runs\segment\train\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-seg summary (fused): 113 layers, 2,868,664 parameters, 0 gradients, 10.4 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 12.16it/s]


                   all          4         17      0.777       0.91       0.94      0.671      0.712      0.846      0.822      0.565
                person          3         10      0.741      0.575      0.664      0.312      0.741      0.575      0.618      0.255
                   dog          1          1      0.725          1      0.995      0.895      0.725          1      0.995      0.895
                 horse          1          2      0.612          1      0.995      0.581      0.612          1      0.828        0.2
              elephant          1          2          1      0.885      0.995      0.348      0.615        0.5        0.5      0.252
              umbrella          1          1      0.659          1      0.995      0.995      0.659          1      0.995      0.895
          potted plant          1          1      0.924          1      0.995      0.895      0.924          1      0.995      0.895
Speed: 0.4ms preprocess, 5.2ms inference, 0.0ms loss, 1.7ms postproce

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : financial_goldfish_8222
COMET INFO:     url                   : https://www.comet.com/qalccu/general/21860b54119f4aef9631bbe748cef1e8
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]               : (0.0, 8.092e-07)
COMET INFO:     lr/pg1 [7]               : (0.0, 8.092e-07)
COMET INFO:     lr/pg2 [7]               : (0.0, 8.092e-07)
COMET INFO:     metrics/mAP50(B) [8]     : (0.93925, 0.94004)
COMET INFO:     metrics/mAP50(M)


Found https://ultralytics.com/images/bus.jpg locally at bus.jpg
image 1/1 e:\Programming\VSCodeProjects\bus.jpg: 640x480 4 persons, 1 bus, 1 stop sign, 32.5ms
Speed: 2.5ms preprocess, 32.5ms inference, 7.7ms postprocess per image at shape (1, 3, 640, 480)
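
The `Mask(P R mAP50 mAP50-95)` columns in the segmentation log score predicted masks against ground-truth masks by intersection over union (IoU). Under the usual COCO-style convention, mAP50 treats a prediction as a match when its IoU is at least 0.5, and mAP50-95 averages over thresholds from 0.50 to 0.95. The sketch below shows only the IoU step, on hypothetical 4x4 toy masks. `mask_iou` is an illustrative helper, not an Ultralytics function, and the masks are not taken from the run above.

```python
def mask_iou(mask_a, mask_b):
    """IoU of two binary masks given as equally sized 2D lists of 0/1."""
    intersection = 0
    union = 0
    for row_a, row_b in zip(mask_a, mask_b):
        for a, b in zip(row_a, row_b):
            intersection += 1 if (a and b) else 0
            union += 1 if (a or b) else 0
    # Two empty masks have no union; report 0.0 rather than dividing by zero.
    return intersection / union if union else 0.0


# Hypothetical toy masks (illustration only).
predicted = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
ground_truth = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]

iou = mask_iou(predicted, ground_truth)
is_match_at_50 = iou >= 0.5
print(f"IoU = {iou:.3f}, counts as a match at the 0.5 threshold: {is_match_at_50}")
```

Here the masks share 3 pixels out of a union of 11, so the IoU is about 0.273 and the prediction would not count as a match at the 0.5 threshold. That a mask can overlap less than its box is one plausible reason the mask metrics for `horse` and `elephant` in the table above are lower than the box metrics for the same classes.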


## 3. Classification

YOLO11 _classification_ models use the `-cls` suffix (e.g. `yolo11n-cls.pt`) and are pretrained on ImageNet. See [Classification Docs](https://docs.ultralytics.com/tasks/classify/) for full details.


In [9]:
# Load YOLO11n-cls, train it on mnist160 for 3 epochs and predict an image with it
from ultralytics import YOLO

model = YOLO('yolo11n-cls.pt')  # load a pretrained YOLO classification model
model.train(data='mnist160', epochs=3)  # train the model
results3 = model('https://ultralytics.com/images/bus.jpg')  # predict on an image

results3[0].show()  # plot results

Downloading https://github.com/ultralytics/assets/releases/download/v8.3.0/yolo11n-cls.pt to 'yolo11n-cls.pt'...


100%|██████████| 5.52M/5.52M [00:00<00:00, 8.85MB/s]

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=classify, mode=train, model=yolo11n-cls.pt, data=mnist160, epochs=3, time=None, patience=100, batch=16, imgsz=224, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True,





Dataset not found , missing path E:\Programming\VSCodeProjects\datasets\mnist160, attempting download...
Downloading https://ultralytics.com/assets/mnist160.zip to 'E:\Programming\VSCodeProjects\datasets\mnist160.zip'...


100%|██████████| 70.0k/70.0k [00:00<00:00, 16.7MB/s]
Unzipping E:\Programming\VSCodeProjects\datasets\mnist160.zip to E:\Programming\VSCodeProjects\datasets\mnist160...: 100%|██████████| 184/184 [00:00<00:00, 2282.60file/s]

Dataset download success  (1.9s), saved to E:\Programming\VSCodeProjects\datasets\mnist160

train: E:\Programming\VSCodeProjects\datasets\mnist160\train... found 80 images in 10 classes  
val: None...
test: E:\Programming\VSCodeProjects\datasets\mnist160\test... found 80 images in 10 classes  
Overriding model.yaml nc=80 with nc=10

                   from  n    params  module                                       arguments                     
  0                  -1  1       464  ultralytics.nn.modules.conv.Conv             [3, 16, 3, 2]                 
  1                  -1  1      4672  ultralytics.nn.modules.conv.Conv             [16, 32, 3, 2]                
  2                  -1  1      6640  ultralytics.nn.modules.block.C3k2            [32, 64, 1, False, 0.25]      
  3                  -1  1     36992  ultralytics.nn.modules.conv.Conv             [64, 64, 3, 2]                
  4                  -1  1     26080  ultralytic




  9                  -1  1    249728  ultralytics.nn.modules.block.C2PSA           [256, 256, 1]                 
 10                  -1  1    343050  ultralytics.nn.modules.head.Classify         [256, 10]                     
YOLO11n-cls summary: 86 layers, 1,543,914 parameters, 1,543,914 gradients, 3.3 GFLOPs
Transferred 234/236 items from pretrained weights

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/3949ea4307d4474aa0088514b23382a8
COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\classify\train', view at http://localhost:6006/
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed 

train: Scanning E:\Programming\VSCodeProjects\datasets\mnist160\train... 80 images, 0 corrupt: 100%|██████████| 80/80 [00:00<00:00, 915.19it/s]
train: New cache created: E:\Programming\VSCodeProjects\datasets\mnist160\train.cache
val: Scanning E:\Programming\VSCodeProjects\datasets\mnist160\test... 80 images, 0 corrupt: 100%|██████████| 80/80 [00:00<00:00, 642.52it/s]
val: New cache created: E:\Programming\VSCodeProjects\datasets\mnist160\test.cache

optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... 
optimizer: AdamW(lr=0.000714, momentum=0.9) with parameter groups 39 weight(decay=0.0), 40 weight(decay=0.0005), 40 bias(decay=0.0)
TensorBoard: model graph visualization added 
Image sizes 224 train, 224 val
Using 8 dataloader workers
Logging results to runs\classify\train
Starting training for 3 epochs...

      Epoch    GPU_mem       loss  Instances       Size


        1/3     0.289G      2.511         16        224: 100%|██████████| 5/5 [00:08<00:00,  1.73s/it]
               classes   top1_acc   top5_acc: 100%|██████████| 3/3 [00:00<00:00, 16.76it/s]

                   all        0.1      0.488






      Epoch    GPU_mem       loss  Instances       Size


        2/3     0.318G      2.446         16        224: 100%|██████████| 5/5 [00:00<00:00, 14.89it/s]
               classes   top1_acc   top5_acc: 100%|██████████| 3/3 [00:00<00:00, 55.18it/s]

                   all      0.138      0.512






      Epoch    GPU_mem       loss  Instances       Size


        3/3     0.328G      2.372         16        224: 100%|██████████| 5/5 [00:00<00:00, 15.39it/s]
               classes   top1_acc   top5_acc: 100%|██████████| 3/3 [00:00<00:00, 53.21it/s]

                   all     0.0625      0.525






3 epochs completed in 0.004 hours.
Optimizer stripped from runs\classify\train\weights\last.pt, 3.2MB
Optimizer stripped from runs\classify\train\weights\best.pt, 3.2MB

Validating runs\classify\train\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-cls summary (fused): 47 layers, 1,538,834 parameters, 0 gradients, 3.2 GFLOPs
train: E:\Programming\VSCodeProjects\datasets\mnist160\train... found 80 images in 10 classes  
val: E:\Programming\VSCodeProjects\datasets\mnist160\test... found 80 images in 10 classes  
test: E:\Programming\VSCodeProjects\datasets\mnist160\test... found 80 images in 10 classes  


               classes   top1_acc   top5_acc: 100%|██████████| 3/3 [00:00<00:00, 38.68it/s]


                   all      0.138      0.512
Speed: 0.1ms preprocess, 0.6ms inference, 0.0ms loss, 0.0ms postprocess per image
Results saved to runs\classify\train

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : delicious_prison_4251
COMET INFO:     url                   : https://www.comet.com/qalccu/general/3949ea4307d4474aa0088514b23382a8
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]                : (2.856e-05, 4.3054200000000005e-05)
COMET INFO:     lr/pg1 [7]                : (2.856e-05, 4.3054200000000005e-05)
COMET INFO:     lr/pg2 [7]                : (2.856e-05, 4.3054200000000005e-05)
COMET INFO:     metrics/accuracy_top1 [8] : (0.0625,


Found https://ultralytics.com/images/bus.jpg locally at bus.jpg
image 1/1 e:\Programming\VSCodeProjects\bus.jpg: 224x224 6 0.38, 9 0.11, 8 0.10, 0 0.08, 2 0.08, 7.0ms
Speed: 8.0ms preprocess, 7.0ms inference, 0.1ms postprocess per image at shape (1, 3, 224, 224)
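
The `top1_acc` and `top5_acc` columns in the classification log report how often the true label is the highest-scoring class, or among the five highest-scoring classes. A minimal sketch of that computation follows. `top_k_accuracy` is a hypothetical helper, and the scores and labels are made up for illustration rather than taken from the run above.

```python
def top_k_accuracy(probabilities, labels, k):
    """Fraction of samples whose true label is among the k highest-scoring classes."""
    hits = 0
    for probs, label in zip(probabilities, labels):
        # Class indices ordered from highest to lowest score.
        ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
        if label in ranked[:k]:
            hits += 1
    return hits / len(labels)


# Hypothetical scores for 4 samples over 3 classes (illustration only).
probabilities = [
    [0.7, 0.2, 0.1],
    [0.1, 0.3, 0.6],
    [0.4, 0.5, 0.1],
    [0.2, 0.3, 0.5],
]
labels = [0, 1, 0, 0]

top1 = top_k_accuracy(probabilities, labels, k=1)
top2 = top_k_accuracy(probabilities, labels, k=2)
print(top1, top2)
```

With ten classes, and assuming they are balanced, random guessing gives roughly 0.1 top-1 and 0.5 top-5 accuracy. The logged values (top-1 between 0.0625 and 0.138, top-5 between 0.488 and 0.525) sit close to that baseline, which suggests that three epochs on 80 images were not enough for the model to learn the digits. The prediction on `bus.jpg` (class `6` at 0.38) should be read in that light: the image is not a digit, so the output is not a meaningful classification.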


## 4. Pose

YOLO11 _pose_ models use the `-pose` suffix (e.g. `yolo11n-pose.pt`) and are pretrained on COCO Keypoints. See [Pose Docs](https://docs.ultralytics.com/tasks/pose/) for full details.

In [10]:
# Load YOLO11n-pose, train it on COCO8-pose for 3 epochs and predict an image with it
from ultralytics import YOLO

model = YOLO('yolo11n-pose.pt')  # load a pretrained YOLO pose model
model.train(data='coco8-pose.yaml', epochs=3)  # train the model
results4 = model('https://ultralytics.com/images/bus.jpg')  # predict on an image

results4[0].show()  # plot results

Downloading https://github.com/ultralytics/assets/releases/download/v8.3.0/yolo11n-pose.pt to 'yolo11n-pose.pt'...


100%|██████████| 5.97M/5.97M [00:00<00:00, 13.6MB/s]


Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=pose, mode=train, model=yolo11n-pose.pt, data=coco8-pose.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=T

100%|██████████| 334k/334k [00:00<00:00, 11.2MB/s]
Unzipping E:\Programming\VSCodeProjects\datasets\coco8-pose.zip to E:\Programming\VSCodeProjects\datasets\coco8-pose...: 100%|██████████| 27/27 [00:00<00:00, 1855.31file/s]

Dataset download success  (1.9s), saved to E:\Programming\VSCodeProjects\datasets

Overriding model.yaml nc=80 with nc=1

                   from  n    params  module                                       arguments                     
  0                  -1  1       464  ultralytics.nn.modules.conv.Conv             [3, 16, 3, 2]                 
  1                  -1  1      4672  ultralytics.nn.modules.conv.Conv             [16, 32, 3, 2]                
  2                  -1  1      6640  ultralytics.nn.modules.block.C3k2            [32, 64, 1, False, 0.25]      
  3                  -1  1     36992  ultralytics.nn.modules.conv.Conv             [64, 64, 3, 2]                
  4                  -1  1     26080  ultralytics.nn.modules.block.C3k2            [64, 128, 1, False, 0.25]     
  5                  -1  1    147712  ultralytics.nn.modules.conv.Conv             [128, 128, 3, 2]              
  6                  -1  1     87040  ultralytics.nn.modules.block.C3k2 




YOLO11n-pose summary: 196 layers, 2,874,462 parameters, 2,874,446 gradients, 7.5 GFLOPs

Transferred 541/541 items from pretrained weights

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/5c631cff6c38477b86d6493072d8d88a
COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\pose\train', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed 

train: Scanning E:\Programming\VSCodeProjects\datasets\coco8-pose\labels\train... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<00:00, 273.22it/s]
train: New cache created: E:\Programming\VSCodeProjects\datasets\coco8-pose\labels\train.cache
val: Scanning E:\Programming\VSCodeProjects\datasets\coco8-pose\labels\val... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<00:00, 106.47it/s]
val: New cache created: E:\Programming\VSCodeProjects\datasets\coco8-pose\labels\val.cache

Plotting labels to runs\pose\train\labels.jpg... 
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... 
optimizer: AdamW(lr=0.002, momentum=0.9) with parameter groups 87 weight(decay=0.0), 97 weight(decay=0.0005), 96 bias(decay=0.0)
TensorBoard: model graph visualization added 
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\pose\train
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


        1/3      0.67G      1.092      2.073     0.2519     0.5446      1.196         16        640: 100%|██████████| 1/1 [00:00<00:00,  2.68it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.00it/s]


                   all          4         14      0.866      0.925      0.916      0.715          1      0.692      0.722      0.352

      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


        2/3      0.68G      1.239      4.342      0.351      0.978      1.284         19        640: 100%|██████████| 1/1 [00:00<00:00,  6.21it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.98it/s]

                   all          4         14      0.868      0.857      0.911      0.698      0.983      0.643       0.71      0.367






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


        3/3      0.68G       1.46      3.444     0.4112      1.201       1.45         13        640: 100%|██████████| 1/1 [00:00<00:00,  6.55it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.83it/s]

                   all          4         14      0.841      0.929      0.916      0.697      0.986      0.643       0.71      0.365






3 epochs completed in 0.003 hours.
Optimizer stripped from runs\pose\train\weights\last.pt, 6.1MB
Optimizer stripped from runs\pose\train\weights\best.pt, 6.1MB

Validating runs\pose\train\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-pose summary (fused): 109 layers, 2,866,468 parameters, 0 gradients, 7.4 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00, 12.98it/s]


                   all          4         14      0.866      0.924      0.916      0.716          1      0.692      0.722      0.362
Speed: 0.8ms preprocess, 7.4ms inference, 0.0ms loss, 1.7ms postprocess per image
Results saved to runs\pose\train

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : salmon_egret_421
COMET INFO:     url                   : https://www.comet.com/qalccu/general/5c631cff6c38477b86d6493072d8d88a
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]               : (0.0, 1.36e-05)
COMET INFO:     lr/pg1 [7]               : (0.0, 1.36e-05)
COMET INFO:     lr/pg2 [7]               : (0.0, 1.36e-05)
COMET INFO:     metrics/mAP50(B) [8]     : (0.91091, 0.9162927847346453)
COMET INFO:     metrics/mAP50(P


Found https://ultralytics.com/images/bus.jpg locally at bus.jpg
image 1/1 e:\Programming\VSCodeProjects\bus.jpg: 640x480 4 persons, 32.7ms
Speed: 2.3ms preprocess, 32.7ms inference, 3.5ms postprocess per image at shape (1, 3, 640, 480)
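
Each run above logs `'optimizer=auto' found, ignoring 'lr0=0.01'` and then reports a different AdamW learning rate: 0.000119 for detection and segmentation, 0.000714 for classification, 0.002 for pose. These values are consistent with a rate that depends only on the number of classes `nc`. The sketch below uses the formula `round(0.002 * 5 / (4 + nc), 6)`, which is believed to mirror the heuristic the Ultralytics trainer applies on short runs. Treat the formula as an assumption to check against the installed version; `auto_lr` is a hypothetical helper name.

```python
def auto_lr(num_classes):
    """Learning rate that reproduces the logged AdamW values (assumed formula)."""
    return round(0.002 * 5 / (4 + num_classes), 6)


# nc -> learning rate copied from the optimizer log lines above.
# nc=80: COCO detection/segmentation, nc=10: mnist160, nc=1: coco8-pose.
logged = {80: 0.000119, 10: 0.000714, 1: 0.002}

for nc, logged_lr in logged.items():
    print(f"nc={nc:>2}  formula={auto_lr(nc)}  logged={logged_lr}")
```

The formula reproduces all three logged rates. The OBB run later in this notebook logs `lr=0.000526`, which the same formula gives for `nc=15`; that fits if `dota8.yaml` defines 15 classes, which the log itself does not show. To use the `lr0` and `momentum` values from the configuration instead, pass an explicit optimizer name to `model.train` (for example `optimizer='SGD'`) rather than leaving it at `auto`.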


## 5. Oriented Bounding Boxes (OBB)

YOLO11 _OBB_ models use the `-obb` suffix (e.g. `yolo11n-obb.pt`) and are pretrained on the DOTA dataset. See [OBB Docs](https://docs.ultralytics.com/tasks/obb/) for full details.

In [12]:
# Load YOLO11n-obb, train it on DOTA8 for 3 epochs and predict an image with it
from ultralytics import YOLO

model = YOLO('yolo11n-obb.pt')  # load a pretrained YOLO OBB model
model.train(data='dota8.yaml', epochs=3)  # train the model
results5 = model('https://ultralytics.com/images/boats.jpg')  # predict on an image (new name so the pose results in results4 are not overwritten)

results5[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=obb, mode=train, model=yolo11n-obb.pt, data=dota8.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=1024, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train2, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, show_boxes=True, 

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/ce96244cf4114de9a6368e0895f9a8ad
COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\obb\train2', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed 

train: Scanning E:\Programming\VSCodeProjects\datasets\dota8\labels\train.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\dota8\labels\val.cache... 4 images, 0 backgrounds, 0 corrupt: 100%|██████████| 4/4 [00:00<?, ?it/s]

Plotting labels to runs\obb\train2\labels.jpg... 
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically... 
optimizer: AdamW(lr=0.000526, momentum=0.9) with parameter groups 87 weight(decay=0.0), 97 weight(decay=0.0005), 96 bias(decay=0.0)
TensorBoard: model graph visualization added 
Image sizes 1024 train, 1024 val
Using 8 dataloader workers
Logging results to runs\obb\train2
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        1/3      1.56G     0.8721     0.4506      1.447         69       1024: 100%|██████████| 1/1 [00:00<00:00,  3.28it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.33it/s]

                   all          4          8      0.942          1      0.995      0.804






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        2/3      1.79G     0.8769     0.5377      1.705        163       1024: 100%|██████████| 1/1 [00:00<00:00,  3.00it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.47it/s]

                   all          4          8      0.945          1      0.995      0.804






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        3/3      1.79G     0.7338     0.5788      1.095        136       1024: 100%|██████████| 1/1 [00:00<00:00,  3.79it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.08it/s]

                   all          4          8      0.945          1      0.995      0.804






3 epochs completed in 0.003 hours.
Optimizer stripped from runs\obb\train2\weights\last.pt, 5.9MB
Optimizer stripped from runs\obb\train2\weights\best.pt, 5.9MB

Validating runs\obb\train2\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-obb summary (fused): 109 layers, 2,656,648 parameters, 0 gradients, 6.6 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.84it/s]


                   all          4          8      0.945          1      0.995      0.804
      baseball diamond          3          4      0.899          1      0.995       0.85
      basketball court          1          3      0.971          1      0.995      0.866
     soccer ball field          1          1      0.964          1      0.995      0.697
Speed: 1.1ms preprocess, 15.6ms inference, 0.0ms loss, 5.6ms postprocess per image
Results saved to runs\obb\train2


[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m Comet.ml Experiment Summary
[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m   Data:
[1;38;5;39mCOMET INFO:[0m     display_summary_level : 1
[1;38;5;39mCOMET INFO:[0m     name                  : experienced_primer_7916
[1;38;5;39mCOMET INFO:[0m     url                   : https://www.comet.com/qalccu/general/ce96244cf4114de9a6368e0895f9a8ad
[1;38;5;39mCOMET INFO:[0m   Metrics [count] (min, max):
[1;38;5;39mCOMET INFO:[0m     lr/pg0 [7]               : (0.0, 3.5768e-06)
[1;38;5;39mCOMET INFO:[0m     lr/pg1 [7]               : (0.0, 3.5768e-06)
[1;38;5;39mCOMET INFO:[0m     lr/pg2 [7]               : (0.0, 3.5768e-06)
[1;38;5;39mCOMET INFO:[0m     metrics/mAP50(B)         : 0.995
[1;38;5;39mCOMET INFO:[0m     metrics/mAP50-95(B) [8]  :


Found https://ultralytics.com/images/boats.jpg locally at boats.jpg
image 1/1 e:\Programming\VSCodeProjects\boats.jpg: 576x1024 13.5ms
Speed: 4.6ms preprocess, 13.5ms inference, 7.6ms postprocess per image at shape (1, 3, 576, 1024)
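
The optimizer line in the log above reports `AdamW(lr=0.000526, momentum=0.9)` even though the configured default is `lr0=0.01`. A minimal sketch of the arithmetic that appears to produce this value, assuming the `optimizer=auto` heuristic in this Ultralytics version is `round(0.002 * 5 / (4 + nc), 6)` with `nc` the number of classes. The function name `auto_lr` is hypothetical, and `nc=15` for dota8 comes from the DOTA class list rather than from this log:

```python
def auto_lr(nc: int) -> float:
    """Assumed 'optimizer=auto' learning-rate heuristic for short AdamW runs."""
    return round(0.002 * 5 / (4 + nc), 6)


print(auto_lr(15))  # dota8 (15 classes) -> 0.000526
print(auto_lr(2))   # -> 0.001667
```

Under the same assumption, the `lr=0.001667` reported by the Drowsiness runs later in this notebook would correspond to `nc=2`. The validation tables only list `drowsy`, so a second class is an inference, not something the log states.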


# Appendix

Additional content below.

In [None]:
# Pip install from source
!pip install git+https://github.com/ultralytics/ultralytics@main

In [None]:
# Git clone the main branch and install it in editable mode
!git clone https://github.com/ultralytics/ultralytics -b main
%pip install -qe ultralytics

In [None]:
# Run tests (Git clone only)
!pytest ultralytics/tests

In [None]:
# Validate multiple models
for x in 'nsmlx':
  !yolo val model=yolo11{x}.pt data=coco.yaml
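
The loop above relies on IPython's `!` shell escape, so it only runs inside a notebook. A rough equivalent using the Python API is sketched below. `model_names` and `validate_all` are hypothetical helper names, and the `ultralytics` import is deferred so that nothing is downloaded until `validate_all` is actually called:

```python
def model_names(sizes: str = "nsmlx", family: str = "yolo11") -> list[str]:
    """Build the checkpoint names used in the shell loop, e.g. 'yolo11n.pt'."""
    return [f"{family}{size}.pt" for size in sizes]


def validate_all(data: str = "coco.yaml") -> dict:
    """Validate every model size on one dataset and collect the metrics objects."""
    from ultralytics import YOLO  # deferred import: requires `pip install ultralytics`

    return {name: YOLO(name).val(data=data) for name in model_names()}


print(model_names())
# ['yolo11n.pt', 'yolo11s.pt', 'yolo11m.pt', 'yolo11l.pt', 'yolo11x.pt']
```

Collecting the results in a dictionary keeps each model's metrics available for a side-by-side comparison after the loop finishes.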

# Supplementary Exercise

In [None]:
!pip install roboflow

from roboflow import Roboflow
rf = Roboflow(api_key="0nxjiKWHzZ4uQ3XnGWsd")
project = rf.workspace("test-3exll").project("drowsiness-h1xmb-pofma")
version = project.version(1)
dataset = version.download("yolov11")

# Note: after the download finishes, the dataset folder is moved into datasets/
# (the training cells below read datasets/Drowsiness-1/data.yaml).

Collecting roboflow
  Downloading roboflow-1.1.61-py3-none-any.whl.metadata (9.7 kB)
Collecting idna==3.7 (from roboflow)
  Downloading idna-3.7-py3-none-any.whl.metadata (9.9 kB)
Collecting opencv-python-headless==4.10.0.84 (from roboflow)
  Downloading opencv_python_headless-4.10.0.84-cp37-abi3-win_amd64.whl.metadata (20 kB)
Collecting pillow-heif>=0.18.0 (from roboflow)
  Downloading pillow_heif-0.22.0-cp312-cp312-win_amd64.whl.metadata (9.8 kB)
Collecting python-dotenv (from roboflow)
  Downloading python_dotenv-1.1.0-py3-none-any.whl.metadata (24 kB)
Downloading roboflow-1.1.61-py3-none-any.whl (85 kB)
Downloading idna-3.7-py3-none-any.whl (66 kB)
Downloading opencv_python_headless-4.10.0.84-cp37-abi3-win_amd64.whl (38.8 MB)

Downloading Dataset Version Zip in Drowsiness-1 to yolov11:: 100%|██████████| 15853/15853 [00:07<00:00, 2112.71it/s]





Extracting Dataset Version Zip to Drowsiness-1 in yolov11:: 100%|██████████| 784/784 [00:00<00:00, 2189.40it/s]
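
The note in the cell above says the downloaded dataset is moved into a `datasets` directory, which is where the later training cells look for `datasets/Drowsiness-1/data.yaml`. One way to script that step is sketched here with `shutil.move`. The helper name `move_into_datasets` is hypothetical, and the source folder is assumed to be the `Drowsiness-1` directory that the download creates in the working directory:

```python
import shutil
from pathlib import Path


def move_into_datasets(src: str, datasets_root: str = "datasets") -> Path:
    """Move a downloaded dataset folder under datasets_root and return its new path."""
    source = Path(src)
    target = Path(datasets_root) / source.name
    if target.exists():
        raise FileExistsError(f"{target} already exists; remove it or pick another root")
    target.parent.mkdir(parents=True, exist_ok=True)
    shutil.move(str(source), str(target))
    return target


# Example (assumed folder name): move_into_datasets("Drowsiness-1")
```

Refusing to overwrite an existing target avoids silently mixing two dataset versions in one folder.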


In [None]:
# Detection with YOLO11
from ultralytics import YOLO

model = YOLO('yolo11n.pt')  # load a pretrained YOLO detection model
model.train(data='datasets/Drowsiness-1/data.yaml', epochs=3)  # train the model
results = model('datasets/Drowsiness-1/test/images/sleepdec38_jpg.rf.2fb782f58c4cd73172d1d88d602c2739.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=detect, mode=train, model=yolo11n.pt, data=datasets/Drowsiness-1/data.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train16, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=Tru

[1;38;5;39mCOMET INFO:[0m Experiment is live on comet.com https://www.comet.com/qalccu/general/45879137f9c04ec5a95d0080becdd4ee

[1;38;5;39mCOMET INFO:[0m Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.


[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs\detect\train16', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed
train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-1\train\labels.cache... 366 images, 0 backgrounds, 0 corrupt: 100%|██████████| 366/366 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-1\valid\labels.cache... 11 images, 0 backgrounds, 0 corrupt: 100%|██████████| 11/11 [00:00<?, ?it/s]
Plotting labels to runs\detect\train16\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 81 weight(decay=0.0), 88 weight(decay=0.0005), 87 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\detect\train16
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        1/3      2.26G      1.616      3.201      1.802         37        640: 100%|██████████| 23/23 [00:06<00:00,  3.31it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.50it/s]

                   all         11         11    0.00587          1      0.915      0.367






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        2/3      2.28G      1.284      2.335      1.537         28        640: 100%|██████████| 23/23 [00:06<00:00,  3.73it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.48it/s]

                   all         11         11    0.00658          1      0.835      0.453






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        3/3      2.29G      1.251      2.008      1.437         38        640: 100%|██████████| 23/23 [00:06<00:00,  3.44it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  2.19it/s]

                   all         11         11      0.578      0.364      0.695      0.407






3 epochs completed in 0.008 hours.
Optimizer stripped from runs\detect\train16\weights\last.pt, 5.5MB
Optimizer stripped from runs\detect\train16\weights\best.pt, 5.5MB

Validating runs\detect\train16\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,582,542 parameters, 0 gradients, 6.3 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.16it/s]


                   all         11         11    0.00661          1      0.835      0.448
                drowsy         11         11    0.00661          1      0.835      0.448
Speed: 0.6ms preprocess, 3.8ms inference, 0.0ms loss, 3.3ms postprocess per image
Results saved to runs\detect\train16


[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m Comet.ml Experiment Summary
[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m   Data:
[1;38;5;39mCOMET INFO:[0m     display_summary_level : 1
[1;38;5;39mCOMET INFO:[0m     name                  : territorial_mandarine_4542
[1;38;5;39mCOMET INFO:[0m     url                   : https://www.comet.com/qalccu/general/45879137f9c04ec5a95d0080becdd4ee
[1;38;5;39mCOMET INFO:[0m   Metrics [count] (min, max):
[1;38;5;39mCOMET INFO:[0m     lr/pg0 [7]               : (0.00036674000000000006, 0.0005026005)
[1;38;5;39mCOMET INFO:[0m     lr/pg1 [7]               : (0.00036674000000000006, 0.0005026005)
[1;38;5;39mCOMET INFO:[0m     lr/pg2 [7]               : (0.00036674000000000006, 0.0005026005)
[1;38;5;39mCOMET INFO:[0m     metrics/mAP50(B) [8]     


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-1\test\images\sleepdec38_jpg.rf.2fb782f58c4cd73172d1d88d602c2739.jpg: 640x640 (no detections), 11.2ms
Speed: 2.0ms preprocess, 11.2ms inference, 0.6ms postprocess per image at shape (1, 3, 640, 640)
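
The validation table above reports recall 1 with precision near 0.007, yet the prediction on the test image returns `(no detections)`. One plausible reading is that the model's boxes all carry very low confidence. Validation keeps low-scoring boxes (Ultralytics validates with a default confidence threshold of 0.001), whereas prediction defaults to 0.25. The sketch below illustrates the effect with made-up scores. `filter_by_conf` and the score values are hypothetical and are not taken from this run:

```python
def filter_by_conf(scores: list[float], conf: float) -> list[float]:
    """Keep only detections whose confidence is at least `conf`."""
    return [s for s in scores if s >= conf]


# Hypothetical confidences for one image from an undertrained model.
scores = [0.04, 0.02, 0.011, 0.003]

print(len(filter_by_conf(scores, conf=0.001)))  # validation-style threshold -> 4
print(len(filter_by_conf(scores, conf=0.25)))   # prediction-style threshold -> 0
```

If that is what is happening here, passing a lower threshold to the prediction call, for example `model(path, conf=0.05)`, would surface the weak boxes at the cost of more false positives.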


In [3]:
# Detection, with 12 epochs instead
from ultralytics import YOLO

model = YOLO('yolo11n.pt')  # load a pretrained YOLO detection model
model.train(data='datasets/Drowsiness-1/data.yaml', epochs=12)  # train the model
results = model('datasets/Drowsiness-1/test/images/sleepdec38_jpg.rf.2fb782f58c4cd73172d1d88d602c2739.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=detect, mode=train, model=yolo11n.pt, data=datasets/Drowsiness-1/data.yaml, epochs=12, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train17, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=Tr

[1;38;5;39mCOMET INFO:[0m Experiment is live on comet.com https://www.comet.com/qalccu/general/3ef06b7602504433b78f503b52a3b266

[1;38;5;39mCOMET INFO:[0m Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.


[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs\detect\train17', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed
train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-1\train\labels.cache... 366 images, 0 backgrounds, 0 corrupt: 100%|██████████| 366/366 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-1\valid\labels.cache... 11 images, 0 backgrounds, 0 corrupt: 100%|██████████| 11/11 [00:00<?, ?it/s]
Plotting labels to runs\detect\train17\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 81 weight(decay=0.0), 88 weight(decay=0.0005), 87 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\detect\train17
Starting training for 12 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       1/12      2.29G      1.616      3.201      1.802         37        640: 100%|██████████| 23/23 [00:07<00:00,  3.09it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.68it/s]

                   all         11         11    0.00587          1      0.915      0.367






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       2/12      2.29G      1.332      2.369      1.573         28        640: 100%|██████████| 23/23 [00:06<00:00,  3.68it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.01it/s]

                   all         11         11     0.0068          1      0.871      0.312





Closing dataloader mosaic

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       3/12      2.29G      1.281       2.53      1.633         16        640: 100%|██████████| 23/23 [00:04<00:00,  4.71it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.07it/s]

                   all         11         11    0.00671          1      0.359      0.171






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       4/12      2.29G      1.213       2.21      1.484         14        640: 100%|██████████| 23/23 [00:05<00:00,  4.41it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.79it/s]

                   all         11         11       0.59      0.455      0.612      0.304






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       5/12       2.3G      1.204      2.041      1.452         14        640: 100%|██████████| 23/23 [00:04<00:00,  5.38it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  9.19it/s]

                   all         11         11      0.762      0.545      0.791      0.548






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       6/12      2.32G        1.1      1.863      1.355         16        640: 100%|██████████| 23/23 [00:04<00:00,  5.35it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  9.73it/s]

                   all         11         11      0.879      0.818      0.908      0.566






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       7/12      2.33G      1.098      1.618       1.34         16        640: 100%|██████████| 23/23 [00:04<00:00,  5.24it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.41it/s]

                   all         11         11      0.984      0.818      0.939       0.64






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       8/12      2.35G     0.9852      1.466       1.25         15        640: 100%|██████████| 23/23 [00:04<00:00,  5.12it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.49it/s]

                   all         11         11      0.898      0.803      0.904      0.708






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


       9/12      2.35G     0.9695      1.311      1.218         16        640: 100%|██████████| 23/23 [00:04<00:00,  5.21it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  9.65it/s]

                   all         11         11       0.98          1      0.995      0.767






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


      10/12      2.35G     0.9122      1.233      1.191         15        640: 100%|██████████| 23/23 [00:04<00:00,  4.69it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.26it/s]

                   all         11         11       0.92          1      0.995      0.736






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


      11/12      2.35G     0.8316      1.123      1.122         15        640: 100%|██████████| 23/23 [00:04<00:00,  4.98it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  7.93it/s]

                   all         11         11      0.993          1      0.995      0.704






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


      12/12      2.35G      0.768      1.047       1.09         14        640: 100%|██████████| 23/23 [00:04<00:00,  5.04it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  9.09it/s]

                   all         11         11      0.995          1      0.995      0.729






12 epochs completed in 0.029 hours.
Optimizer stripped from runs\detect\train17\weights\last.pt, 5.5MB
Optimizer stripped from runs\detect\train17\weights\best.pt, 5.5MB

Validating runs\detect\train17\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n summary (fused): 100 layers, 2,582,542 parameters, 0 gradients, 6.3 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  8.81it/s]


                   all         11         11      0.981          1      0.995      0.767
                drowsy         11         11      0.981          1      0.995      0.767
Speed: 0.3ms preprocess, 2.7ms inference, 0.0ms loss, 1.8ms postprocess per image
Results saved to runs\detect\train17


[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m Comet.ml Experiment Summary
[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m   Data:
[1;38;5;39mCOMET INFO:[0m     display_summary_level : 1
[1;38;5;39mCOMET INFO:[0m     name                  : alternative_pomelo_7008
[1;38;5;39mCOMET INFO:[0m     url                   : https://www.comet.com/qalccu/general/3ef06b7602504433b78f503b52a3b266
[1;38;5;39mCOMET INFO:[0m   Metrics [count] (min, max):
[1;38;5;39mCOMET INFO:[0m     lr/pg0 [25]               : (0.00015419750000000005, 0.0011415199250000001)
[1;38;5;39mCOMET INFO:[0m     lr/pg1 [25]               : (0.00015419750000000005, 0.0011415199250000001)
[1;38;5;39mCOMET INFO:[0m     lr/pg2 [25]               : (0.00015419750000000005, 0.0011415199250000001)
[1;38;5;39mCOMET INFO:[0m   


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-1\test\images\sleepdec38_jpg.rf.2fb782f58c4cd73172d1d88d602c2739.jpg: 640x640 3 drowsys, 14.8ms
Speed: 2.8ms preprocess, 14.8ms inference, 2.5ms postprocess per image at shape (1, 3, 640, 640)
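
The final validation of `best.pt` above reports an mAP50-95 of 0.767, which matches epoch 9 rather than the last epoch (0.729). A sketch of how that checkpoint could be selected, assuming the model-selection score is the weighted sum `0.1 * mAP50 + 0.9 * mAP50-95`. The metric pairs are copied from the per-epoch rows of this log, and `fitness` is a hypothetical helper name:

```python
# (mAP50, mAP50-95) per epoch, copied from the 12-epoch detection log.
epoch_metrics = {
    1: (0.915, 0.367), 2: (0.871, 0.312), 3: (0.359, 0.171), 4: (0.612, 0.304),
    5: (0.791, 0.548), 6: (0.908, 0.566), 7: (0.939, 0.640), 8: (0.904, 0.708),
    9: (0.995, 0.767), 10: (0.995, 0.736), 11: (0.995, 0.704), 12: (0.995, 0.729),
}


def fitness(map50: float, map50_95: float) -> float:
    """Assumed model-selection score: 10% mAP50 plus 90% mAP50-95."""
    return 0.1 * map50 + 0.9 * map50_95


best_epoch = max(epoch_metrics, key=lambda e: fitness(*epoch_metrics[e]))
print(best_epoch)  # 9
```

Because mAP50 saturates at 0.995 from epoch 9 onward, the ranking among the last four epochs is decided entirely by mAP50-95.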


In [5]:
from roboflow import Roboflow

rf = Roboflow(api_key="0nxjiKWHzZ4uQ3XnGWsd")
project = rf.workspace("test-3exll").project("drowsiness-h1xmb-pofma")
version = project.version(2)
dataset = version.download("yolov11")

loading Roboflow workspace...
loading Roboflow project...


Downloading Dataset Version Zip in Drowsiness-2 to yolov11:: 100%|██████████| 15965/15965 [00:02<00:00, 6957.14it/s] 





Extracting Dataset Version Zip to Drowsiness-2 in yolov11:: 100%|██████████| 784/784 [00:00<00:00, 2086.88it/s]


In [7]:
# Segmentation with YOLO11
from ultralytics import YOLO

model = YOLO('yolo11n-seg.pt')  # load a pretrained YOLO Segmentation model
model.train(data='datasets/Drowsiness-2/data.yaml', epochs=3)  # train the model
results = model('datasets/Drowsiness-2/test/images/sleepdec38_jpg.rf.143ada9487272792e1f5ed39cd33e7f1.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=segment, mode=train, model=yolo11n-seg.pt, data=datasets/Drowsiness-2/data.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train4, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf

[1;38;5;39mCOMET INFO:[0m An experiment with the same configuration options is already running and will be reused.


[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs\segment\train4', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed
train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-2\train\labels... 363 images, 0 backgrounds, 0 corrupt: 100%|██████████| 363/363 [00:00<00:00, 806.68it/s]
train: New cache created: E:\Programming\VSCodeProjects\datasets\Drowsiness-2\train\labels.cache
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-2\valid\labels... 11 images, 0 backgrounds, 0 corrupt: 100%|██████████| 11/11 [00:00<00:00, 200.57it/s]
val: New cache created: E:\Programming\VSCodeProjects\datasets\Drowsiness-2\valid\labels.cache
Plotting labels to runs\segment\train4\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 90 weight(decay=0.0), 101 weight(decay=0.0005), 100 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\segment\train4
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


        1/3      5.13G      1.076      2.304      2.913      1.302         24        640: 100%|██████████| 23/23 [00:07<00:00,  3.06it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.11it/s]

                   all         11         11     0.0057          1      0.904      0.626     0.0057          1      0.967      0.574






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


        2/3      5.15G     0.8436      1.267      1.838      1.156         21        640: 100%|██████████| 23/23 [00:05<00:00,  3.86it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  5.98it/s]

                   all         11         11    0.00625          1      0.868      0.497    0.00625          1      0.868      0.526






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


        3/3      5.17G     0.8291      1.139      1.571       1.11         26        640: 100%|██████████| 23/23 [00:06<00:00,  3.83it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.19it/s]

                   all         11         11          1       0.75      0.995      0.701          1       0.75      0.995      0.667






3 epochs completed in 0.008 hours.
Optimizer stripped from runs\segment\train4\weights\last.pt, 6.0MB
Optimizer stripped from runs\segment\train4\weights\best.pt, 6.0MB

Validating runs\segment\train4\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-seg summary (fused): 113 layers, 2,834,958 parameters, 0 gradients, 10.2 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.30it/s]


                   all         11         11          1       0.75      0.995      0.701          1       0.75      0.995      0.667
                drowsy         11         11          1       0.75      0.995      0.701          1       0.75      0.995      0.667
Speed: 0.3ms preprocess, 3.4ms inference, 0.0ms loss, 2.4ms postprocess per image
Results saved to runs\segment\train4


[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m Comet.ml Experiment Summary
[1;38;5;39mCOMET INFO:[0m ---------------------------------------------------------------------------------------
[1;38;5;39mCOMET INFO:[0m   Data:
[1;38;5;39mCOMET INFO:[0m     display_summary_level : 1
[1;38;5;39mCOMET INFO:[0m     name                  : ideological_rabbit_1166
[1;38;5;39mCOMET INFO:[0m     url                   : https://www.comet.com/qalccu/general/b082eebf016f420a9d38637e5d12e2f0
[1;38;5;39mCOMET INFO:[0m   Metrics [count] (min, max):
[1;38;5;39mCOMET INFO:[0m     lr/pg0 [7]               : (0.00036674000000000006, 0.0005026005)
[1;38;5;39mCOMET INFO:[0m     lr/pg1 [7]               : (0.00036674000000000006, 0.0005026005)
[1;38;5;39mCOMET INFO:[0m     lr/pg2 [7]               : (0.00036674000000000006, 0.0005026005)
[1;38;5;39mCOMET INFO:[0m     metrics/mAP50(B) [8]     : (


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-2\test\images\sleepdec38_jpg.rf.143ada9487272792e1f5ed39cd33e7f1.jpg: 640x640 (no detections), 12.6ms
Speed: 2.1ms preprocess, 12.6ms inference, 0.5ms postprocess per image at shape (1, 3, 640, 640)
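
The 12-epoch detection run earlier logs `Closing dataloader mosaic` just before epoch 3, while the 3-epoch runs, including the one above, never print it. This is consistent with `close_mosaic=10` meaning that mosaic augmentation is switched off for the final 10 epochs. The sketch below reproduces that arithmetic, together with the `23/23` batch count shown in the progress bars. Both helper names are hypothetical, and the trigger condition is an assumption inferred from the log rather than quoted from the trainer source:

```python
import math


def mosaic_close_epoch(epochs: int, close_mosaic: int = 10):
    """Return the 1-based epoch at which mosaic is disabled, or None if it never is."""
    start = epochs - close_mosaic  # 0-based index of the first epoch without mosaic
    return start + 1 if start >= 0 and close_mosaic > 0 else None


def batches_per_epoch(num_images: int, batch_size: int = 16) -> int:
    """Number of batches per epoch when the last partial batch is kept."""
    return math.ceil(num_images / batch_size)


print(mosaic_close_epoch(12))  # 3
print(mosaic_close_epoch(3))   # None
print(batches_per_epoch(366))  # 23
print(batches_per_epoch(363))  # 23
```

With only 3 epochs, every epoch therefore trains with mosaic enabled, which is one more difference between the short and long runs besides the number of optimizer steps.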


In [8]:
# Segmentation, with 12 epochs instead
from ultralytics import YOLO

model = YOLO('yolo11n-seg.pt')  # load a pretrained YOLO Segmentation model
model.train(data='datasets/Drowsiness-2/data.yaml', epochs=12)  # train the model
results = model('datasets/Drowsiness-2/test/images/sleepdec38_jpg.rf.143ada9487272792e1f5ed39cd33e7f1.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
[34m[1mengine\trainer: [0mtask=segment, mode=train, model=yolo11n-seg.pt, data=datasets/Drowsiness-2/data.yaml, epochs=12, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train5, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_con

[1;38;5;39mCOMET INFO:[0m Experiment is live on comet.com https://www.comet.com/qalccu/general/0335feaffd3b4c58beeb8a0a3d35ffaa

[1;38;5;39mCOMET INFO:[0m Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.


[34m[1mTensorBoard: [0mStart with 'tensorboard --logdir runs\segment\train5', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed
train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-2\train\labels.cache... 363 images, 0 backgrounds, 0 corrupt: 100%|██████████| 363/363 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-2\valid\labels.cache... 11 images, 0 backgrounds, 0 corrupt: 100%|██████████| 11/11 [00:00<?, ?it/s]
Plotting labels to runs\segment\train5\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 90 weight(decay=0.0), 101 weight(decay=0.0005), 100 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\segment\train5
Starting training for 12 epochs...

      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       1/12      5.06G      1.076      2.304      2.913      1.302         24        640: 100%|██████████| 23/23 [00:07<00:00,  3.10it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  5.83it/s]

                   all         11         11     0.0057          1      0.904      0.626     0.0057          1      0.967      0.574






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       2/12      5.08G     0.8751      1.298      1.859      1.165         21        640: 100%|██████████| 23/23 [00:07<00:00,  3.25it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.00it/s]

                   all         11         11    0.00611          1       0.54      0.147    0.00611          1      0.554      0.176





Closing dataloader mosaic

      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       3/12      5.08G     0.9044       1.18      1.976      1.223         11        640: 100%|██████████| 23/23 [00:06<00:00,  3.57it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.19it/s]

                   all         11         11      0.473          1      0.921      0.583      0.473          1      0.921      0.405






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       4/12      5.08G     0.8252      1.063      1.692      1.156         12        640: 100%|██████████| 23/23 [00:05<00:00,  3.89it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.31it/s]

                   all         11         11      0.494      0.727      0.652      0.444      0.494      0.727      0.656      0.388






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       5/12      5.08G     0.8463      1.112      1.575      1.158         12        640: 100%|██████████| 23/23 [00:06<00:00,  3.68it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.47it/s]

                   all         11         11          1      0.491      0.888      0.627          1      0.491      0.888      0.619






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       6/12      5.08G     0.7726     0.9999       1.34      1.085         11        640: 100%|██████████| 23/23 [00:05<00:00,  3.93it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.41it/s]

                   all         11         11      0.984      0.909      0.988      0.703      0.984      0.909      0.988      0.694






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       7/12      5.08G     0.6928     0.9094      1.191      1.071         13        640: 100%|██████████| 23/23 [00:05<00:00,  3.87it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.49it/s]

                   all         11         11      0.993          1      0.995      0.776      0.993          1      0.995      0.721






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       8/12      5.08G     0.6609     0.8514      1.117      1.037         11        640: 100%|██████████| 23/23 [00:05<00:00,  3.87it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.73it/s]

                   all         11         11      0.984          1      0.995       0.74      0.984          1      0.995      0.715






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


       9/12      5.08G     0.6363     0.7952      1.029      1.006         12        640: 100%|██████████| 23/23 [00:06<00:00,  3.81it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.70it/s]

                   all         11         11      0.993          1      0.995      0.752      0.993          1      0.995      0.737






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


      10/12      5.08G     0.5896     0.7863     0.9476     0.9717         11        640: 100%|██████████| 23/23 [00:06<00:00,  3.56it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.56it/s]

                   all         11         11      0.994          1      0.995      0.833      0.994          1      0.995      0.743






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


      11/12      5.08G     0.5697     0.7389     0.8632     0.9771         12        640: 100%|██████████| 23/23 [00:05<00:00,  3.92it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.74it/s]

                   all         11         11      0.988          1      0.995      0.813      0.988          1      0.995      0.781






      Epoch    GPU_mem   box_loss   seg_loss   cls_loss   dfl_loss  Instances       Size


      12/12      5.09G     0.5366     0.7231     0.8242     0.9714         11        640: 100%|██████████| 23/23 [00:05<00:00,  3.92it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  6.47it/s]

                   all         11         11      0.995          1      0.995      0.861      0.995          1      0.995      0.796






12 epochs completed in 0.032 hours.
Optimizer stripped from runs\segment\train5\weights\last.pt, 6.0MB
Optimizer stripped from runs\segment\train5\weights\best.pt, 6.0MB

Validating runs\segment\train5\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-seg summary (fused): 113 layers, 2,834,958 parameters, 0 gradients, 10.2 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Mask(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  5.76it/s]


                   all         11         11      0.995          1      0.995      0.866      0.995          1      0.995      0.798
                drowsy         11         11      0.995          1      0.995      0.866      0.995          1      0.995      0.798
Speed: 0.4ms preprocess, 3.4ms inference, 0.0ms loss, 1.8ms postprocess per image
Results saved to runs\segment\train5

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : corresponding_cardinal_565
COMET INFO:     url                   : https://www.comet.com/qalccu/general/0335feaffd3b4c58beeb8a0a3d35ffaa
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [25]               : (0.00015419750000000005, 0.0011415199250000001)
COMET INFO:     lr/pg1 [25]               : (0.00015419750000000005, 0.0011415199250000001)
COMET INFO:     lr/pg2 [25]               : (0.00015419750000000005, 0.0011415199250000001)
COMET INFO:


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-2\test\images\sleepdec38_jpg.rf.143ada9487272792e1f5ed39cd33e7f1.jpg: 640x640 2 drowsys, 12.7ms
Speed: 2.0ms preprocess, 12.7ms inference, 15.7ms postprocess per image at shape (1, 3, 640, 640)
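
The per-epoch validation rows in the segmentation log can also be compared programmatically. The sketch below hand-copies the Box and Mask mAP50-95 columns from that log and reports the epoch with the highest value in each. The list and function names are illustrative only, and ranking by mAP50-95 alone is a simplifying assumption made here; Ultralytics selects `best.pt` using its own fitness score, which this sketch does not reproduce.

```python
# mAP50-95 per epoch, copied by hand from the segmentation training log (epochs 1..12).
box_map = [0.626, 0.147, 0.583, 0.444, 0.627, 0.703, 0.776, 0.74, 0.752, 0.833, 0.813, 0.861]
mask_map = [0.574, 0.176, 0.405, 0.388, 0.619, 0.694, 0.721, 0.715, 0.737, 0.743, 0.781, 0.796]


def best_epoch(values):
    """Return the 1-based epoch number with the highest metric value."""
    return max(range(len(values)), key=values.__getitem__) + 1


print("best Box mAP50-95 epoch:", best_epoch(box_map))
print("best Mask mAP50-95 epoch:", best_epoch(mask_map))
```

For this run both columns peak at the final epoch, which suggests the model was still improving when training stopped at 12 epochs.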


In [12]:
# Classification with YOLO11
from ultralytics import YOLO

model = YOLO('yolo11n-cls.pt')  # load a pretrained YOLO Classification model
model.train(data='datasets/Drowsiness-3', epochs=3)  # train the model
results = model('datasets/Drowsiness-3/test/drowsy/sleepdec38_jpg.rf.2fb782f58c4cd73172d1d88d602c2739.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=classify, mode=train, model=yolo11n-cls.pt, data=datasets/Drowsiness-3, epochs=3, time=None, patience=100, batch=16, imgsz=224, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train5, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=True, sh

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/9632a9b0824e4e079d73b2f0ac8eb8d0

COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\classify\train5', view at http://localhost:6006/
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed

train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-3\train... 366 images, 0 corrupt: 100%|██████████| 366/366 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-3\test... 9 images, 0 corrupt: 100%|██████████| 9/9 [00:00<?, ?it/s]

optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 39 weight(decay=0.0), 40 weight(decay=0.0005), 40 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 224 train, 224 val
Using 8 dataloader workers
Logging results to runs\classify\train5
Starting training for 3 epochs...

      Epoch    GPU_mem       loss  Instances       Size


        1/3     0.357G     0.4922         14        224: 100%|██████████| 23/23 [00:02<00:00,  9.81it/s]
               classes   top1_acc   top5_acc: 100%|██████████| 1/1 [00:02<00:00,  2.78s/it]

                   all          1          1






      Epoch    GPU_mem       loss  Instances       Size


        2/3     0.357G     0.0895         14        224: 100%|██████████| 23/23 [00:01<00:00, 16.11it/s]
               classes   top1_acc   top5_acc: 100%|██████████| 1/1 [00:00<00:00, 62.04it/s]

                   all          1          1






      Epoch    GPU_mem       loss  Instances       Size


        3/3     0.357G     0.0473         14        224: 100%|██████████| 23/23 [00:01<00:00, 15.29it/s]
               classes   top1_acc   top5_acc: 100%|██████████| 1/1 [00:00<00:00, 52.40it/s]


                   all          1          1

3 epochs completed in 0.003 hours.
Optimizer stripped from runs\classify\train5\weights\last.pt, 3.2MB
Optimizer stripped from runs\classify\train5\weights\best.pt, 3.2MB

Validating runs\classify\train5\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-cls summary (fused): 47 layers, 1,528,586 parameters, 0 gradients, 3.2 GFLOPs
train: E:\Programming\VSCodeProjects\datasets\Drowsiness-3\train... found 366 images in 2 classes
val: E:\Programming\VSCodeProjects\datasets\Drowsiness-3\test... found 9 images in 1 classes: ERROR  requires 2 classes, not 1
test: E:\Programming\VSCodeProjects\datasets\Drowsiness-3\test... found 9 images in 1 classes: ERROR  requires 2 classes, not 1


               classes   top1_acc   top5_acc: 100%|██████████| 1/1 [00:00<00:00, 68.68it/s]


                   all          1          1
Speed: 0.1ms preprocess, 1.0ms inference, 0.0ms loss, 0.0ms postprocess per image
Results saved to runs\classify\train5

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : honest_watt_4057
COMET INFO:     url                   : https://www.comet.com/qalccu/general/9632a9b0824e4e079d73b2f0ac8eb8d0
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]              : (0.00036674000000000006, 0.0005026005)
COMET INFO:     lr/pg1 [7]              : (0.00036674000000000006, 0.0005026005)
COMET INFO:     lr/pg2 [7]              : (0.00036674000000000006, 0.0005026005)
COMET INFO:     metrics/accuracy_top1   : 1.0


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-3\test\drowsy\sleepdec38_jpg.rf.2fb782f58c4cd73172d1d88d602c2739.jpg: 224x224 drowsy 1.00, natural 0.00, 75.8ms
Speed: 4.6ms preprocess, 75.8ms inference, 0.1ms postprocess per image at shape (1, 3, 224, 224)
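
The validation log for this run reports that the `test` folder holds images for only one class while `train` holds two, so the perfect accuracy figures say little about the `natural` class. Before training, the splits can be checked for missing class folders. The following is a minimal sketch, assuming a folder-per-class layout (`<root>/<split>/<class>/<image>`); the function names are hypothetical helpers, not part of Ultralytics.

```python
from pathlib import Path


def classes_per_split(root):
    """Map each split folder under root to the class subfolders that contain files."""
    result = {}
    for split_dir in sorted(Path(root).iterdir()):
        if not split_dir.is_dir():
            continue
        result[split_dir.name] = sorted(
            class_dir.name
            for class_dir in split_dir.iterdir()
            if class_dir.is_dir() and any(p.is_file() for p in class_dir.iterdir())
        )
    return result


def splits_missing_classes(root, reference="train"):
    """Return, per split, the classes present in the reference split but absent there."""
    found = classes_per_split(root)
    expected = set(found.get(reference, []))
    return {
        split: sorted(expected - set(names))
        for split, names in found.items()
        if split != reference and expected - set(names)
    }


# Example call (path taken from the cell above):
# print(splits_missing_classes("datasets/Drowsiness-3"))
```

An empty dictionary would mean every split contains every training class; any other result lists the classes that need images added before the accuracy figures can be trusted.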


In [5]:
from roboflow import Roboflow
rf = Roboflow(api_key="0nxjiKWHzZ4uQ3XnGWsd")
project = rf.workspace("test-3exll").project("drowsiness2-dh4cu")
version = project.version(1)
dataset = version.download("yolov8")

loading Roboflow workspace...
loading Roboflow project...


Downloading Dataset Version Zip in drowsiness2-1 to yolov8:: 100%|██████████| 7196/7196 [00:01<00:00, 3956.41it/s] 





Extracting Dataset Version Zip to drowsiness2-1 in yolov8:: 100%|██████████| 358/358 [00:00<00:00, 1439.87it/s]

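
The download cell above passes the Roboflow API key as a string literal, which means the key is published along with the notebook. One common alternative is to read it from an environment variable. The sketch below assumes a variable named `ROBOFLOW_API_KEY`; that name and the helper function are illustrative choices, not something the Roboflow package requires.

```python
import os


def roboflow_api_key(env_var="ROBOFLOW_API_KEY"):
    """Read the API key from an environment variable (variable name is an assumption)."""
    key = os.environ.get(env_var, "").strip()
    if not key:
        raise RuntimeError(f"Set the {env_var} environment variable before running this cell")
    return key


# Usage in the download cell (requires the roboflow package):
# from roboflow import Roboflow
# rf = Roboflow(api_key=roboflow_api_key())
```

With this in place the notebook can be shared without exposing the key; each user sets the variable in their own shell or environment file.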

In [8]:
# Pose with YOLO11
from ultralytics import YOLO

model = YOLO('yolo11n-pose.pt')  # load a pretrained YOLO Pose model
model.train(data='datasets/Drowsiness-5/data.yaml', epochs=3)  # train the model
results = model('datasets/Drowsiness-5/test/images/sleepdec8_jpg.rf.96b4513404a98f82e3603505d8edac06.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=pose, mode=train, model=yolo11n-pose.pt, data=datasets/Drowsiness-5/data.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train5, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=T

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/4e9c0d1fca0c4451abe01a9406e488c4

COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\pose\train5', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed

train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-5\train\labels.cache... 131 images, 0 backgrounds, 0 corrupt: 100%|██████████| 131/131 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-5\valid\labels.cache... 28 images, 0 backgrounds, 0 corrupt: 100%|██████████| 28/28 [00:00<?, ?it/s]

Plotting labels to runs\pose\train5\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 87 weight(decay=0.0), 97 weight(decay=0.0005), 96 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\pose\train5
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


        1/3      2.57G      1.649      2.502     0.7549      3.955      1.881          5        640: 100%|██████████| 9/9 [00:03<00:00,  2.55it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  2.96it/s]

                   all         28         30      0.433      0.375      0.328     0.0944      0.377      0.396      0.413      0.249






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


        2/3      2.69G      1.254      1.678     0.7173      2.823       1.62          9        640: 100%|██████████| 9/9 [00:02<00:00,  4.21it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.30it/s]

                   all         28         30      0.717      0.646      0.559      0.286      0.717      0.646      0.567      0.324






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


        3/3      2.69G      1.168      1.231      0.706      2.397      1.546          3        640: 100%|██████████| 9/9 [00:01<00:00,  4.57it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.09it/s]

                   all         28         30      0.632      0.562       0.56      0.344      0.649      0.579      0.589      0.345






3 epochs completed in 0.005 hours.
Optimizer stripped from runs\pose\train5\weights\last.pt, 5.6MB
Optimizer stripped from runs\pose\train5\weights\best.pt, 5.6MB

Validating runs\pose\train5\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-pose summary (fused): 109 layers, 2,654,521 parameters, 0 gradients, 6.6 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.21it/s]


                   all         28         30      0.634      0.562      0.557      0.343      0.652      0.583      0.586      0.344
                drowsy         22         24      0.697      0.958      0.731       0.45      0.727      0.999      0.788      0.438
                normal          6          6      0.571      0.167      0.383      0.237      0.577      0.167      0.383       0.25
Speed: 0.3ms preprocess, 2.7ms inference, 0.0ms loss, 2.3ms postprocess per image
Results saved to runs\pose\train5

COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : improved_chart_4602
COMET INFO:     url                   : https://www.comet.com/qalccu/general/4e9c0d1fca0c4451abe01a9406e488c4
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]               : (0.00013336, 0.0001898713)
COMET INFO:     lr/pg1 [7]               : (0.00013336, 0.0001898713)
COMET INFO:     lr/pg2 [7]               : (0.00013336, 0.0001898713)
COMET INFO:     metrics/mAP50(B) [8]     : (0.32771, 0.55976)
COMET INFO


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-5\test\images\sleepdec8_jpg.rf.96b4513404a98f82e3603505d8edac06.jpg: 640x640 2 drowsys, 12.0ms
Speed: 2.1ms preprocess, 12.0ms inference, 2.1ms postprocess per image at shape (1, 3, 640, 640)
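
In the final validation table of this 3-epoch pose run, the `all` row appears to be the plain (unweighted) average of the per-class rows, even though `drowsy` has 24 instances and `normal` only 6. The sketch below checks that reading against the Box columns, using values copied by hand from the log; the dictionary layout and function names are illustrative, and the conclusion is only what these numbers suggest, not a statement about Ultralytics internals.

```python
# Box metrics per class, copied by hand from the 3-epoch pose validation table.
per_class = {
    "drowsy": {"instances": 24, "P": 0.697, "R": 0.958, "mAP50": 0.731, "mAP50-95": 0.45},
    "normal": {"instances": 6, "P": 0.571, "R": 0.167, "mAP50": 0.383, "mAP50-95": 0.237},
}
# The "all" row reported in the same table.
reported_all = {"P": 0.634, "R": 0.562, "mAP50": 0.557, "mAP50-95": 0.343}


def macro_mean(rows, key):
    """Unweighted average over classes."""
    return sum(row[key] for row in rows.values()) / len(rows)


def weighted_mean(rows, key):
    """Average over classes weighted by instance count."""
    total = sum(row["instances"] for row in rows.values())
    return sum(row[key] * row["instances"] for row in rows.values()) / total


for key, reported in reported_all.items():
    print(f"{key}: reported={reported}, macro={macro_mean(per_class, key):.4f}, "
          f"weighted={weighted_mean(per_class, key):.4f}")
```

Because the two classes count equally in the `all` row, the weak recall on `normal` (0.167 from only 6 instances) pulls the overall recall down much more than an instance-weighted average would.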


In [9]:
# Pose again, this time with 12 epochs instead of 3
from ultralytics import YOLO

model = YOLO('yolo11n-pose.pt')  # load a pretrained YOLO Pose model
model.train(data='datasets/Drowsiness-5/data.yaml', epochs=12)  # train the model
results = model('datasets/Drowsiness-5/test/images/sleepdec8_jpg.rf.96b4513404a98f82e3603505d8edac06.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=pose, mode=train, model=yolo11n-pose.pt, data=datasets/Drowsiness-5/data.yaml, epochs=12, time=None, patience=100, batch=16, imgsz=640, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train6, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/dc738154016145ef94e3c273ce30f052

COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.

TensorBoard: Start with 'tensorboard --logdir runs\pose\train6', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed

train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-5\train\labels.cache... 131 images, 0 backgrounds, 0 corrupt: 100%|██████████| 131/131 [00:00<?, ?it/s]
val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-5\valid\labels.cache... 28 images, 0 backgrounds, 0 corrupt: 100%|██████████| 28/28 [00:00<?, ?it/s]

Plotting labels to runs\pose\train6\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 87 weight(decay=0.0), 97 weight(decay=0.0005), 96 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 640 train, 640 val
Using 8 dataloader workers
Logging results to runs\pose\train6
Starting training for 12 epochs...

      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       1/12      2.56G      1.649      2.502     0.7549      3.955      1.881          5        640: 100%|██████████| 9/9 [00:03<00:00,  2.40it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  2.54it/s]

                   all         28         30      0.433      0.375      0.328     0.0944      0.377      0.396      0.413      0.249






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       2/12      2.68G      1.239      1.602     0.7106      2.771      1.619          9        640: 100%|██████████| 9/9 [00:02<00:00,  3.23it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  2.86it/s]

                   all         28         30      0.468      0.646      0.548      0.289      0.468      0.646      0.551      0.312





Closing dataloader mosaic

      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       3/12      2.68G       1.12      1.033     0.6626      2.736      1.549          3        640: 100%|██████████| 9/9 [00:02<00:00,  4.12it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.30it/s]

                   all         28         30       0.54      0.729      0.651      0.405       0.54      0.729      0.652      0.378






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       4/12      2.68G      1.085     0.7982     0.6688      2.222      1.431          4        640: 100%|██████████| 9/9 [00:01<00:00,  4.96it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.24it/s]

                   all         28         30      0.612      0.638      0.616      0.415      0.612      0.638      0.616      0.478






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       5/12      2.68G     0.9896     0.5858     0.5715       2.03      1.329          3        640: 100%|██████████| 9/9 [00:01<00:00,  4.82it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.03it/s]

                   all         28         30      0.306      0.812       0.55      0.381      0.306      0.812       0.55      0.466






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       6/12      2.68G     0.9587      0.584     0.5551      1.799      1.201          3        640: 100%|██████████| 9/9 [00:01<00:00,  5.09it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.41it/s]

                   all         28         30      0.647      0.646      0.636       0.42      0.647      0.646      0.644       0.55






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       7/12      2.68G     0.9214     0.5126     0.4799      1.565      1.255          3        640: 100%|██████████| 9/9 [00:01<00:00,  5.33it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.66it/s]

                   all         28         30      0.566      0.727      0.631      0.435      0.566      0.727      0.631      0.576






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       8/12      2.68G     0.8592      0.508     0.4469       1.54      1.136          3        640: 100%|██████████| 9/9 [00:01<00:00,  5.24it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.68it/s]

                   all         28         30       0.57      0.833      0.682      0.472       0.57      0.833      0.682      0.643






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


       9/12      2.69G      0.841     0.4173     0.4596      1.315      1.146          3        640: 100%|██████████| 9/9 [00:01<00:00,  5.25it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.46it/s]

                   all         28         30      0.874      0.497      0.557      0.373      0.874      0.497      0.557      0.518






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


      10/12      2.69G     0.8252     0.3747     0.3965      1.268      1.112          4        640: 100%|██████████| 9/9 [00:01<00:00,  5.43it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.51it/s]

                   all         28         30      0.569       0.58      0.552      0.379      0.569       0.58      0.552      0.527






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


      11/12      2.71G     0.7763     0.3373     0.3736      1.249      1.077          3        640: 100%|██████████| 9/9 [00:02<00:00,  4.12it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.66it/s]

                   all         28         30      0.466       0.75      0.573      0.404      0.466       0.75      0.573      0.559






      Epoch    GPU_mem   box_loss  pose_loss  kobj_loss   cls_loss   dfl_loss  Instances       Size


      12/12      2.71G     0.7721     0.3663     0.3665      1.193      1.059          4        640: 100%|██████████| 9/9 [00:01<00:00,  5.22it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.56it/s]

                   all         28         30      0.448          1      0.588       0.42      0.448          1      0.588      0.575






12 epochs completed in 0.019 hours.
Optimizer stripped from runs\pose\train6\weights\last.pt, 5.6MB
Optimizer stripped from runs\pose\train6\weights\best.pt, 5.6MB

Validating runs\pose\train6\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-pose summary (fused): 109 layers, 2,654,521 parameters, 0 gradients, 6.6 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95)     Pose(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:00<00:00,  3.59it/s]


                   all         28         30      0.569      0.833      0.682      0.472      0.569      0.833      0.682      0.643
                drowsy         22         24       0.73          1      0.811      0.545       0.73          1      0.811      0.733
                normal          6          6      0.408      0.667      0.552        0.4      0.408      0.667      0.552      0.552
Speed: 0.3ms preprocess, 2.6ms inference, 0.0ms loss, 1.4ms postprocess per image
Results saved to runs\pose\train6


COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : armed_bilberry_3621
COMET INFO:     url                   : https://www.comet.com/qalccu/general/dc738154016145ef94e3c273ce30f052
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [25]               : (0.00013336, 0.0005219377)
COMET INFO:     lr/pg1 [25]               : (0.00013336, 0.0005219377)
COMET INFO:     lr/pg2 [25]               : (0.00013336, 0.0005219377)
COMET INFO:     metrics/mAP50(B) [26]     : (0.32771, 0.68157)
COMET


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-5\test\images\sleepdec8_jpg.rf.96b4513404a98f82e3603505d8edac06.jpg: 640x640 1 drowsy, 28.3ms
Speed: 2.3ms preprocess, 28.3ms inference, 4.4ms postprocess per image at shape (1, 3, 640, 640)
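
The validation of `best.pt` above reports the metrics of epoch 8 (box mAP50-95 0.472, pose mAP50-95 0.643) rather than those of the final epoch. The sketch below illustrates one plausible reason, assuming checkpoints are ranked by a fitness score of 0.1 × mAP50 + 0.9 × mAP50-95 summed over the box and pose metrics. That weighting is an assumption about Ultralytics 8.3.107, and `fitness` is a hypothetical helper written for illustration, not the library's own function. The numbers are copied from the "all" rows for epochs 8 to 12 in the log.

```python
# Validation metrics copied from the "all" rows of the training log above.
# epoch -> (box mAP50, box mAP50-95, pose mAP50, pose mAP50-95)
metrics = {
    8: (0.682, 0.472, 0.682, 0.643),
    9: (0.557, 0.373, 0.557, 0.518),
    10: (0.552, 0.379, 0.552, 0.527),
    11: (0.573, 0.404, 0.573, 0.559),
    12: (0.588, 0.420, 0.588, 0.575),
}


def fitness(box_map50, box_map, pose_map50, pose_map):
    """Hypothetical helper: assumed weighting of 0.1 * mAP50 + 0.9 * mAP50-95,
    computed separately for box and pose metrics and then summed."""
    box_score = 0.1 * box_map50 + 0.9 * box_map
    pose_score = 0.1 * pose_map50 + 0.9 * pose_map
    return box_score + pose_score


# Score every epoch and pick the one with the highest fitness.
scores = {epoch: fitness(*values) for epoch, values in metrics.items()}
best_epoch = max(scores, key=scores.get)

for epoch, score in sorted(scores.items()):
    print(f"epoch {epoch:2d}: fitness {score:.4f}")
print("best epoch among those listed:", best_epoch)
```

Under this assumed weighting, epoch 8 scores highest among the epochs shown here, which is consistent with the per-class validation table printed for `best.pt`. Earlier epochs are outside this excerpt and are not included in the comparison.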


In [2]:
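import os
from roboflow import Roboflow

# Read the API key from the environment instead of hardcoding it in the notebook.
# ROBOFLOW_API_KEY is an assumed variable name; set it before running this cell.
rf = Roboflow(api_key=os.environ["ROBOFLOW_API_KEY"])
project = rf.workspace("test-3exll").project("drowsiness-h1xmb-pofma")
version = project.version(3)
dataset = version.download("yolov8-obb")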

loading Roboflow workspace...
loading Roboflow project...


Downloading Dataset Version Zip in Drowsiness-3 to yolov8-obb:: 100%|██████████| 15912/15912 [00:02<00:00, 7113.61it/s] 





Extracting Dataset Version Zip to Drowsiness-3 in yolov8-obb:: 100%|██████████| 784/784 [00:00<00:00, 2150.54it/s]


In [4]:
# Oriented bounding box (OBB) detection with YOLO11
from ultralytics import YOLO

model = YOLO('yolo11n-obb.pt')  # load a pretrained YOLO OBB model
model.train(data='datasets/Drowsiness-4/data.yaml', epochs=3)  # train the model
results = model('datasets/Drowsiness-4/test/images/sleepdec38_jpg.rf.143ada9487272792e1f5ed39cd33e7f1.jpg')  # predict on an image

results[0].show()  # plot results

Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
engine\trainer: task=obb, mode=train, model=yolo11n-obb.pt, data=datasets/Drowsiness-4/data.yaml, epochs=3, time=None, patience=100, batch=16, imgsz=1024, save=True, save_period=-1, cache=False, device=None, workers=8, project=None, name=train8, exist_ok=False, pretrained=True, optimizer=auto, verbose=True, seed=0, deterministic=True, single_cls=False, rect=False, cos_lr=False, close_mosaic=10, resume=False, amp=True, fraction=1.0, profile=False, freeze=None, multi_scale=False, overlap_mask=True, mask_ratio=4, dropout=0.0, val=True, split=val, save_json=False, conf=None, iou=0.7, max_det=300, half=False, dnn=False, plots=True, source=None, vid_stride=1, stream_buffer=False, visualize=False, augment=False, agnostic_nms=False, classes=None, retina_masks=False, embed=None, show=False, save_frames=False, save_txt=False, save_conf=False, save_crop=False, show_labels=True, show_conf=Tr

COMET INFO: Experiment is live on comet.com https://www.comet.com/qalccu/general/d734ca2ed94e4d1cbf9a2e326ae35bcd

COMET INFO: Couldn't find a Git repository in 'e:\\Programming\\VSCodeProjects' nor in any parent directory. Set `COMET_GIT_DIRECTORY` if your Git Repository is elsewhere.


TensorBoard: Start with 'tensorboard --logdir runs\obb\train8', view at http://localhost:6006/
Freezing layer 'model.23.dfl.conv.weight'
AMP: running Automatic Mixed Precision (AMP) checks...
AMP: checks passed

train: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-4\train\labels.cache... 366 images, 0 backgrounds, 10 corrupt: 100%|██████████| 366/366 [00:00<?, ?it/s]

val: Scanning E:\Programming\VSCodeProjects\datasets\Drowsiness-4\valid\labels.cache... 11 images, 0 backgrounds, 1 corrupt: 100%|██████████| 11/11 [00:00<?, ?it/s]

Plotting labels to runs\obb\train8\labels.jpg...
optimizer: 'optimizer=auto' found, ignoring 'lr0=0.01' and 'momentum=0.937' and determining best 'optimizer', 'lr0' and 'momentum' automatically...
optimizer: AdamW(lr=0.001667, momentum=0.9) with parameter groups 87 weight(decay=0.0), 97 weight(decay=0.0005), 96 bias(decay=0.0)
TensorBoard: model graph visualization added
Image sizes 1024 train, 1024 val
Using 8 dataloader workers
Logging results to runs\obb\train8
Starting training for 3 epochs...

      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        1/3      10.3G      1.688      4.663      3.446         20       1024: 100%|██████████| 23/23 [03:08<00:00,  8.22s/it]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:09<00:00,  9.44s/it]

                   all         10         20      0.974       0.05     0.0475     0.0238






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        2/3      5.52G      1.264       2.62      3.092         11       1024: 100%|██████████| 23/23 [00:10<00:00,  2.24it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:04<00:00,  4.32s/it]

                   all         10         20      0.988        0.2      0.198      0.154






      Epoch    GPU_mem   box_loss   cls_loss   dfl_loss  Instances       Size


        3/3      5.53G      1.175      1.978      2.956         15       1024: 100%|██████████| 23/23 [00:09<00:00,  2.42it/s]
                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:01<00:00,  1.12s/it]

                   all         10         20      0.997       0.55      0.598      0.344






3 epochs completed in 0.065 hours.
Optimizer stripped from runs\obb\train8\weights\last.pt, 6.2MB
Optimizer stripped from runs\obb\train8\weights\best.pt, 6.2MB

Validating runs\obb\train8\weights\best.pt...
Ultralytics 8.3.107  Python-3.12.9 torch-2.6.0+cu118 CUDA:0 (NVIDIA GeForce RTX 3050, 8192MiB)
YOLO11n-obb summary (fused): 109 layers, 2,654,113 parameters, 0 gradients, 6.6 GFLOPs


                 Class     Images  Instances      Box(P          R      mAP50  mAP50-95): 100%|██████████| 1/1 [00:01<00:00,  1.13s/it]


                   all         10         20      0.997       0.55      0.598      0.351
                drowsy         10         20      0.997       0.55      0.598      0.351
Speed: 0.8ms preprocess, 6.9ms inference, 0.0ms loss, 97.6ms postprocess per image
Results saved to runs\obb\train8


COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO: Comet.ml Experiment Summary
COMET INFO: ---------------------------------------------------------------------------------------
COMET INFO:   Data:
COMET INFO:     display_summary_level : 1
COMET INFO:     name                  : registered_giraffe_9958
COMET INFO:     url                   : https://www.comet.com/qalccu/general/d734ca2ed94e4d1cbf9a2e326ae35bcd
COMET INFO:   Metrics [count] (min, max):
COMET INFO:     lr/pg0 [7]               : (0.00036674000000000006, 0.0005026005)
COMET INFO:     lr/pg1 [7]               : (0.00036674000000000006, 0.0005026005)
COMET INFO:     lr/pg2 [7]               : (0.00036674000000000006, 0.0005026005)
COMET INFO:     metrics/mAP50(B) [8]     : (


image 1/1 e:\Programming\VSCodeProjects\datasets\Drowsiness-4\test\images\sleepdec38_jpg.rf.143ada9487272792e1f5ed39cd33e7f1.jpg: 1024x1024 13.9ms
Speed: 7.4ms preprocess, 13.9ms inference, 7.9ms postprocess per image at shape (1, 3, 1024, 1024)
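
To compare the OBB epochs without reading the table by eye, the "all" rows can be parsed into records. The following is a minimal sketch using the three per-epoch rows copied from the log above. `parse_row` and `FIELDS` are hypothetical names introduced for illustration, and the parser assumes the seven-column OBB layout (class, images, instances, P, R, mAP50, mAP50-95) rather than the wider pose layout.

```python
# "all" rows copied from the three OBB training epochs in the log above.
rows = [
    "all         10         20      0.974       0.05     0.0475     0.0238",
    "all         10         20      0.988        0.2      0.198      0.154",
    "all         10         20      0.997       0.55      0.598      0.344",
]

# Names for the four metric columns that follow class, images and instances.
FIELDS = ("precision", "recall", "map50", "map50_95")


def parse_row(line):
    """Hypothetical helper: split one whitespace-separated metrics row into a dict."""
    parts = line.split()
    if len(parts) != 7:
        raise ValueError(f"expected 7 columns, got {len(parts)}: {line!r}")
    record = {
        "class": parts[0],
        "images": int(parts[1]),
        "instances": int(parts[2]),
    }
    record.update(zip(FIELDS, (float(value) for value in parts[3:])))
    return record


history = [parse_row(row) for row in rows]
recalls = [record["recall"] for record in history]

for epoch, record in enumerate(history, start=1):
    print(f"epoch {epoch}: P={record['precision']:.3f} R={record['recall']:.3f} "
          f"mAP50={record['map50']:.3f} mAP50-95={record['map50_95']:.3f}")
```

In these rows precision stays close to 1.0 while recall rises from 0.05 to 0.55, which suggests the model is still missing many instances after three epochs and may benefit from longer training.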
