<a href="https://colab.research.google.com/github/pourmand1376/Polyp_detection/blob/main/Kvasir_SEG_YOLOX.ipynb" target="_parent"><img src="https://colab.research.google.com/assets/colab-badge.svg" alt="Open In Colab"/></a>

# How to Train YOLOX on Custom Objects

This tutorial is based on the [YOLOX repository](https://github.com/Megvii-BaseDetection/YOLOX) by [the Megvii Team](https://github.com/Megvii-BaseDetection). This notebook shows training on **your own custom objects**. Many thanks to the Megvii Team for putting this repository together. We hope that, in combination with clean data management tools at Roboflow, this technology will become easily accessible to any developer wishing to use computer vision in their projects.

### Accompanying Blog Post

We recommend that you follow along in this notebook while reading the blog post on [How to Train YOLOX](https://blog.roboflow.com/how-to-train-yolox-on-a-custom-dataset/).

### Steps Covered in this Tutorial

In this tutorial, we will walk through the steps required to train YOLOX on your custom objects. We use a [public blood cell detection dataset](https://public.roboflow.ai/object-detection/bccd), which is open source and free to use. You can also use this notebook on your own data. We will use Roboflow to preprocess our images.

To train our detector we take the following steps:

* Install YOLOX dependencies
* Download and Prepare custom YOLOX object detection data
* Download Pre-Trained Weights for YOLOX
* Run YOLOX training
* Evaluate YOLOX performance
* Run YOLOX inference on test images
* Export saved YOLOX weights for future inference

### **About**

[Roboflow](https://roboflow.com) enables teams to deploy custom computer vision models quickly and accurately. Convert data between annotation formats, assess dataset health, preprocess, augment, and more. It's free for your first 1000 source images.

**Looking for a vision model available via API without hassle? Try Roboflow Train.**

![Roboflow Wordmark](https://i.imgur.com/dcLNMhV.png)

# Install YOLOX Dependencies

In [None]:
!git clone https://github.com/roboflow-ai/YOLOX.git
%cd YOLOX
!pip3 install -U pip && pip3 install -r requirements.txt
!pip3 install -v -e .  
!pip uninstall -y torch torchvision torchaudio
# May need to change in the future if Colab no longer uses CUDA 11.1 (the cu111 wheels below)
!pip install torch==1.8.0+cu111 torchvision==0.9.0+cu111 torchaudio==0.8.0 -f https://download.pytorch.org/whl/torch_stable.html

Cloning into 'YOLOX'...
remote: Enumerating objects: 786, done.
remote: Total 786 (delta 0), reused 0 (delta 0), pack-reused 786
Receiving objects: 100% (786/786), 5.78 MiB | 18.84 MiB/s, done.
Resolving deltas: 100% (416/416), done.
/content/YOLOX
Collecting pip
  Downloading pip-22.0.4-py3-none-any.whl (2.1 MB)
Installing collected packages: pip
  Attempting uninstall: pip
    Found existing installation: pip 21.1.3
    Uninstalling pip-21.1.3:
      Successfully uninstalled pip-21.1.3
Successfully installed pip-22.0.4
Collecting loguru
  Downloading loguru-0.6.0-py3-none-any.whl (58 kB)
Collecting thop
  Downloading thop-0.0.31.post2005241907-py3-none-any.whl (8.7 kB)
Collecting ninja
  Downloading ninja-1.10.2.3-py2.py3-none-manylinux_2_5_x86_64.manylinux1_x86_64.whl (108 kB)

## Install Nvidia Apex

In [None]:
%cd /content/
!git clone https://github.com/NVIDIA/apex
%cd apex
!pip install -v --disable-pip-version-check --no-cache-dir --global-option="--cpp_ext" --global-option="--cuda_ext" ./

/content
Cloning into 'apex'...
remote: Enumerating objects: 9079, done.
remote: Counting objects: 100% (150/150), done.
remote: Compressing objects: 100% (125/125), done.
remote: Total 9079 (delta 83), reused 41 (delta 24), pack-reused 8929
Receiving objects: 100% (9079/9079), 14.59 MiB | 19.87 MiB/s, done.
Resolving deltas: 100% (6196/6196), done.
/content/apex
Using pip 22.0.4 from /usr/local/lib/python3.7/dist-packages/pip (python 3.7)
Processing /content/apex
  Running command python setup.py egg_info


  torch.__version__  = 1.8.0+cu111


  running egg_info
  creating /tmp/pip-pip-egg-info-cygeeytc/apex.egg-info
  writing /tmp/pip-pip-egg-info-cygeeytc/apex.egg-info/PKG-INFO
  writing dependency_links to /tmp/pip-pip-egg-info-cygeeytc/apex.egg-info/dependency_links.txt
  writing top-level names to /tmp/pip-pip-egg-info-cygeeytc/apex.egg-info/top_level.txt
  writing manifest file '/tmp/pip-pip-egg-info-cygeeytc/apex.egg-info/SOURCES.txt'
  adding license file 'LICE
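
The pinned wheels encode their CUDA build in the local version tag, and the Apex build log echoes the same string (`1.8.0+cu111`). The following is a minimal sketch for decoding that tag so it can be compared with the CUDA toolkit on the runtime. `parse_torch_build` is a hypothetical helper, not part of PyTorch, and it assumes the tag follows the `cu<major><minor>` pattern with a single-digit minor version.

```python
import re

def parse_torch_build(version: str):
    """Split a version string such as '1.8.0+cu111' into (release, cuda_version).

    cuda_version is None for CPU-only or untagged builds.
    Assumption: the CUDA tag is 'cu' + major digits + one minor digit.
    """
    release, _, local = version.partition("+")
    match = re.fullmatch(r"cu(\d+)(\d)", local)
    if match is None:
        return release, None
    major, minor = match.groups()
    return release, f"{major}.{minor}"

print(parse_torch_build("1.8.0+cu111"))  # ('1.8.0', '11.1')
```

In the notebook you could pass `torch.__version__` to the helper; `torch.version.cuda` reports the CUDA version directly.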

## Install PyCocoTools

In [None]:
!pip3 install cython; pip3 install 'git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI'

Collecting git+https://github.com/cocodataset/cocoapi.git#subdirectory=PythonAPI
  Cloning https://github.com/cocodataset/cocoapi.git to /tmp/pip-req-build-h88g3in_
  Running command git clone --filter=blob:none --quiet https://github.com/cocodataset/cocoapi.git /tmp/pip-req-build-h88g3in_
  Resolved https://github.com/cocodataset/cocoapi.git to commit 8c9bcc3cf640524c4c20a9c40e89cb6a2f2fa0e9
  Preparing metadata (setup.py) ... done
Building wheels for collected packages: pycocotools
  Building wheel for pycocotools (setup.py) ... done
  Created wheel for pycocotools: filename=pycocotools-2.0-cp37-cp37m-linux_x86_64.whl size=264363 sha256=fac5774927f71d6515d900c1b01d71f9fe6a0a13a526b8f9395b59f42f036f12
  Stored in directory: /tmp/pip-ephem-wheel-cache-9pyq294r/wheels/e2/6b/1d/344ac773c7495ea0b85eb228bc66daec7400a143a92d36b7b1
Successfully built pycocotools
Installing collected packages: pycocotools
  Attempting uninstall: pycocotools
    Found existing insta

# Download your Data

We'll download our dataset from Roboflow. Use the "**Pascal VOC**" export format.

To get your data into Roboflow, follow the [Getting Started Guide](https://blog.roboflow.ai/getting-started-with-roboflow/).


In [None]:
#to get your roboflow code below please follow the link output by this cell
!pip -q install roboflow
from roboflow import Roboflow
rf = Roboflow(model_format="voc", notebook="yolox")

  Preparing metadata (setup.py) ... done
  Building wheel for wget (setup.py) ... done
ERROR: pip's dependency resol

In [None]:
%cd /content/
from roboflow import Roboflow
rf = Roboflow(api_key="YOUR_API_KEY")  # replace with your own Roboflow API key; do not commit real keys
project = rf.workspace().project("kvasir-seg-hrk5k")
dataset = project.version("1").download("voc")

/content
loading Roboflow workspace...
loading Roboflow project...
Downloading Dataset Version Zip in Kvasir-SEG-1 to voc: 100% [49617130 / 49617130] bytes


Extracting Dataset Version Zip to Kvasir-SEG-1 in voc:: 100%|██████████| 4805/4805 [00:05<00:00, 925.25it/s]
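
Before formatting the data, it can be worth checking that every image has an annotation file. The sketch below assumes the export places each image next to an `.xml` file with the same stem inside a split folder. That layout is an assumption about the download, and `find_unpaired` is a hypothetical helper rather than part of the Roboflow package.

```python
from pathlib import Path

IMAGE_SUFFIXES = {".jpg", ".jpeg", ".png"}

def find_unpaired(split_dir):
    """Return (images_without_xml, xml_without_image) as sorted lists of file stems."""
    files = list(Path(split_dir).iterdir())
    images = {p.stem for p in files if p.suffix.lower() in IMAGE_SUFFIXES}
    labels = {p.stem for p in files if p.suffix.lower() == ".xml"}
    return sorted(images - labels), sorted(labels - images)
```

In the notebook, something like `find_unpaired(f"{dataset.location}/train")` should return two empty lists if the split is consistent.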


In [None]:
%cd YOLOX/
!ln -s {dataset.location}/train/ ./datasets/VOCdevkit

/content/YOLOX


## Format Your Data Appropriately

In [None]:
%mkdir "/content/YOLOX/datasets/VOCdevkit/VOC2007"
!python3 voc_txt.py "/content/YOLOX/datasets/VOCdevkit/"
%mkdir "/content/YOLOX/datasets/VOCdevkit/VOC2012"
!cp -r "/content/YOLOX/datasets/VOCdevkit/VOC2007/." "/content/YOLOX/datasets/VOCdevkit/VOC2012"

train and val size: 1890
train size: 1512
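
The counts printed above (1512 of 1890) correspond to an 80/20 split. The script's own logic is not shown in this notebook, so the following is only a sketch of how a reproducible split of that kind might be produced. The ratio is inferred from the output, and `split_ids` is a hypothetical helper.

```python
import random

def split_ids(ids, train_ratio=0.8, seed=0):
    """Shuffle image ids reproducibly and split them into (train, val) lists."""
    ids = sorted(ids)          # deterministic starting order
    rng = random.Random(seed)  # local RNG, leaves the global random state untouched
    rng.shuffle(ids)
    n_train = round(len(ids) * train_ratio)
    return ids[:n_train], ids[n_train:]

train, val = split_ids([f"img_{i:04d}" for i in range(1890)])
print(len(train), len(val))  # 1512 378
```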


## Change the Classes
Make sure you change the classes based on what your dataset. To ensure that the training process will function as intended, write the classes in lowercase with no whitespace.

In [None]:
from IPython.core.magic import register_line_cell_magic

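@register_line_cell_magic
def writetemplate(line, cell):
    """Write the cell body to the file named on the magic line, filling {name} placeholders from notebook globals."""
    with open(line, 'w') as f:
        f.write(cell.format(**globals()))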
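
Because the magic passes the cell text through `str.format`, any `{name}` in the cell is replaced by a variable and literal braces have to be doubled. A small standalone illustration, using made-up placeholder names and values rather than anything from this notebook:

```python
# {num_classes} and {first_class} are placeholders; {{ and }} produce literal braces.
template = 'NUM_CLASSES = {num_classes}\nNAMES = {{"first": "{first_class}"}}\n'

# Hypothetical values; the magic uses the notebook's globals() instead.
values = {"num_classes": 1, "first_class": "polyp"}

rendered = template.format(**values)
print(rendered)
```

A placeholder with no matching variable raises `KeyError`, so a cell that contains unescaped braces will fail when the magic runs.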

In [None]:
%%writetemplate /content/YOLOX/yolox/data/datasets/voc_classes.py
# REPLACE the names below with your class names, lowercase and stripped of whitespace.
# The trailing comma keeps a single-class entry a tuple rather than a plain string.

VOC_CLASSES = (
  "JSON",
)

In [None]:
%%writetemplate /content/YOLOX/yolox/data/datasets/coco_classes.py
# REPLACE the names below with your class names, lowercase and stripped of whitespace.
# The trailing comma keeps a single-class entry a tuple rather than a plain string.

COCO_CLASSES = (
    "JSON",
)
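
Two details are easy to get wrong when editing these class files: the naming rule (lowercase, no whitespace) and the fact that parentheses around a single string do not create a tuple. The sketch below shows both. `normalize_class_names` is a hypothetical helper, and whether the normalized names match what the dataset loader reads from your annotation files is something it does not check.

```python
def normalize_class_names(names):
    """Lowercase each name and remove all whitespace; always returns a tuple."""
    return tuple("".join(name.split()).lower() for name in names)

classes = normalize_class_names(["Red Blood Cell", " WBC "])
print(classes)            # ('redbloodcell', 'wbc')

print(type(("polyp")))    # <class 'str'>: parentheses alone do not make a tuple
print(type(("polyp",)))   # <class 'tuple'>: the trailing comma does
```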

Set the number of classes in your dataset in the `NUM_CLASSES` variable.

In [None]:
NUM_CLASSES = 1
!sed -i -e 's/self.num_classes = 20/self.num_classes = {NUM_CLASSES}/g' "/content/YOLOX/exps/example/yolox_voc/yolox_voc_s.py"
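
If you are unsure how many classes your export contains, the count can be derived from the annotations instead of being typed by hand. The sketch below reads the `<object><name>` entries of Pascal VOC XML. It works on XML strings so that it stays self-contained; the sample annotation and the class name `polyp` are made up for illustration.

```python
import xml.etree.ElementTree as ET

def class_names_from_voc(xml_documents):
    """Collect the distinct <object><name> values from Pascal VOC annotation strings."""
    names = set()
    for doc in xml_documents:
        root = ET.fromstring(doc)
        for obj in root.iter("object"):
            name = obj.findtext("name")
            if name:
                names.add(name.strip())
    return sorted(names)

# Hypothetical annotation with two boxes of the same class.
sample = """<annotation>
  <object><name>polyp</name><bndbox><xmin>1</xmin><ymin>2</ymin><xmax>30</xmax><ymax>40</ymax></bndbox></object>
  <object><name>polyp</name><bndbox><xmin>5</xmin><ymin>6</ymin><xmax>50</xmax><ymax>60</ymax></bndbox></object>
</annotation>"""

names = class_names_from_voc([sample])
print(names, len(names))  # ['polyp'] 1
```

With real data you would pass the text of each `.xml` file, for example `p.read_text()` for every `p` in `Path(folder).glob("*.xml")`, and use `len(names)` as `NUM_CLASSES`.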

# Download Pretrained Weights

In [None]:
%cd /content/
!wget https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_s.pth
%cd /content/YOLOX/

/content
--2022-03-08 20:40:32--  https://github.com/Megvii-BaseDetection/storage/releases/download/0.0.1/yolox_s.pth
Resolving github.com (github.com)... 140.82.112.3
Connecting to github.com (github.com)|140.82.112.3|:443... connected.
HTTP request sent, awaiting response... 302 Found
Location: https://objects.githubusercontent.com/github-production-release-asset-2e65be/388351473/0b307dd4-bddb-4cfe-a863-1d19afb5598a?X-Amz-Algorithm=AWS4-HMAC-SHA256&X-Amz-Credential=AKIAIWNJYAX4CSVEH53A%2F20220308%2Fus-east-1%2Fs3%2Faws4_request&X-Amz-Date=20220308T204032Z&X-Amz-Expires=300&X-Amz-Signature=331c5df5fa642873de3795cf107e78b2525ebdb502dca5a30a5c60d5b156350b&X-Amz-SignedHeaders=host&actor_id=0&key_id=0&repo_id=388351473&response-content-disposition=attachment%3B%20filename%3Dyolox_s.pth&response-content-type=application%2Foctet-stream [following]
--2022-03-08 20:40:32--  https://objects.githubusercontent.com/github-production-release-asset-2e65be/388351473/0b307dd4-bddb-4cfe-a863-1d19afb55

# Train the Model

In [None]:
!python tools/train.py -f exps/example/yolox_voc/yolox_voc_s.py -d 1 -b 16 --fp16 -o -c /content/yolox_s.pth

`fused_weight_gradient_mlp_cuda` module not found. gradient accumulation fusion with weight gradient computation disabled.
2022-03-08 20:41:02 | INFO     | yolox.core.trainer:125 - args: Namespace(batch_size=16, ckpt='/content/yolox_s.pth', devices=1, dist_backend='nccl', dist_url=None, exp_file='exps/example/yolox_voc/yolox_voc_s.py', experiment_name='yolox_voc_s', fp16=True, local_rank=0, machine_rank=0, name=None, num_machines=1, occupy=True, opts=[], resume=False, start_epoch=None)
2022-03-08 20:41:02 | INFO     | yolox.core.trainer:126 - exp value:
╒══════════════════╤════════════════════════════╕
│ keys             │ values                     │
╞══════════════════╪════════════════════════════╡
│ seed             │ None                       │
├──────────────────┼────────────────────────────┤
│ output_dir       │ './YOLOX_outputs'          │
├──────────────────┼────────────────────────────┤
│ print_
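
The argument dump in the log shows how the short flags map to settings: `-f` is the experiment file, `-d` the number of devices, `-b` the batch size, `--fp16` enables mixed precision, `-o` sets `occupy=True`, and `-c` is the checkpoint to start from. As a sketch, the same command can be assembled in Python, which makes it easier to vary one setting at a time. `build_train_command` is a hypothetical helper, not part of YOLOX.

```python
import shlex

def build_train_command(exp_file, ckpt, devices=1, batch_size=16, fp16=True, occupy=True):
    """Assemble the tools/train.py invocation used in this notebook as an argument list."""
    cmd = ["python", "tools/train.py", "-f", exp_file, "-d", str(devices), "-b", str(batch_size)]
    if fp16:
        cmd.append("--fp16")
    if occupy:
        cmd.append("-o")
    cmd += ["-c", ckpt]
    return cmd

cmd = build_train_command("exps/example/yolox_voc/yolox_voc_s.py", "/content/yolox_s.pth")
print(shlex.join(cmd))
```

If training runs out of GPU memory, lowering `batch_size` is usually the first thing to try.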

# Evaluate the Model

In [None]:
MODEL_PATH = "/content/YOLOX/YOLOX_outputs/yolox_voc_s/latest_ckpt.pth.tar"
!python3 tools/eval.py -n yolox-s -c {MODEL_PATH} -b 64 -d 1 --conf 0.001 -f exps/example/yolox_voc/yolox_voc_s.py

2022-02-21 14:52:17 | INFO     | __main__:129 - Args: Namespace(batch_size=64, ckpt='/content/YOLOX/YOLOX_outputs/yolox_voc_s/latest_ckpt.pth.tar', conf=0.001, devices=1, dist_backend='nccl', dist_url=None, exp_file='exps/example/yolox_voc/yolox_voc_s.py', experiment_name='yolox_voc_s', fp16=False, fuse=False, local_rank=0, machine_rank=0, name='yolox-s', nms=None, num_machines=1, opts=[], seed=None, speed=False, test=False, trt=False, tsize=None)
2022-02-21 14:52:18 | INFO     | __main__:139 - Model Summary: Params: 8.94M, Gflops: 26.64
2022-02-21 14:52:18 | INFO     | __main__:140 - Model Structure:
YOLOX(
  (backbone): YOLOPAFPN(
    (backbone): CSPDarknet(
      (stem): Focus(
        (conv): BaseConv(
          (conv): Conv2d(12, 32, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1), bias=False)
          (bn): BatchNorm2d(32, eps=0.001, momentum=0.03, affin

# Test the Model
Replace the `TEST_IMAGE_PATH` variable with the path to a test image from your own dataset.

In [None]:
TEST_IMAGE_PATH = "/content/valid/BloodImage_00057_jpg.rf.1ee93e9ec4d76cfaddaa7df70456c376.jpg"
!python tools/demo.py image -f /content/YOLOX/exps/example/yolox_voc/yolox_voc_s.py -c {MODEL_PATH} --path {TEST_IMAGE_PATH} --conf 0.25 --nms 0.45 --tsize 640 --save_result --device gpu

2022-02-21 14:52:23.208 | INFO     | __main__:main:240 - Args: Namespace(camid=0, ckpt='/content/YOLOX/YOLOX_outputs/yolox_voc_s/latest_ckpt.pth.tar', conf=0.25, demo='image', device='gpu', exp_file='/content/YOLOX/exps/example/yolox_voc/yolox_voc_s.py', experiment_name='yolox_voc_s', fp16=False, fuse=False, name=None, nms=0.45, path='/content/valid/BloodImage_00057_jpg.rf.1ee93e9ec4d76cfaddaa7df70456c376.jpg', save_result=True, trt=False, tsize=640)
2022-02-21 14:52:23.461 | INFO     | __main__:main:250 - Model Summary: Params: 8.94M, Gflops: 26.64
2022-02-21 14:52:25.862 | INFO     | __main__:main:261 - loading checkpoint
Traceback (most recent call last):
  File "tools/demo.py", line 296, in <module>
    main(exp, args)
  File "tools/demo.py", line 262, in main
    ckpt = torch.load(ckpt_file, map_location="cpu")
  File "/usr/loc
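
The demo command takes a confidence threshold (`--conf 0.25`) and a non-maximum suppression threshold (`--nms 0.45`). The sketch below illustrates what those two numbers control, using a plain-Python greedy NMS for a single class. It is an illustration of the general technique, not YOLOX's own post-processing code, and the example boxes are made up.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

def filter_detections(dets, conf_thresh=0.25, nms_thresh=0.45):
    """dets: list of (box, score). Drop low-confidence boxes, then apply greedy NMS."""
    candidates = sorted((d for d in dets if d[1] >= conf_thresh),
                        key=lambda d: d[1], reverse=True)
    kept = []
    for box, score in candidates:
        # Keep a box only if it does not overlap too much with a higher-scoring kept box.
        if all(iou(box, k[0]) <= nms_thresh for k in kept):
            kept.append((box, score))
    return kept

dets = [
    ((10, 10, 110, 110), 0.90),    # strongest detection
    ((12, 12, 112, 112), 0.80),    # near-duplicate of the first, suppressed by NMS
    ((200, 200, 260, 260), 0.60),  # separate object, kept
    ((0, 0, 20, 20), 0.10),        # below the confidence threshold, dropped
]
print(filter_detections(dets))
```

Raising `--conf` removes weak detections; lowering `--nms` suppresses overlapping boxes more aggressively.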

# Visualize the Predictions
Replace `OUTPUT_IMAGE_PATH` with the path of the image written by the previous cell. Results are saved under `YOLOX_outputs/yolox_voc_s/vis_res/`, in a folder named after the run's timestamp.


In [None]:
from PIL import Image
OUTPUT_IMAGE_PATH = "/content/YOLOX/YOLOX_outputs/yolox_voc_s/vis_res/2021_08_01_19_51_59/BloodImage_00057_jpg.rf.1ee93e9ec4d76cfaddaa7df70456c376.jpg" 
Image.open(OUTPUT_IMAGE_PATH)

FileNotFoundError: ignored
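
A hard-coded output path stops working as soon as the timestamped folder name changes, which is one way to end up with a `FileNotFoundError` like the one above. As a sketch, the newest result image can be looked up instead. `latest_result_image` is a hypothetical helper, and it assumes results are saved as `.jpg` files.

```python
from pathlib import Path

def latest_result_image(vis_root):
    """Return the most recently modified .jpg under vis_root, or None if there is none."""
    images = list(Path(vis_root).rglob("*.jpg"))
    if not images:
        return None
    return max(images, key=lambda p: p.stat().st_mtime)
```

In the notebook this could be used as `Image.open(latest_result_image("/content/YOLOX/YOLOX_outputs/yolox_voc_s/vis_res"))`, after checking that the result is not `None`.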

# Export Trained Weights for Future Inference

Now that you have trained your custom detector, you can export the trained weights to Google Drive and use them for inference on another device.

In [None]:
from google.colab import drive
drive.mount('/content/gdrive')

In [None]:
%cp {MODEL_PATH} /content/gdrive/My\ Drive
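
The `%cp` magic gives no feedback if the checkpoint path is wrong. A Python alternative, sketched below, fails loudly when the source is missing and returns the destination path. `export_checkpoint` is a hypothetical helper; it handles destination folders with spaces in their names, such as `My Drive`, without escaping.

```python
import shutil
from pathlib import Path

def export_checkpoint(src, dest_dir):
    """Copy src into dest_dir (created if missing) and return the new file's path."""
    src = Path(src)
    if not src.is_file():
        raise FileNotFoundError(f"checkpoint not found: {src}")
    dest_dir = Path(dest_dir)
    dest_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy2(src, dest_dir / src.name))
```

In the notebook this could be called as `export_checkpoint(MODEL_PATH, "/content/gdrive/My Drive")` once the drive is mounted.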