
xView3 Challenge

The goal of this project is to detect, classify, and estimate the length of dark vessels (e.g., ships) in SAR satellite imagery using pixel-wise mask segmentation, as a participant in the xView3: Dark Vessels Challenge in the fall of 2021.

Below is an example of xView3 validation detections from the initial yolact ship detector on a few images:

Solution description

Abstract—This report describes the techniques and experiments for improving automatic ship detection from synthetic aperture radar (SAR) satellite imagery as a participant in the xView3 Dark Vessels Challenge 2021. The xView3 Challenge provides a large multi-dimensional dataset of SAR satellite views to benchmark new approaches to automatically detecting illegal fishing activities at a global scale. Computer vision methods and Azure Machine Learning services are utilized in this challenge, aiming to advance research contributions in extracting accurate ship masks and dimensions to enable performance improvements in ship detection, ship classification, and estimating the length of detected ships. The initial technique was tested and evaluated on the xView3 Challenge public dataset for benchmarking the performance of the trained models, where the proposed method ranked 45th on the leaderboard. While areas of improvement remain and the detector and classifier do not outperform the xView3 reference model, the proposed method for estimating the length of ships produced positive results. Visual comparisons of the proposed method for delineating vessel outlines in SAR images using mask segmentation indicated better ground truths than those provided by experts using manual analysis; however, manual expert review may still be needed to verify the classification of ships.

Update: at the end of the xView3 challenge, our submission ranked in the top 50 (rank 45). This was before xView3 removed unverified submissions.

  • paper draft: Improve Illegal Ship Detection Using Pixel-Wise Mask.pdf
  • presentation: paper/xView3 Challenge Paper Summary Presentation.pdf

Results

Model  | Length Est Type | aggregate score | loc fscore | l-fscore | vessel fscore | fishing fscore | length acc
xView3 | Fixed           | 0.190           | 0.426      | 0.121    | 0.712         | 0.398          | 0.000
yolact | Pixel           | 0.163           | 0.243      | 0.166    | 0.921         | 0.752          | 0.516
yolact | Fixed           | 0.155           | 0.249      | 0.166    | 0.921         | 0.752          | 0.341
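
For context, the aggregate score column is a function of the other metrics. Below is a minimal sketch of the public xView3 aggregate metric; reading the table's l-fscore column as the close-to-shore localization fscore is our interpretation:

```python
# Sketch of the public xView3 aggregate metric: the localization fscore
# scales the mean of the remaining per-task scores.
def aggregate(loc_fscore, loc_fscore_shore, vessel_fscore, fishing_fscore, length_acc):
    return loc_fscore * (1 + loc_fscore_shore + vessel_fscore + fishing_fscore + length_acc) / 5

# xView3 reference row above: 0.426 * (1 + 0.121 + 0.712 + 0.398 + 0.000) / 5 ~= 0.190
print(aggregate(0.426, 0.121, 0.712, 0.398, 0.000))
```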

Challenge Description

The xView3: Dark Vessels Challenge leaderboard performance is tested on the public dataset, which contains no labels, only scenes consisting of a set of co-registered SAR images indexed by a unique xView3 scene ID.

Submission task: for each scene in the public xView3 challenge dataset, the trained model must:

  1. identify the maritime objects
  2. estimate the length of each object
  3. classify each object as a vessel or non-vessel
  4. classify each vessel as fishing or non-fishing (non-vessels are assumed to be non-fishing)

Getting Started (Quick Start)

Refer to docs/install_notes.md for detailed instructions. The solution consists of the following components, performed consecutively:

  • Prepare data and metadata
  • Generate training features for the DNN
  • Train the DNN on the features and metadata
  • Run inference and create the submission

Requirements

To run this repo, create a conda environment with all the necessary packages: pip install -r requirements.txt. Refer to docs/install_notes.md for detailed instructions.

Most training and development was done on an Azure VM configured with 1-2 Nvidia Tesla K80 GPUs. Overall, training requires at least 2 GPUs with 12 GB of memory each. The batch size should be adjusted according to the number of GPUs when training.

  • OS: Ubuntu 18.04 LTS
  • CUDA: 11.0

TODO: experiment with V100 GPUs to evaluate whether performance increases with a larger batch size.

Data Preparations

Download and extract the dataset from the xView3 site: https://iuu.xview.us/download-links (you will first have to create an account).

Preprocessing is done by running a couple of scripts:

1. Split raw SAR Images into image chips

Run python datasets/split_images.py. This script splits the images into smaller chips that can be used for DNN training. Images are pickled as NumPy arrays (.npy). Update the script parameters before running. Note: the downloading of the challenge zips and the splitting of images are still naive in this approach and require at least 1 TB of storage. The splitting and unpacking of the zips can also take multiple days; on 2 machines, it took over a week for the entire dataset (train, val, public test) to be completely unpacked and split. A more efficient approach is possible.
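
As a rough illustration of the chipping step, here is a minimal sketch; the chip size, naming scheme, and non-overlapping grid are assumptions, with the real logic living in datasets/split_images.py:

```python
# Minimal sketch of chipping one SAR band into 200x200 .npy chips.
import numpy as np

CHIP = 200  # training chips are 200x200 pixels

def split_scene(scene: np.ndarray, out_prefix: str) -> None:
    """Split one SAR band into non-overlapping CHIP x CHIP .npy chips."""
    rows, cols = scene.shape
    for r in range(0, rows - CHIP + 1, CHIP):
        for c in range(0, cols - CHIP + 1, CHIP):
            # keep the chip origin (r, c) in the filename so detections can
            # later be mapped back to full-scene pixel coordinates
            np.save(f"{out_prefix}_r{r}_c{c}.npy", scene[r:r + CHIP, c:c + CHIP])

split_scene(np.random.rand(1000, 1000).astype(np.float32), "scene_0001_VH")
```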

2. Generate Masks

Run python datasets/ship_mask_utils/main.py. This script generates pixel masks for each of the maritime objects in the chip annotation .csv. The output of this script is a coco_annotation.json file with segmentations.
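
For illustration, here is a minimal sketch of the kind of COCO-style record this step emits; the field values and polygon below are toy examples, and the real script derives them from the chip annotation .csv:

```python
# Toy example of one COCO-style image/annotation record with a segmentation.
import json
import numpy as np

def mask_to_bbox(mask: np.ndarray):
    """Return COCO-style [x, y, w, h] for the non-zero region of a binary mask."""
    ys, xs = np.nonzero(mask)
    x0, y0 = xs.min(), ys.min()
    return [int(x0), int(y0), int(xs.max() - x0 + 1), int(ys.max() - y0 + 1)]

mask = np.zeros((200, 200), dtype=np.uint8)
mask[90:110, 95:130] = 1  # toy vessel footprint

coco = {
    "images": [{"id": 1, "file_name": "chip_0001.npy", "height": 200, "width": 200}],
    "categories": [{"id": 1, "name": "fishing"},
                   {"id": 2, "name": "non_fishing"},
                   {"id": 3, "name": "other"}],
    "annotations": [{
        "id": 1, "image_id": 1, "category_id": 2,
        "bbox": mask_to_bbox(mask),
        "area": int(mask.sum()),
        "iscrowd": 0,
        # polygon segmentation: [x1, y1, x2, y2, ...] around the footprint
        "segmentation": [[95, 90, 130, 90, 130, 110, 95, 110]],
    }],
}

with open("coco_annotation.json", "w") as f:
    json.dump(coco, f)
```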

Training Dataset

The final training dataset consists of 44,383 training images and ~5,000 validation images. Each image is a 200x200 pixel .npy file. From the scripts run previously in datasets/ship_mask_utils/, we utilize the COCO JSON annotation files that contain the file names and labels. The categories for this challenge are broken into 3 common categories that must be classified: fishing, non-fishing, and other.
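
A minimal sketch of loading one chip for training; the NaN handling and 3-channel stacking are assumptions about how the .npy chips are fed to the detector:

```python
# Toy example of reading a 200x200 chip saved by the chipping step.
import numpy as np

chip = np.load("chip_0001.npy")                # (200, 200) SAR backscatter
chip = np.nan_to_num(chip, nan=0.0)            # SAR scenes carry no-data NaNs
image = np.stack([chip, chip, chip], axis=-1)  # detector expects 3 channels
```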

Below is a table for the training dataset and the instances distributed across the label categories.

fishing | non_fishing | other | total
12535   | 19408       | 14873 | 46816

Training

YOLACT instance segmentation is used for this challenge solution. Install yolact as defined in the submodule repos/yolact. The scripts and model configuration are defined in trainers/yolact_xview.

A ResNet-50 backbone is utilized. During training, checkpoints are saved every 10 epochs.
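
For illustration, here is a rough sketch of what registering the dataset and model in yolact's data/config.py can look like; the names and paths below are illustrative, and the actual settings live in trainers/yolact_xview:

```python
# Sketch of a custom yolact dataset/model config (added to yolact's
# data/config.py); names and paths here are illustrative.
xview3_dataset = dataset_base.copy({
    'name': 'xView3 chips',
    'train_images': '/data/xview3/train_chips',
    'train_info': '/data/xview3/coco_annotation.json',
    'valid_images': '/data/xview3/val_chips',
    'valid_info': '/data/xview3/val_coco_annotation.json',
    'class_names': ('fishing', 'non_fishing', 'other'),
})

yolact_xview3_config = yolact_resnet50_config.copy({
    'name': 'yolact_xview3',
    'dataset': xview3_dataset,
    'num_classes': len(xview3_dataset.class_names) + 1,  # + background
    'max_size': 200,  # chips are 200x200
})

# then, from repos/yolact:
#   python train.py --config=yolact_xview3_config --batch_size=8
```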

AzureML and Training Dockerfile (TODO)

TODO: need to add to repo

The training environments for running yolact/detectron2 models are built as Docker images. Refer to the submodule repos/azureml_cv for details regarding the yolact/detectron2 Dockerfiles and samples used for kicking off training experiments in AzureML.

Inference & Create Submission

To test the model on a new scene or a folder of scenes, run detect.py.

Create submission: detect.py is also the script used to create the submission .csv.

Submission format: the xView3 challenge requires prediction results to be provided as a .csv file with the following headings (a sketch of assembling such a file follows the list):

  • scene_id: (str) the unique ID for the xView3 scene
  • detect_scene_row: (int) pixel coordinate in the vertical (y) axis
  • detect_scene_column: (int) pixel coordinate in the horizontal (x) axis
  • is_vessel: (bool), True if the object is a vessel; False otherwise
  • is_fishing: (bool), True if the object is a fishing vessel; False otherwise
  • vessel_length_m: (float), estimated length of the vessel, in meters
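
A minimal sketch of assembling a file in this format follows; the prediction row is a toy example, and in this repo detect.py produces the real file:

```python
# Toy example of writing the submission .csv in the required format.
import pandas as pd

predictions = [{
    "scene_id": "scene_0001",  # illustrative ID, not a real xView3 scene
    "detect_scene_row": 4024,
    "detect_scene_column": 17821,
    "is_vessel": True,
    "is_fishing": False,
    "vessel_length_m": 38.2,
}]

pd.DataFrame(predictions, columns=[
    "scene_id", "detect_scene_row", "detect_scene_column",
    "is_vessel", "is_fishing", "vessel_length_m",
]).to_csv("submission.csv", index=False)
```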

source: https://iuu.xview.us/challenge

Note: all participants must register and accept the terms of the xView3 challenge before being able to submit.