# FPGA-based Binocular Image Feature Extraction and Matching System

Qi Ni, Fei Wang\*, Ziwei Zhao, Peng Gao Harbin Institute of Technology, Shenzhen

## Introduction



Visual Odometer



AR





Image Stitching

# Introduction





performance and power efficiency

#### Contribution

1) A binocular system on FPGA is built, which covers SURF detection, BRIEF description and matching, and runs at a frame rate of 162fps @640 x 480.

2) Standard AXI protocol is used, and different IPs can be flexibly exchanged or configured. Consequently, the design can be extended to more complex applications.

# System Design



#### Feature Extractor



## Feature Matcher - Trace and Stereo Matching



**Stereo Matching** 

### Feature Matcher - Pipeline



#### Feature Matcher - Buffer



WP: write pointer for previous left extraction WR: write pointer for current right extraction RP: read pointer for previous left extraction RR: read pointer for current left extraction NL/R: the left/right extraction result that is read or wrote

#### Feature Matcher – Match Executor



T: trace matching

S: stereo matching

# Experiments – Trace Matching



# Experiments – Stereo Matching



### Experiments – Error Analysis



Data set: http://www.robots.ox.ac.uk/~vgg/research/affine/

# Experiments – FPGA Utilization

| Module     | LUT<br>(as logic) | LUT<br>(as memory) | FF         | BRAM        | DSP       |
|------------|-------------------|--------------------|------------|-------------|-----------|
| detector   | 10878             | 5702               | 18435      | 26          | 160       |
| descriptor | 3097              | 2281               | 9272       | 68          | 0         |
| matcher    | 3479              | 26                 | 2134       | 22.5        | 0         |
| Total      | 17454 (%18)       | 8009 (%11)         | 29841 (%7) | 116.5 (%21) | 160 (18%) |
| Available  | 148200            | 70400              | 437200     | 545         | 900       |

Chip: Xilinx SOC Zynq XC7Z045

# Experiments – Performance Comparison

| Ver.                  | Detection | Description | Architecture | Clock<br>(MHz) | Image Size | Frame Rate (fps) |
|-----------------------|-----------|-------------|--------------|----------------|------------|------------------|
| (Bonato et al., 2008) | SIFT      | SIFT        | Stratix+NIOS | 100            | 320×240    | 30               |
| (Fowers et al., 2013) | SURF      | BASIS       | Virtex+CPU   | 400            | 640×480    | 30               |
| (Heo et al., 2013)    | FAST      | BRIEF       | ZYNQ         | 100            | 640×480    | 55               |
| (Wang et al., 2014)   | SIFT      | BRIEF       | Virtex       | 200            | 1280×720   | 60               |
| (Fularz et al., 2015) | FAST      | BRIEF       | ZYNQ         | 100            | 640×480    | 308              |
| Our (Monocular)       | SURF      | BRIEF       | ZYNQ         | 100            | 640×480    | 325              |
| Our (Binocular )      | SURF      | BRIEF       | ZYNQ         | 100            | 640×480    | 162              |

# Thank you!