Segmentation with Structured and Contextual Features

Motivation

A major bottleneck of pedestrian detection lies on the sharp performance deterioration in the presence of small-size pedestrians as being relatively far from the camera. As presented in Fig.1, a typical image often contains multiple pedestrians of different scales, and current detection performance varies significantly over scales: The state-of-the-art detectors typically work reasonably well with large size pedestrians where the objects are near the camera, also referred to as near-scale; In regard to small size (i.e. far-scale) ones, their performance becomes however considerably worse. Take one latest effort, MS-CNN for example, it has been reported that empirically their detector is capable of achieving 3.30% log-average miss rate for near-scale pedestrians (higher than 80 pixels in height) in Caltech Pedestrian Benchmark, the error rate however increases to 60.51% for medium- and far-scale pedestrians (lower than 80 pixels in height).

Pipeline of our approach.

Figure 1: In pedestrian detection, a typical input image usually contains multiple pedestrian instances over different scales. (a) An input image from the Caltech benchmark. (b) The scale distribution of pedestrian heights from the same Caltech dataset. One can observe that far-scale instances in fact dominate the distribution. (c) and (d) display exemplar visual appearance between near- and far-scale instances, as well as the corresponding neuronal feature representations from the proper layers.

Our Approach

Pipeline of our approach.

Figure 1: The flowchart of our proposed approach. Multi-layer representations of ResNet are respectively utilized to compile pedestrian proposals of different sizes, which are then passed to our localization policy module to produce the final outputs.

Demo video

Video

Related publications, code, and results

Xiaowei Zhang, Li Cheng, Bo Li, and Hai-miao Hu. Too Far to See? Not Really! --- Pedestrian Detection with Scale-aware Localization Policy. In arXiv, 2017. [pdf] [Supplementary file] [Source code] [Detection results on Caltech, ETH, and TUD-Brussels benchmarks]

Too Far to See? Not Really!

--- Pedestrian Detection with Scale-aware Localization Policy

Motivation

Our Approach

Demo video

Video

Related publications, code, and results