Videos, Slides, Films

Enhancing Semantic Segmentation of Aerial Images with Inhibitory Neurons

Conferences
ICPR 2020 MAIN CONFERENCE PS T1.7: Supervised and Semi-supervised Learning (2021)
Available as
Online
Summary

In a Convolutional Neural Network, each neuron in the output feature map takes input from the neurons in its receptive field. This receptive field concept plays a vital role in today's deep neural ...

In a Convolutional Neural Network, each neuron in the output feature map takes input from the neurons in its receptive field. This receptive field concept plays a vital role in today's deep neural networks. However, inspired by neuro-biological research, it has been proposed to add inhibitory neurons outside the receptive field, which may enhance the performance of neural network models. In this paper, we begin with deep network architectures such as VGG and ResNet, and propose an approach to add lateral inhibition in each output neuron to reduce its impact on its neighbours, both in fine-tuning pre-trained models and training from scratch. Our experiments show that notable improvements upon prior baseline deep models can be achieved. A key feature of our approach is that it is easy to add to baseline models; it can be adopted in any model containing convolution layers, and we demonstrate its value in applications including object recognition and semantic segmentation of aerial images, where we show state-of-the-art result on the Aeroscape dataset. On semantic segmentation tasks, our enhancement achieves 13.43% higher mIoU than a single baseline model on a single source (the Aeroscape dataset) and 7.03% higher the previous state-of-the art result, which was an ensemble model trained on multiple sources. Our experiments illustrate the potential impact of using inhibitory neurons in deep learning models, and they also show better results than the baseline models that have standard convolutional layers.

Details

Additional Information