Sparse Shape Encoding for Improved Instance Segmentation
Date
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Abstract
Neurophysiological studies suggest that neurons in the intermediate visual area V4 of the primate cortex encode a sparse representation of object shape. While there are metabolic arguments for such sparse representations, there are also potential advantages for inference. Here we explore whether sparse shape encoding can yield benefits for instance segmentation. Specifically, we encode 2D object shape using a Distance Transform Map(DTM) and learn a sparse basis for this representation. To make use of this encoding, we design an instance segmentation head to estimate the sparse coefficients of each object, and then recover the shape from the zero-crossing level set of the corresponding DTM. Our novel SparseShape encoding approach produces fewer topological errors than the state-of-the-art, yields competitive mask AP on the MS COCO benchmark, and exhibits superior generalization performance on the Cityscapes traffic instance segmentation task.