Sparse Shape Encoding for Improved Instance Segmentation

Liu, Keyi

Sparse Shape Encoding for Improved Instance Segmentation

Files

Liu_Keyi_2023_Masters.pdf (31 MB)

Date

2023-08-04

Authors

Liu, Keyi

Abstract

Neurophysiological studies suggest that neurons in the intermediate visual area V4 of the primate cortex encode a sparse representation of object shape. While there are metabolic arguments for such sparse representations, there are also potential advantages for inference. Here we explore whether sparse shape encoding can yield benefits for instance segmentation. Specifically, we encode 2D object shape using a Distance Transform Map(DTM) and learn a sparse basis for this representation. To make use of this encoding, we design an instance segmentation head to estimate the sparse coefficients of each object, and then recover the shape from the zero-crossing level set of the corresponding DTM. Our novel SparseShape encoding approach produces fewer topological errors than the state-of-the-art, yields competitive mask AP on the MS COCO benchmark, and exhibits superior generalization performance on the Cityscapes traffic instance segmentation task.

Keywords

Artificial intelligence, Computer science

URI

https://hdl.handle.net/10315/41293

Collections

Computer Science
Theses and Dissertations

Full item page

Sparse Shape Encoding for Improved Instance Segmentation

Files

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

Abstract

Description

Keywords

Citation

URI

Collections