Elder, JamesLiu, Keyi2023-08-042023-08-042023-08-04https://hdl.handle.net/10315/41293Neurophysiological studies suggest that neurons in the intermediate visual area V4 of the primate cortex encode a sparse representation of object shape. While there are metabolic arguments for such sparse representations, there are also potential advantages for inference. Here we explore whether sparse shape encoding can yield benefits for instance segmentation. Specifically, we encode 2D object shape using a Distance Transform Map(DTM) and learn a sparse basis for this representation. To make use of this encoding, we design an instance segmentation head to estimate the sparse coefficients of each object, and then recover the shape from the zero-crossing level set of the corresponding DTM. Our novel SparseShape encoding approach produces fewer topological errors than the state-of-the-art, yields competitive mask AP on the MS COCO benchmark, and exhibits superior generalization performance on the Cityscapes traffic instance segmentation task.Author owns copyright, except where explicitly noted. Please contact the author directly with licensing requests.Artificial intelligenceComputer scienceSparse Shape Encoding for Improved Instance SegmentationElectronic Thesis or Dissertation2023-08-04Instance segmentationDeep learningShape representationSparse codingConvolutional neural networksDistance transform