LIP: Learning instance propagation for video object segmentation

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

In recent years, the task of segmenting foreground objects from background in a video, i.e. video object segmentation (VOS), has received considerable attention. In this paper, we propose a single end-to-end trainable deep neural network, convolutional gated recurrent Mask-RCNN, for tackling the semi-supervised VOS task. We take advantage of both the instance segmentation network (Mask-RCNN) and the visual memory module (Conv-GRU) to tackle the VOS task. The instance segmentation network predicts masks for instances, while the visual memory module learns to selectively propagate information for multiple instances simultaneously, which handles the appearance change, the variation of scale and pose and the occlusions between objects. After offline and online training under purely instance segmentation losses, our approach is able to achieve satisfactory results without any post-processing or synthetic video data augmentation. Experimental results on DAVIS 2016 dataset and DAVIS 2017 dataset have demonstrated the effectiveness of our method for video object segmentation task.

Original languageEnglish
Title of host publicationProceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019
PublisherIEEE
Pages2739-2748
Number of pages10
ISBN (Electronic)9781728150239
DOIs
Publication statusPublished - Oct 2019
Event17th IEEE/CVF International Conference on Computer Vision, ICCV 2019 - Seoul, Korea, Republic of
Duration: 27 Oct 20192 Nov 2019
Conference number: 17
http://iccv2019.thecvf.com/

Publication series

NameProceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019

Conference

Conference17th IEEE/CVF International Conference on Computer Vision, ICCV 2019
Abbreviated titleICCV 2019
CountryKorea, Republic of
CitySeoul
Period27/10/192/11/19
Internet address

Keywords

  • Deep learning
  • Instance segmentation
  • Memory module
  • Video object segmentation

Fingerprint Dive into the research topics of 'LIP: Learning instance propagation for video object segmentation'. Together they form a unique fingerprint.

  • Cite this

    Lyu, Y., Vosselman, G., Xia, G. S., & Yang, M. Y. (2019). LIP: Learning instance propagation for video object segmentation. In Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019 (pp. 2739-2748). [9022371] (Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019). IEEE. https://doi.org/10.1109/ICCVW.2019.00335