Deep Learning for Video Object SegmentationA Review

Standard

Deep Learning for Video Object Segmentation : A Review. / Gao, Mingqi; Zheng, Feng; Yu, James et al.

In: Artificial Intelligence Review, Vol. 56, No. 1, 01.01.2023, p. 457-531.

Research output: Contribution to journalArticlepeer-review

Harvard

Gao, M, Zheng, F, Yu, J, Shan, C, Ding, G & Han, J 2023, 'Deep Learning for Video Object Segmentation: A Review', Artificial Intelligence Review, vol. 56, no. 1, pp. 457-531. https://doi.org/10.1007/s10462-022-10176-7

APA

Gao, M., Zheng, F., Yu, J., Shan, C., Ding, G., & Han, J. (2023). Deep Learning for Video Object Segmentation: A Review. Artificial Intelligence Review, 56(1), 457-531. https://doi.org/10.1007/s10462-022-10176-7

Vancouver

Gao M, Zheng F, Yu J, Shan C, Ding G, Han J. Deep Learning for Video Object Segmentation: A Review. Artificial Intelligence Review. 2023 Jan 1;56(1):457-531. Epub 2022 Apr 8. doi: 10.1007/s10462-022-10176-7

Author

Gao, Mingqi ; Zheng, Feng ; Yu, James et al. / Deep Learning for Video Object Segmentation : A Review. In: Artificial Intelligence Review. 2023 ; Vol. 56, No. 1. pp. 457-531.

Bibtex - Download

@article{1297558e33a24e64a0012d5500ab92f6,
title = "Deep Learning for Video Object Segmentation: A Review",
abstract = "As one of the fundamental problems in the field of video understanding, video object segmentation aims at segmenting objects of interest throughout the given video sequence. Recently, with the advancements of deep learning techniques, deep neural networks have shown outstanding performance improvements in many computer vision applications, with video object segmentation being one of the most advocated and intensively investigated. In this paper, we present a systematic review of the deep learning-based video segmentation literature, highlighting the pros and cons of each category of approaches. Concretely, we start by introducing the definition, background concepts and basic ideas of algorithms in this field. Subsequently, we summarise the datasets for training and testing a video object segmentation algorithm, as well as common challenges and evaluation metrics. Next, previous works are grouped and reviewed based on how they extract and use spatial and temporal features, where their architectures, contributions and the differences among each other are elaborated. At last, the quantitative and qualitative results of several representative methods on a dataset with many remaining challenges are provided and analysed, followed by further discussions on future research directions. This article is expected to serve as a tutorial and source of reference for learners intended to quickly grasp the current progress in this research area and practitioners interested in applying the video object segmentation methods to their problems. A public website is built to collect and track the related works in this field: https://github.com/gaomingqi/VOS-Review.",
keywords = "Convolutional neural network, Deep learning, Video object segmentation",
author = "Mingqi Gao and Feng Zheng and James Yu and Caifeng Shan and Guiguang Ding and Jungong Han",
note = "Publisher Copyright: {\textcopyright} 2022, The Author(s).",
year = "2023",
month = jan,
day = "1",
doi = "10.1007/s10462-022-10176-7",
language = "English",
volume = "56",
pages = "457--531",
journal = "Artificial Intelligence Review",
issn = "0269-2821",
publisher = "Springer Nature",
number = "1",

}

RIS (suitable for import to EndNote) - Download

TY - JOUR

T1 - Deep Learning for Video Object Segmentation

T2 - A Review

AU - Gao, Mingqi

AU - Zheng, Feng

AU - Yu, James

AU - Shan, Caifeng

AU - Ding, Guiguang

AU - Han, Jungong

N1 - Publisher Copyright: © 2022, The Author(s).

PY - 2023/1/1

Y1 - 2023/1/1

N2 - As one of the fundamental problems in the field of video understanding, video object segmentation aims at segmenting objects of interest throughout the given video sequence. Recently, with the advancements of deep learning techniques, deep neural networks have shown outstanding performance improvements in many computer vision applications, with video object segmentation being one of the most advocated and intensively investigated. In this paper, we present a systematic review of the deep learning-based video segmentation literature, highlighting the pros and cons of each category of approaches. Concretely, we start by introducing the definition, background concepts and basic ideas of algorithms in this field. Subsequently, we summarise the datasets for training and testing a video object segmentation algorithm, as well as common challenges and evaluation metrics. Next, previous works are grouped and reviewed based on how they extract and use spatial and temporal features, where their architectures, contributions and the differences among each other are elaborated. At last, the quantitative and qualitative results of several representative methods on a dataset with many remaining challenges are provided and analysed, followed by further discussions on future research directions. This article is expected to serve as a tutorial and source of reference for learners intended to quickly grasp the current progress in this research area and practitioners interested in applying the video object segmentation methods to their problems. A public website is built to collect and track the related works in this field: https://github.com/gaomingqi/VOS-Review.

AB - As one of the fundamental problems in the field of video understanding, video object segmentation aims at segmenting objects of interest throughout the given video sequence. Recently, with the advancements of deep learning techniques, deep neural networks have shown outstanding performance improvements in many computer vision applications, with video object segmentation being one of the most advocated and intensively investigated. In this paper, we present a systematic review of the deep learning-based video segmentation literature, highlighting the pros and cons of each category of approaches. Concretely, we start by introducing the definition, background concepts and basic ideas of algorithms in this field. Subsequently, we summarise the datasets for training and testing a video object segmentation algorithm, as well as common challenges and evaluation metrics. Next, previous works are grouped and reviewed based on how they extract and use spatial and temporal features, where their architectures, contributions and the differences among each other are elaborated. At last, the quantitative and qualitative results of several representative methods on a dataset with many remaining challenges are provided and analysed, followed by further discussions on future research directions. This article is expected to serve as a tutorial and source of reference for learners intended to quickly grasp the current progress in this research area and practitioners interested in applying the video object segmentation methods to their problems. A public website is built to collect and track the related works in this field: https://github.com/gaomingqi/VOS-Review.

KW - Convolutional neural network

KW - Deep learning

KW - Video object segmentation

UR - http://www.scopus.com/inward/record.url?scp=85127648745&partnerID=8YFLogxK

U2 - 10.1007/s10462-022-10176-7

DO - 10.1007/s10462-022-10176-7

M3 - Article

AN - SCOPUS:85127648745

VL - 56

SP - 457

EP - 531

JO - Artificial Intelligence Review

JF - Artificial Intelligence Review

SN - 0269-2821

IS - 1

ER -

Show download statistics
View graph of relations
Citation formats