A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

M. Suresha; S. Kuppa; D. S. Raghukumar

doi:10.1007/s13735-019-00190-x

Loading next page...

References (91)

M Sekma (2015)
10.1016/j.patrec.2015.06.029
Pattern Recogn Lett, 65
10.1145/2733373.2806222
K Chen (2018)
10.1007/s13735-017-0139-6
Int J Multimed Inf Retr, 7
YG Jiang (2018)
10.1109/TMM.2018.2823900
IEEE Trans Multimed, 20
10.1109/CVPR.2016.216
10.1145/2733373.2806226
D Chen (2019)
10.1038/s41746-019-0122-0
NPJ Digit Med, 2
P Mamoshina (2016)
10.1021/acs.molpharmaceut.5b00982
Mol Pharm, 13
Y Peng (2018)
10.1109/TCSVT.2018.2808685
IEEE Trans Circuits Syst Video Technol, 29
10.1109/CVPR.2011.5995407
10.1109/CVPR.2008.4587756
Y Guo (2018)
10.1007/s13735-017-0141-z
Int J Multimed Inf Retr, 7
CJ Kelly (2019)
10.1186/s12916-019-1426-2
BMC Med, 17
SB Bhorge (2018)
10.1007/s13735-018-0152-4
Int J Multimed Inf Retr, 7
10.1109/CVPR.2018.00936
DG Lowe (2004)
10.1023/B:VISI.0000029664.99615.94
Int J Comput Vis, 60
L Wang (2017)
10.1016/j.patrec.2017.04.004
Pattern Recogn Lett, 92
S Baker (2011)
10.1007/s11263-010-0390-2
Int J Comput Vis, 92
MA Goodale (1992)
10.1016/0166-2236(92)90344-8
Trends Neurosci, 15
10.1109/CVPR.2019.00136
P Wang (2018)
10.1016/j.cviu.2018.04.007
Comput Vis Image Underst, 171
Q Li (2017)
10.1007/s13735-016-0117-4
Int J Multimed Inf Retr, 6
Z Wang (2018)
10.1016/j.neucom.2018.01.076
Neurocomputing, 287
G Sreenu (2019)
10.1186/s40537-019-0212-5
J Big Data, 6
RK Tripathi (2018)
10.1007/s10462-017-9545-7
Artif Intell Rev, 50
10.1109/CISP-BMEI.2017.8302004
J Zhang (2019)
10.3390/s19010056
Sensors, 19
10.15607/RSS.2018.XIV.062
10.1109/CVPR.2016.308
I Laptev (2005)
10.1007/s11263-005-1838-7
Int J Comput Vis, 64
10.1145/2671188.2749406
10.1109/IROS.2017.8206288
Chuanqi Tan (2018)
10.1007/978-3-030-01424-7_27
10.1109/CVPR.2014.223
10.1109/CVPR.2017.226
10.1109/ICCV.2013.338
10.1109/CVPR.2015.7298789
Yang Du (2018)
10.1007/978-3-030-01270-0_23
10.1109/CVPR.2019.00377
10.1109/ICCV.2017.244
NC Mithun (2019)
10.1007/s13735-018-00166-3
Int J Multimed Inf Retr, 8
10.1109/ISACV.2018.8354045
MM Najafabadi (2015)
10.1186/s40537-014-0007-7
J Big Data, 2
U Sivarajah (2017)
10.1016/j.jbusres.2016.08.001
J Bus Res, 70
L Kangwei (2018)
10.1007/s11760-017-1153-0
Signal Image Video Process, 12
10.1109/DICTA.2012.6411699
Konstantinos Papadopoulos (2019)
10.3390/s19163503
Sensors, 19
Li Yao (2016)
10.1155/2016/1760172
Computational Intelligence and Neuroscience, 2016
R Melfi (2013)
10.1016/j.patrec.2013.04.025
Pattern Recogn Lett, 34
10.1109/SIPROCESS.2016.7888355
W Zhang (2019)
10.3390/a12010008
Algorithms, 12
L Wang (2003)
10.1016/S0031-3203(02)00100-0
Pattern Recogn, 36
Darrick Evensen (2019)
10.1038/s41558-019-0481-1
Nature Climate Change, 9
YG Jiang (2017)
10.1109/TPAMI.2017.2670560
IEEE Trans Pattern Anal Mach Intell, 40
10.1109/ICCV.2015.522
G Atluri (2018)
10.1145/3161602
ACM Comput Surv: CSUR, 51
TF Gonzalez (2007)
10.1201/9781420010749
AF Bobick (2001)
10.1109/34.910878
IEEE Trans Pattern Anal Mach Intell, 3
10.1109/ICCV.2015.510
10.1109/CVPR.2015.7299059
10.1109/IECON.2018.8591338
10.1109/ICCV.2017.620
10.1109/EuroSP.2016.36
10.1109/WACV.2017.27
10.1145/1291233.1291311
WG Hatcher (2018)
10.1109/ACCESS.2018.2830661
IEEE Access, 6
KS Ray (2019)
10.1016/j.jvcir.2018.12.002
J Vis Commun Image Represent, 58
10.1109/CVPR.2018.00931
Y Yuan (2016)
10.1016/j.patcog.2016.02.022
Pattern Recogn, 59
S Levine (2018)
10.1177/0278364917710318
Int J Robot Res, 37
10.1109/ROBIO.2009.5420735
Annalisa Cocchia (2014)
10.1007/978-3-319-06160-3_2
10.1109/AVSS.2012.39
J Liu (2019)
10.1007/s10489-019-01459-8
Appl Intell, 49
10.1109/ICCV.2019.00889
10.1109/CVPR.2016.213
Z Qiu (2017)
10.1109/TMM.2017.2759504
IEEE Trans Multimed, 20
10.1109/CVPR.2016.90
Y LeCun (2015)
10.1038/nature14539
Nature, 521
10.1109/ICCV.2013.441
GJ Burghouts (2013)
10.1016/j.patrec.2013.01.024
Pattern Recogn Lett, 34
10.1109/IROS.2012.6386146
A Ullah (2017)
10.1109/ACCESS.2017.2778011
IEEE Access, 6
10.1109/CVPR.2015.7298925
10.1145/3180155.3180220
S Naseer (2018)
10.1109/ACCESS.2018.2863036
IEEE Access, 6
O Russakovsky (2015)
10.1007/s11263-015-0816-y
Int J Comput Vis, 115
MZ Alom (2019)
10.3390/electronics8030292
Electronics, 8
N Kruger (2012)
10.1109/TPAMI.2012.272
IEEE Trans Pattern Anal Mach Intell, 35
Xiao-Yu Zhang (2019)
10.1609/aaai.v33i01.33019227
Proceedings of the AAAI Conference on Artificial Intelligence, 33
Y Deldjoo (2018)
10.1007/s13735-018-0155-1
Int J Multimed Inf Retr, 7

Publisher: Springer Journals
Copyright: Copyright © Springer-Verlag London Ltd., part of Springer Nature 2020
ISSN: 2192-6611
eISSN: 2192-662X
DOI: 10.1007/s13735-019-00190-x
Publisher site: See Article on Publisher Site

Abstract

Video understanding requires abundant semantic information. Substantial progress has been made on deep learning models in the image, text, and audio domains, and notable efforts have been recently dedicated to the design of deep networks in the video domain. We discuss the state-of-the-art convolutional neural network (CNN) and its pipelines for the exploration of video features, various fusion strategies, and their performances; we also discuss the limitations of CNN for long-term motion cues and the use of sequential learning models such as long short-term memory to overcome these limitations. In addition, we address various multi-model approaches for extracting important cues and score fusion techniques from hybrid deep learning frameworks. Then, we highlight future plans in this domain, recent trends, and substantial challenges for video understanding. This survey’s objectives are to study the plethora of approaches that have been developed for solving video understanding problems, to comprehensively study spatiotemporal cues, to explore the various models that are available for solving these problems and to identify the most promising approaches.

Journal

International Journal of Multimedia Information Retrieval – Springer Journals

Published: Jun 24, 2020

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

Get 20M+ Full-Text Papers For Less Than $1.50/day. Start a 14-Day Trial for You or Your Team.

Learn More →

A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

A study on deep learning spatiotemporal models and feature extraction techniques for video understanding

References (91)

Abstract

Journal

Recommended Articles

There are no references for this article.

Our policy towards the use of cookies