Journal Article Details
PATTERN RECOGNITION Volume: 104
Timed-image based deep learning for action recognition in video sequences
Article
Atto, Abdourrahmane Mahamane [1]; Benoit, Alexandre [1]; Lambert, Patrick [1]
[1] Univ Savoie Mt Blanc, Lab Comp Sci Syst Informat & Knowledge Proc, BP 80439, F-74944 Annecy Le Vieux, France
Keywords: Data conditioning; Video analysis; Deep learning; Convolution frames; Hilbert space-filling curve; Action recognition; Violence detection
DOI: 10.1016/j.patcog.2020.107353
Source: Elsevier
【 Abstract 】

The paper addresses two issues related to machine learning on 2D + X data volumes, where 2D refers to the image observation and X denotes a variable that can be associated with time, depth, wavelength, etc. The first issue is conditioning these structured volumes for compatibility with convolutional neural networks operating on 2D image file formats. The second issue is sensitive action detection in the 2D + Time case (video clips and image time series). For the data conditioning issue, the paper first highlights that referring 2D spatial convolution to its 1D Hilbert-based instance is highly accurate for information compressibility upon tight frames of convolutional networks. As a consequence of this compressibility, the paper proposes converting the 2D + X data volume into a single meta-image file format prior to machine learning frameworks. In this conversion, each 2D frame of the 2D + X data is reshaped as a 1D array indexed by a Hilbert space-filling curve, and the third variable X of the initial file format becomes the second variable of the meta-image format. For the sensitive action recognition issue, the paper provides: (i) a 3-category video database involving non-violent, moderate-violence and extreme-violence actions; (ii) the conversion of this database into a timed meta-image database through the 2D + Time to 2D conditioning stage described above; and (iii) outstanding 2-level and 3-level violence classification results from deep convolutional neural networks operating on meta-image databases. © 2020 Elsevier Ltd. All rights reserved.
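
The conversion described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names (`hilbert_d2xy`, `hilbert_order`, `volume_to_meta_image`) are hypothetical, the frames are assumed to be square with a power-of-two side, the volume is assumed to be stored as a NumPy array of shape (X, n, n), and the curve is generated with the standard iterative index-to-coordinate algorithm. Each frame is read along the Hilbert curve into a 1D array, and the X axis (for example, time) becomes the second axis of the resulting meta-image.

```python
import numpy as np


def hilbert_d2xy(n, d):
    """Map position d along a Hilbert curve to (x, y) on an n x n grid.

    Assumes n is a power of two (standard iterative construction).
    """
    x = y = 0
    t = d
    s = 1
    while s < n:
        rx = 1 & (t // 2)
        ry = 1 & (t ^ rx)
        if ry == 0:
            if rx == 1:
                # Reflect the current sub-square.
                x, y = s - 1 - x, s - 1 - y
            # Transpose the current sub-square.
            x, y = y, x
        x += s * rx
        y += s * ry
        t //= 4
        s *= 2
    return x, y


def hilbert_order(n):
    """Return (rows, cols) index arrays that visit an n x n grid in Hilbert order."""
    coords = [hilbert_d2xy(n, d) for d in range(n * n)]
    xs, ys = zip(*coords)
    return np.array(ys), np.array(xs)


def volume_to_meta_image(volume):
    """Convert a (X, n, n) volume into a (n*n, X) meta-image.

    Row index: position along the Hilbert curve; column index: the X variable.
    """
    num_x, h, w = volume.shape
    if h != w or h < 1 or (h & (h - 1)) != 0:
        raise ValueError("this sketch assumes square frames with a power-of-two side")
    rows, cols = hilbert_order(h)
    # Advanced indexing gathers every frame along the curve: shape (X, n*n).
    return volume[:, rows, cols].T


# Hypothetical usage: 5 frames of size 4 x 4 become one 16 x 5 meta-image.
clip = np.arange(5 * 4 * 4).reshape(5, 4, 4)
meta = volume_to_meta_image(clip)
```

Because consecutive positions on the Hilbert curve are always neighbouring pixels, pixels that are close in the 1D ordering are also close in the original frame, which is the locality property the data conditioning stage relies on. Frames whose side is not a power of two would need padding or resizing first; that step is left out of this sketch.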

【 License 】

Free   
