面向人群计数的对偶卷积神经网络

Dual Convolutional Neural Networks for Crowd Counting

  • 摘要: 基于卷积神经网络(CNN)人群计数方法的核心是如何构建能将人群图有效映射为密度图的CNN模型,进而从密度图准确地估计出人群数量.传统构建CNN的方法只考虑了人群图到密度图的映射,并未考虑密度图到人群图的映射,以及该映射对模型性能的影响.为了解决以上问题,提出一种基于卷积神经网络的对偶模型(Dual Convolutional Neural Networks,DualCNN)以提高模型将人群图映射为密度图准确性.DualCNN包含有两个映射子模型:1)将人群图映射到密度图的卷积神经模型,2)将密度图映射到人群图的对偶卷积神经模型.在学习过程中,通过两个子模型的相互影响,进而达到提高卷积神经模型在人群计数问题上的性能.在UCF_CC_50数据集和ShangHaitech数据集上的实验结果表明,该方法能有效提升CNN的计数性能,尤其在UCF_CC_50数据集中,DualCNN将MCNN和CSRNet的平均绝对误差(MAE)分别降低15.6%和15.8%,最小均方误差(MSE)分别降低18.1%和28.8%.

     

    Abstract: The core of crowd counting methods based on convolutional neural networks (CNN) is to construct a CNN model that can effectively mapping the crowd image to the density map, and then accurately estimate the number of crowd from the density map. The conventional CNN based methods only consider the mapping from crowd images to density maps without taking that from density maps to crowd maps and the impact of the mapping on model performance. A dual convolutional neural networks (DualCNN) is proposed to improve the accuracy of the mapping from crowd maps to density maps. DualCNN contains two mapping sub-models: 1) a convolutional neural model that mapping a crowd image to a density map; 2) a dual convolutional neural model that mapping a density map to a crowd image. In the learning process, through the mutual influence of the two sub-models, the performance of the convolutional neural model on crowd counting is improved.Experimental results on the datasets of UCF_CC_50 and ShangHaitech show that the presented method can effectively improve the counting performance of CNN. Especially in the UCF_CC_50 data set, DualCNN reduces the MAE of MCNN and CSRNet by 15.6% and 15.8%, and MSE by 18.1% and 28.8%, respectively.

     

/

返回文章
返回