Advanced Search
LEI Hanlin, ZHANG Baohua. Crowd counting algorithm based on multi-model deep convolution network integration[J]. LASER TECHNOLOGY, 2019, 43(4): 476-481. DOI: 10.7510/jgjs.issn.1001-3806.2019.04.008
Citation: LEI Hanlin, ZHANG Baohua. Crowd counting algorithm based on multi-model deep convolution network integration[J]. LASER TECHNOLOGY, 2019, 43(4): 476-481. DOI: 10.7510/jgjs.issn.1001-3806.2019.04.008

Crowd counting algorithm based on multi-model deep convolution network integration

More Information
  • Received Date: September 17, 2018
  • Revised Date: October 23, 2018
  • Published Date: July 24, 2019
  • To avoid the interference of depth of field and occlusion and improve the accuracy of crowd counting, three models of LeNet-5, AlexNet and VGG-16 were adopted and the characteristics of objects with different depth of field in the image were extracted. After adjusting the convolution core size and network structure of the above model, model fusion was carried out. A deep convolution neural network structure based on multi-model fusion was constructed. In the last two layers of the network, the convolution layer with convolution core size of 1×1 was used to replace the traditional full connection layer. The extracted feature maps were integrated with information and the density maps were output. The network parameters were greatly reduced and some improved data was obtained. The efficiency and accuracy of the algorithm were taken into account. The theoretical analysis and experimental verification were carried out. The results show that, in public population counting data set of two subsets of shanghaitech and UCF_CC_50, the mean absolute error and mean square error of this method are 97.99 and 158.02, 23.36 and 41.86, 354.27 and 491.68, respectively. It achieves better performance than the existing traditional crowd counting methods. At the same time, migration experiments are carried out. It proves that the population counting model proposed in this paper has good generalization ability. This study is helpful to improve the accuracy of population counting.
  • [1]
    ZHANG Y Y, ZHOU D S, CHEN S Q, et al. Single-image crowd counting via multi-column convolutional neural network[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2016: 589-597.
    [2]
    LI M, ZHANG Z, HUANG K, et al. Estimating the number of people in crowded scenes by mid based foreground segmentation and head-shoulder detection[C]//International Conference on Pattern Recognition. New York, USA: IEEE, 2008: 1-4.
    [3]
    GE W, COLLINS R T. Marked point processes for crowd counting[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2009: 2913-2920.
    [4]
    ZHAO T, NEVATIA R, WU B. Segmentation and tracking of multiple humans in crowded environments[J]. Pattern Analysis and Machine Intelligence, 2008, 30(7): 1198-1211. DOI: 10.1109/TPAMI.2007.70770
    [5]
    CHAN A B, VASCONCELOS N. Counting people with low-level features and bayesian regression[J]. IEEE Transactions on Image Processing, 2012, 21(4): 2160-2177. DOI: 10.1109/TIP.2011.2172800
    [6]
    JI L N, CHEN Q K, CHEN Y J, et al. Real-time crowd counting method from video stream based on GPU[J]. Journal of Computer Applications, 2017, 37(1): 145-152(in Chinese). http://d.old.wanfangdata.com.cn/Periodical/jsjyy201701025
    [7]
    IDREES H, SALEEMI I, SEIBERT C, et al. Multi-source multi-scale counting in extremely dense crowd images[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2013: 2547-2554.
    [8]
    BANSAL A, VENKATESH K. People counting in high density crowds from still images[J]. International Journal of Computer and Electrical Engineering, 2015, 7(5): 316-324. DOI: 10.17706/IJCEE.2015.7.5.316-324
    [9]
    CHEN K, LOY C C, GONG S, et al. Feature mining for localised crowd counting[C]//British Machine Vision Conference. Durham, England: British Machine Vision Association, 2012: 1-3.
    [10]
    HUANG S, LI X, ZHANG Z, et al. Body structure aware deep crowd counting[J]. IEEE Transactions on Image Processing, 2017, 27(3): 49-59.
    [11]
    ZENG L, XU X, CAI B, et al. Multi-scale convolutional neural networks for crowd counting[C]//International Conference on Image Processing. New York, USA: IEEE, 2017: 465-469.
    [12]
    XIONG F, SHI X, YEUNG D Y. Spatiotemporal modeling for crowd counting in videos[C]//International Conference on Computer Vision. New York, USA: IEEE, 2017: 5151-5159.
    [13]
    SINDAGI V A, PATEL V M. Generating high-quality crowd density maps using contextual pyramid cnns[C]//International Conference on Computer Vision. New York, USA: IEEE, 2017: 1861-1870.
    [14]
    SAM D B, SURYA S, BABU R V. Switching convolutional neural network for crowd counting[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2017: 5744-5752.
    [15]
    SINDAGI V A, PATEL V M. Cnn-based cascaded multi-task learning of high-level prior and density estimation for crowd counting[C]//International Conference on Advanced Video and Signal Based Surveillance. New York, USA: IEEE, 2017: 1-6.
    [16]
    ZHANG C, LI H, WANG X, et al. Cross-scene crowd counting via deep convolutional neural networks[C]//Computer Vision and Pattern Recognition. New York, USA: IEEE, 2015: 833-841.
    [17]
    SHELHAMER E, LONG J, DARRELL T. Fully convolutional networks for semantic segmentation[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2017, 39(4): 640-651. DOI: 10.1109/TPAMI.2016.2572683
    [18]
    SHI Z L, YE Y D, WU Y P, et al. Crowd counting using rank-based spatial pyramid pooling network[J]. Acta Automatica Sinica, 2016, 42(6): 866-874(in Chinese). http://d.old.wanfangdata.com.cn/Periodical/zdhxb201606007
    [19]
    SINDAGI V A, PATEL V M. A survey of recent advances in cnn-based single image crowd counting and density estimation[J]. Pattern Recognition Letters, 2018, 107(1): 3-16. http://cn.bing.com/academic/profile?id=8f28a1f63b3dc9e38e91c4690acb6817&encoded=0&v=paper_preview&mkt=zh-cn
    [20]
    RODRIGUEZ M, LAPTEV I, SIVIC J, et al. Density-aware person detection and tracking in crowds[C]//International Conference on Computer Vision. New York, USA: IEEE, 2011: 2423-2430.
    [21]
    LEMPITSKY V, ZISSERMAN A. Learning to count objects in images[C]//Annual Conference on Neural Information Processing Systems. Cambridge, Massachusetts, USA: MIT Press, 2010: 1324-1332.

Catalog

    Article views (4) PDF downloads (7) Cited by()

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return