Light field image compression method based on correlation of rendered views

LIU Deyang; WANG Guangjun; WU Jian; AI Liefu

doi:10.7510/jgjs.issn.1001-3806.2019.04.020

Volume 43 Issue 4

Jul. 2019

Laser Technology, 2019, 43(4): 551-556

1. School of Computer and Information, Anqing Normal University, Anqing 246000, China
2. The University Key Laboratory of Intelligent Perception and Computing of Anhui Province, Anqing Normal University, Anqing 246000, China

Received Date: 2018-08-06
Accepted Date: 2018-09-03

Abstract

To exploit the strong correlation between virtual rendered views and improve the compression efficiency of light field images, a light field image compression algorithm based on view correlation is proposed. The algorithm is built on the screen content coding extension of high efficiency video coding (HEVC). A hybrid prediction scheme that combines linear weighting with intra block copy is used to improve the prediction accuracy of coding blocks, and rate-distortion optimization adaptively selects the optimal block size and prediction mode. Experimental results show that the proposed algorithm achieves an average Bjøntegaard delta peak signal-to-noise ratio (BD-PSNR) coding gain of 2.55dB over the HEVC standard, while also rendering virtual views of better quality. The algorithm makes full use of the strong correlation between virtual rendered views and improves the coding efficiency of light field images.
Key words:
- image processing
- light field image
- image compression
- view correlation
- high efficiency video coding

References

[1]	ADELSON E H, BERGEN J R. The plenoptic function and the elements of early vision[M]. Cambridge, USA: MIT Press, 1991: 3-20.
[2]	McMILLAN L, BISHOP G. Plenoptic modeling: an image-based rendering system[C]//Proceedings of the 22nd Annual Conference on Computer Graphics and Interactive Techniques. New York, USA: Association for Computing Machinery, 1995: 39-46.
[3]	LEVOY M, HANRAHAN P. Light field rendering[C]//Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH). New York, USA: Association for Computing Machinery, 1996: 31-42.
[4]	GORTLER S J, GRZESZCZUK R, SZELISKI R, et al. The lumigraph[C]//Proceedings of the 23rd Annual Conference on Computer Graphics and Interactive Techniques (SIGGRAPH). New York, USA: Association for Computing Machinery, 1996: 43-54.
[5]	WILBURN B, JOSHI N, VAISH V, et al. High performance imaging using large camera arrays[J]. ACM Transactions on Graphics (TOG), 2005, 24(3): 765-776. doi: 10.1145/1073204
[6]	LUMSDAINE A, GEORGIEV T. The focused plenoptic camera[C]//IEEE International Conference on Computational Photography (ICCP). New York, USA: IEEE, 2009: 1-8.
[7]	PERWASS C, WIETZKE L. Single lens 3D-camera with extended depth-of-field[J]. Proceedings of the SPIE, 2012, 8291: 77-89.
[8]	SUN Y J, LEI W H, HU Y H, et al. Rapid ship detection in remote sensing images based on visual saliency model[J]. Laser Technology, 2018, 42(3): 379-384 (in Chinese).
[9]	ZHANG Ch, LIU F, HOU G Q, et al. Light field photography and its application in computer vision[J]. Journal of Image and Graphics, 2016, 21(3): 263-281 (in Chinese).
[10]	LIU D, WANG L, LI L, et al. Pseudo-sequence-based light field image compression[C]//2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). New York, USA: IEEE, 2016: 1-4.
[11]	WANG G, XIANG W, PICKERING M, et al. Light field multi-view video coding with two-directional parallel inter-view prediction[J]. IEEE Transactions on Image Processing, 2016, 25(11): 5104-5117. doi: 10.1109/TIP.2016.2603602
[12]	LI L, LI Z, LI B, et al. Pseudo sequence based 2-D hierarchical coding structure for light-field image compression[C]//2017 Data Compression Conference (DCC). New York, USA: IEEE, 2017: 131-140.
[13]	HELIN P, ASTOLA P, RAO B, et al. Sparse modeling and predictive coding of sub-aperture images for lossless plenoptic image compression[C]//2016 3DTV-Conference: The True Vision-Capture, Transmission and Display of 3D Video (3DTV-CON). New York, USA: IEEE, 2016: 1-4.
[14]	ZHU W Y, LI Y, YUAN F, et al. Multiple image fusion algorithm in wavelet domain based on JPEG[J]. Laser Technology, 2014, 38(3): 425-430 (in Chinese).
[15]	MONTEIRO R, LUCAS L, CONTI C, et al. Light field HEVC-based image coding using locally linear embedding and self-similarity compensated prediction[C]//2016 IEEE International Conference on Multimedia & Expo Workshops (ICMEW). New York, USA: IEEE, 2016: 1-4.
[16]	MONTEIRO R J S, NUNES P J L, RODRIGUES N M M, et al. Light field image coding using high order intra block prediction[J]. IEEE Journal on Selected Topics in Signal Processing, 2017, 11(7): 1120-1131. doi: 10.1109/JSTSP.2017.2721358
[17]	LIU D Y, AN P, MA R, et al. Three-dimensional holoscopic image coding scheme using high-efficiency video coding with kernel-based minimum mean-square-error estimation[J]. Journal of Electronic Imaging, 2016, 25(4): 043015. doi: 10.1117/1.JEI.25.4.043015
[18]	LI Y, SJOSTROM M, OLSSON R, et al. Scalable coding of plenoptic images by using a sparse set and disparities[J]. IEEE Transactions on Image Processing, 2016, 25(1): 80-91.
[19]	LIU D Y, AN P, MA R, et al. Disparity compensation based 3D holoscopic image coding using HEVC[C]//IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP). New York, USA: IEEE, 2015: 201-205.
[20]	JOINT COLLABORATIVE TEAM ON VIDEO CODING (JCT-VC). HEVC SCC reference software Ver. 3.0 (SCM 3.0)[EB/OL]. (2016-02-01)[2018-08-31]. https://hevc.hhi.fraunhofer.de.
[21]	YU H, COHEN R, RAPAKA K, et al. JCTVC-X1015, common test conditions for screen content coding, Geneva[EB/OL]. (2016-08-14)[2018-08-31]. http://phenix.int-evry.fr/jct/.
[22]	ROSEWARNE C, SHARMAN K, NACCARI M, et al. JCTVC-P1013, HEVC range extensions test model 6 encoder description[EB/OL]. (2014-02-23)[2018-08-31]. http://phenix.int-evry.fr/jct/.
[23]	JOSHI R, XU J, COHEN R, et al. JCTVC-Q1014, Screen content coding test model 1 (SCM 1)[EB/OL]. (2014-04-28)[2018-08-31]. http://phenix.int-evry.fr/jct/.

Introduction

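The propagation of light intensity in 3-D space can be characterized by the light field. In 1991, ADELSON and BERGEN^[1], drawing on how the human eye perceives incoming light, proposed a 7-D plenoptic function to describe the distribution of light intensity in space. Its seven dimensions are the 3-D coordinates of a point in space, the direction of light propagation, the wavelength, and time. Data of such high dimensionality, however, are difficult to record and process. McMILLAN et al.^[2] therefore simplified the plenoptic function and proposed a 5-D light field function to represent the rays in free space at any given instant. In 1996, LEVOY^[3] and GORTLER^[4] further proposed recording the angular and positional information of rays with a pair of parallel planes. This "two-plane" representation model reduces the 5-D light field function to a 4-D light field function.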
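In commonly used notation (the symbols are conventional choices and are not taken from this article), the reduction from seven to four dimensions can be sketched as follows. Here (x, y, z) is the observation point, (θ, φ) the ray direction, λ the wavelength, τ the time, and (u, v), (s, t) the intersections of a ray with the two parallel planes. The step from five to four dimensions relies on the usual assumption that radiance stays constant along a ray in free space.

```latex
\begin{align}
  P_7 &= P(x, y, z, \theta, \phi, \lambda, \tau)
      && \text{7-D plenoptic function} \\
  P_5 &= P(x, y, z, \theta, \phi)
      && \text{5-D: one time instant, wavelength handled per colour channel} \\
  L_4 &= L(u, v, s, t)
      && \text{4-D: two-plane parameterization}
\end{align}
```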

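Building on 4-D light field theory and advances in modern optical imaging, a variety of light field imaging systems have been proposed, such as light field camera arrays^[5], the Lytro light field camera^[6], and the Raytrix light field camera^[7]. Among them, the Lytro camera captures the 4-D position and angle information of the rays in a 3-D scene with a single exposure, breaking the bottleneck that conventional imaging systems can record only 2-D information about the rays. Because this additional ray information is recorded, the Lytro camera offers the advantage of "shoot first, focus later", which has promoted its use at all levels of computer vision^[8] and in a wide range of applications^[9].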
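As a rough illustration of how a single lenslet exposure carries 4-D information, the sketch below treats an idealized lenslet image as a regular grid of square micro-images and gathers the pixel at the same offset inside every micro-image into one view. The function names, the toy data, and the square, axis-aligned grid are assumptions made for illustration only. Raw Lytro data additionally require calibration, demosaicing, and resampling, none of which is shown here.

```python
def make_toy_lenslet(n_s, n_t, angular_size):
    """Build a toy lenslet image whose pixel values encode their 4-D index.

    Value = 1000*s + 100*t + 10*u + v, where (s, t) indexes the micro-image
    and (u, v) the pixel inside it. Purely synthetic test data.
    """
    a = angular_size
    return [
        [1000 * (y // a) + 100 * (x // a) + 10 * (y % a) + (x % a)
         for x in range(n_t * a)]
        for y in range(n_s * a)
    ]


def extract_view(lenslet, angular_size, u, v):
    """Collect pixel (u, v) of every micro-image into one view image."""
    rows, cols = len(lenslet), len(lenslet[0])
    if rows % angular_size or cols % angular_size:
        raise ValueError("image size must be a multiple of angular_size")
    if not (0 <= u < angular_size and 0 <= v < angular_size):
        raise ValueError("(u, v) must lie inside a micro-image")
    return [
        [lenslet[y + u][x + v] for x in range(0, cols, angular_size)]
        for y in range(0, rows, angular_size)
    ]


if __name__ == "__main__":
    toy = make_toy_lenslet(n_s=2, n_t=3, angular_size=2)
    for row in extract_view(toy, angular_size=2, u=1, v=0):
        print(row)
```

Each extracted view has one pixel per micro-image, so a lenslet image with A x A pixels per micro-image yields A x A views in this idealized model.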

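To meet high-definition requirements, a light field image must be represented by a large amount of data. Its data volume is therefore far greater than that of a conventional 2-D natural image, which poses great challenges for storage and transmission. Yet storage and transmission are key to the application and development of light field imaging. Compression algorithms designed for light field image content are therefore particularly important.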

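Existing compression algorithms for light field images fall roughly into two categories: pseudo-sequence-based methods and spatial-correlation-based methods. The core idea of pseudo-sequence-based methods^[10-13] is to decompose the light field image into multi-view images, reorganize these views into a video sequence, and compress the sequence with an existing video coding standard^[14]. Such methods, however, need accurate geometric information about the light field image in order to extract the multi-view images. The core idea of spatial-correlation-based methods^[15-19] is to exploit the strong correlation between neighboring micro-images (MIs) in the light field image. Although these methods improve coding efficiency by exploiting the self-similarity of the light field image, many of them predict texture-complex regions with low accuracy. In addition, because a light field image contains 4-D information about the 3-D scene, view images can be extracted from it, and the extracted views are also strongly spatially correlated. Many algorithms, however, do not fully exploit this correlation between view images to improve coding efficiency. This paper therefore proposes a light field image compression algorithm based on the spatial correlation of view images. While making full use of the high spatial correlation between the extracted view images, it designs a prediction algorithm that combines linear weighted prediction with intra block copy (IBC) prediction to improve prediction accuracy in texture-complex regions and thereby the coding efficiency of light field images.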
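The idea of choosing between a copy-based predictor and a linearly weighted predictor by rate-distortion cost can be pictured with the toy sketch below. It is not the authors' algorithm: the weighting rule (weights inversely proportional to the matching error), the bit counts, the Lagrange multiplier, and all function names are assumptions for illustration. It is also an encoder-side view only, since a real codec must derive or signal the weights so that the decoder can reproduce them. Blocks are flat lists of sample values.

```python
def sad(a, b):
    """Sum of absolute differences between two blocks."""
    return sum(abs(x - y) for x, y in zip(a, b))


def ssd(a, b):
    """Sum of squared differences, used here as the distortion D."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def ibc_prediction(target, candidates):
    """Copy the single candidate block that best matches the target."""
    return list(min(candidates, key=lambda c: sad(target, c)))


def weighted_prediction(target, candidates, k=2, eps=1.0):
    """Blend the k best candidates; weights ~ 1 / (SAD + eps) (assumed rule)."""
    best = sorted(candidates, key=lambda c: sad(target, c))[:k]
    raw = [1.0 / (sad(target, c) + eps) for c in best]
    total = sum(raw)
    weights = [w / total for w in raw]
    return [sum(w * c[i] for w, c in zip(weights, best))
            for i in range(len(target))]


def choose_mode(target, candidates, lam=1.0, rate_bits=None):
    """Pick the mode with the smallest Lagrangian cost J = D + lam * R."""
    if rate_bits is None:
        # Hypothetical side-information costs in bits, not measured values.
        rate_bits = {"ibc": 10, "weighted": 14}
    predictions = {
        "ibc": ibc_prediction(target, candidates),
        "weighted": weighted_prediction(target, candidates),
    }
    costs = {mode: ssd(target, pred) + lam * rate_bits[mode]
             for mode, pred in predictions.items()}
    return min(costs, key=costs.get), costs


if __name__ == "__main__":
    target = [10, 20, 30, 40]
    candidates = [[8, 18, 28, 38], [12, 22, 32, 42], [0, 0, 0, 0]]
    print(choose_mode(target, candidates, lam=1.0))
    print(choose_mode(target, candidates, lam=10.0))
```

In this toy data the blended predictor has lower distortion but a higher assumed rate, so the selected mode changes with the multiplier. The same cost comparison extends naturally to candidate block sizes.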

4. Conclusion

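This paper fully explores the strong correlation between the views rendered from a light field image and proposes a light field image compression algorithm based on the correlation of rendered views. Built on the screen content coding extension of high efficiency video coding (HEVC), the algorithm uses rate-distortion optimization to combine the linear weighting algorithm with the intra block copy algorithm, further improving the prediction accuracy of coding blocks. Experimental results show that the proposed algorithm achieves an average BD-PSNR coding gain of 2.55dB over HEVC. It also yields good quality for the virtual rendered views.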
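For readers unfamiliar with the BD-PSNR metric quoted above, the following is a minimal sketch of the usual Bjøntegaard calculation: fit a cubic to PSNR as a function of log10(bit rate) for each codec, integrate both fits over the common log-rate interval, and take the difference of the mean values. It assumes exactly four rate points per curve, so the cubic passes through the points exactly. The rate and PSNR numbers in the example are made up and are not results from this article.

```python
import math


def fit_cubic(xs, ys):
    """Coefficients c0..c3 of the cubic through four (x, y) points."""
    if len(xs) != 4 or len(ys) != 4:
        raise ValueError("this sketch expects exactly four points")
    n = 4
    # Augmented Vandermonde matrix: rows are [1, x, x^2, x^3 | y].
    m = [[x ** j for j in range(n)] + [y] for x, y in zip(xs, ys)]
    # Gaussian elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution.
    coeffs = [0.0] * n
    for r in range(n - 1, -1, -1):
        rest = sum(m[r][c] * coeffs[c] for c in range(r + 1, n))
        coeffs[r] = (m[r][n] - rest) / m[r][r]
    return coeffs


def integral(coeffs, lo, hi):
    """Definite integral of the polynomial with the given coefficients."""
    def antiderivative(x):
        return sum(c * x ** (j + 1) / (j + 1) for j, c in enumerate(coeffs))
    return antiderivative(hi) - antiderivative(lo)


def bd_psnr(rates_ref, psnr_ref, rates_test, psnr_test):
    """Average PSNR difference (test minus reference) in dB."""
    lr_ref = [math.log10(r) for r in rates_ref]
    lr_test = [math.log10(r) for r in rates_test]
    lo = max(min(lr_ref), min(lr_test))
    hi = min(max(lr_ref), max(lr_test))
    avg_ref = integral(fit_cubic(lr_ref, psnr_ref), lo, hi) / (hi - lo)
    avg_test = integral(fit_cubic(lr_test, psnr_test), lo, hi) / (hi - lo)
    return avg_test - avg_ref


if __name__ == "__main__":
    rates = [100.0, 200.0, 400.0, 800.0]       # made-up bit rates
    anchor = [30.0, 33.0, 35.5, 37.5]          # made-up anchor PSNR values
    shifted = [p + 1.5 for p in anchor]        # test curve 1.5 dB higher
    print(bd_psnr(rates, anchor, rates, shifted))
```

A positive value means the test codec delivers higher PSNR than the anchor at the same bit rate, averaged over the shared rate range.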

test images	compression methods	BD-PSNR/dB	BD-rate/%
bike	HEVC-RExt	1.35	-18.51
	DCCM	1.63	-23.21
	HEVC-SCC	1.69	-24.57
	the proposal	2.14	-32.41
fountain	HEVC-RExt	1.52	-22.11
	DCCM	1.77	-26.25
	HEVC-SCC	1.84	-27.33
	the proposal	2.34	-36.02
Laura	HEVC-RExt	1.53	-20.40
	DCCM	1.69	-22.80
	HEVC-SCC	1.88	-25.36
	the proposal	2.62	-35.99
seagull	HEVC-RExt	2.07	-31.77
	DCCM	2.24	-35.32
	HEVC-SCC	2.56	-39.23
	the proposal	3.08	-48.89
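As a quick consistency check, the per-method averages of the table above can be recomputed from its rounded entries. The data below are copied from the table; the dictionary layout and function name are illustrative choices.

```python
# (BD-PSNR in dB, BD-rate in %) for bike, fountain, Laura, seagull.
TABLE = {
    "HEVC-RExt": [(1.35, -18.51), (1.52, -22.11), (1.53, -20.40), (2.07, -31.77)],
    "DCCM": [(1.63, -23.21), (1.77, -26.25), (1.69, -22.80), (2.24, -35.32)],
    "HEVC-SCC": [(1.69, -24.57), (1.84, -27.33), (1.88, -25.36), (2.56, -39.23)],
    "the proposal": [(2.14, -32.41), (2.34, -36.02), (2.62, -35.99), (3.08, -48.89)],
}


def column_means(rows):
    """Mean of each column over the four test images."""
    return tuple(sum(column) / len(rows) for column in zip(*rows))


MEANS = {method: column_means(rows) for method, rows in TABLE.items()}

if __name__ == "__main__":
    for method, (mean_psnr, mean_rate) in MEANS.items():
        print(f"{method:12s} {mean_psnr:.4f} dB {mean_rate:.4f} %")
```

The mean of the four BD-PSNR entries for the proposal is 2.545 dB, which is consistent with the 2.55dB average reported in the abstract; the small difference presumably comes from rounding of the per-image values.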

test images	HEVC-RExt	DCCM	HEVC-SCC	the proposal
bike	2.81	3.31	10.35	13.86
fountain	3.21	4.10	11.35	18.48
Laura	3.97	4.15	29.76	31.25
seagull	3.04	4.57	26.01	29.92
average	3.26	4.03	19.37	23.38

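2.1. Linear weighted prediction algorithm

2.2. Intra block copy prediction algorithm

2.3. Integration of the proposed prediction algorithm into the HEVC-SCC platform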
