Fast generation of CGH using multi-core CPU-GPU heterogeneous system

MA Xiandong; GUI Jinbin; CHEN Aishuai; LIU Juntong

doi:10.7510/jgjs.issn.1001-3806.2024.02.010

Volume 48 Issue 2

Mar. 2024

Article Contents

Turn off MathJax

Article Navigation > LASER TECHNOLOGY > 2024 > 48(2): 210-215

Citation:

Fast generation of CGH using multi-core CPU-GPU heterogeneous system

College of Science, Kunming University of Science and Technology, Kunming 650500, China

Corresponding author: GUI Jinbin, jinbingui@163.com ;
Received Date: 2023-02-09
Accepted Date: 2023-04-20

Abstract

In order to make full use of the computing performance of the computer to improve the speed of computer-generated hologram(CGH) based on the point source model, a fast CGH generation system based on a multi-core central processing unit (CPU) and graphics processing unit (GPU) was designed and optimized in this paper. First of all, the system used the unified architecture platform to design and implement a CGH generation system based on the point source model and proposes the optimization strategy of computing. Then, an optimized calculation formula was proposed to reduce the amount of calculation. Finally, the task debugging was optimized to build a CPU parallel computing system. One of the cores was responsible for startup and function, and data transmission, while the other cores undertook some computing tasks to further improve the computing speed. The results show that, the designed system makes full use of the performance of both CPU and GPU. Under the same configuration of computing hardware, the speedup ratio of CGH generation is 4~4.75 times higher than that of CGH generation in a single GPU system. Heterogeneous systems can effectively improve the generation speed of computer-generated holograms. The research is helpful for generating a 3-D scene hologram quickly.
- holography,
- computer-generated hologram,
- heterogeneous system,
- point-source model

References

[1]	SAHIN E, STOYKOVA E, MKINEN J, et al. computer-generated holograms for 3D imaging: A survey[J]. ACM Computing Surveys, 2020, 53(2): 1-35.
[2]	ATHANASIA S, DAVID B, PETER S. Color computer-generated holography for point clouds utilizing the Phong illumination model[J]. Optics Express, 2018, 26(8): 10282-10298. doi: 10.1364/OE.26.010282
[3]	曾胜财, 甘亮勤. 编码法制彩色动态全息[J]. 激光技术, 2021, 45(2): 229-232. ZENG Sh C, GAN L Q. Making color dynamic holograms by the coding method[J]. Laser Technology, 2021, 45(2): 229-232(in Ch-inese).
[4]	刘柳, 姚燕, 蔡晋辉, 等. 基于数字全息的甲烷-氧气预混火焰温度场研究[J]. 激光技术, 2022, 46(3): 408-414. LIU L, YAO Y, CAI J H, et al. Study on methane-oxygen premixed flame temperature field based on digital holography[J]. Laser Technology, 2022, 46(3): 408-414(in Chinese).
[5]	REN N, KOHEI S, YOSHIKI M. Real-time gradation-expressible amplitude-modulationtype electroholography based on binary-weighted computer-generated hologram[J]. Chinese Optics Letters, 2021, 19(11): 110501. doi: 10.3788/COL202119.110501
[6]	金晓宇, 桂进斌, 刘超, 等. 基于点源模型计算全息图快速生成算法的研究进展[J]. 激光与光电子学进展, 2018, 55(10): 100005. JIN X Y, GUI J B, LIU Ch, et al. Research progress of fast algorithm for cgh generation based on point source model[J]. Laser & Opto-electronics Progress, 2018, 55(10): 100005(in Chinese).
[7]	ZHAO Y, SHI C X, KWON K C, et al. Optics communications fast calculation method of computer-generated hologram using a depth camera with point cloud gridding[J]. Optics Communications, 2018, 411: 166-169. doi: 10.1016/j.optcom.2017.11.040
[8]	MATSUSHIMA K, NAKAHARA S. New techniques for wave-field rendering of polygon-based high-definition CGHs[J]. Proceedings of the SPIE, 2011, 7957: 79571A. doi: 10.1117/12.876362
[9]	PAN Y J, WANG Y T, LIU J, et al. Fast polygon-based method for calculating computer-generated holograms in three-dimensional display[J]. Applied Optics, 2013, A52(1): 290-299.
[10]	KIM S C, KIM E S. Effective generation of digital holograms of three-dimensional objects using a novel look-up table method[J]. Applied Optics, 2008, D47(19): 55-62.
[11]	JIA J, WANG Y, LIU J, et al. Reducing the memory usage for e-ffective computer-generated hologram calculation using compressed look-up table in full-color holographic display[J]. Applied Optics, 2013, 52(7): 1404-1412. doi: 10.1364/AO.52.001404
[12]	SHIMOBABA T, MASUDA N, ITO T. Simple and fast calculation algorithm for computer-generated hologram with wavefront recording plane[J]. Optics Letters, 2009, 34(20): 3133-3135. doi: 10.1364/OL.34.003133
[13]	吴凯. 基于点源调控的视窗全息动态3维显示方法研究[D]. 苏州: 苏州大学, 2020. WU K. Research on dynamic 3D display method of window holography based on point source control[D]. Soochow: Soochow University, 2020(in Chinese).
[14]	LUCENT E, MARK E. Interactive computation of holograms using a look-up table[J]. Journal of Electronic Imaging, 1993, 2(1): 28-34. doi: 10.1117/12.133376
[15]	OKADA N, HIRAI D, ICHIHASHI Y, et al. Special-purpose computer HORN-7 with FPGA technology for phase modulation type electro-holography[C]//International Display Workshops. New York, USA: IEEE, 2012: 1284-1287.
[16]	TOMOYOSHI S, TOMOYOSHI I, NOBUYUKI M, et al. Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL[J]. Optics Express, 2010, 18(10): 9955-9960. doi: 10.1364/OE.18.009955
[17]	TAKADA N, SHIMOBABA T, NAKAYAMA H, et al. Fast high-resolution computer-generated hologram computation using multiple graphics processing unit cluster system[J]. Applied Optics, 2012, 51(30): 7303-7307. doi: 10.1364/AO.51.007303
[18]	蒋晓瑜, 丛彬, 裴闯, 等. 一种基于新型查表方法的统一计算设备架构并行计算全息算法[J]. 光学学报, 2015, 35(2): 0209001. JIANG X Y, CONG B, PEI Ch, et al. A parallel algorithm based on a new table lookup method for unified computing device architecture[J]. Acta Optica Sinica, 2015, 35(2): 0209001(in Chinese).
[19]	PAN Y, XU X, SOLANKI S, et al. Fast CGH computation using S-LUT on GPU[J]. Optics Express, 2009, 17(21): 18543-18555. doi: 10.1364/OE.17.018543
[20]	许可, 王星儿, 范旭浩, 等. 超表面全息术: 从概念到实现[J]. 光电工程, 2022, 49(10): 220183. XU K, WANG X E, FAN X H, et al. Meta-holography: From concept to realization[J]. Opto-Electron Engineering, 2022, 49(10): 220183(in Chinese).
[21]	AHRENBERG L, BENZIE P, MAGNOR M, et al. Computer gene-rated holography using parallel commodity graphics hardware[J]. Optics Express, 2006, 14(17): 7636-7641. doi: 10.1364/OE.14.007636
[22]	TAKADA N, SHIMOBABA T, NAKAYAMA H, et al. Fast high-resolution computer-generated hologram computation using multiple graphics processing unit cluster system[J]. Applied Optics, 2012, 51(30): 7303-7307. doi: 10.1364/AO.51.007303
[23]	JIN X Y, GUI J B, JIANG Zh X, et al. Fast calculation of computer generated hologram using multi-core CPUs and GPU system[J]. Proceedings of the SPIE, 2018, 1117: 10818.

Proportional views

通讯作者: 陈斌, bchen63@163.com

1.
沈阳化工大学材料科学与工程学院沈阳 110142

Figures(8) / Tables(1)

Get Citation

PDF

XML

Article views(539) PDF downloads(5) Cited by()

Proportional views

HTML

0. 引言

全息显示技术能够把物体的波前完整地重建出来，提供真实的视觉感受，因而成为国内外真3维显示技术的研究热点^[1-4]。计算全息是其中的一个重要部分，它是现代光学和计算机技术相结合的产物^[5-6]，它不需要搭建实际光路，通过计算机仿真计算就可以生成虚拟物体的全息图，具有很高的灵活性和重复性。目前生成计算全息图的方法主要分为两种: 一种是面元法^[7-9]，它的核心是将空间中的3维物体分割为许多不同形状的面元，把物光波视为这些面发出光波的叠加；另一种是点源法^[10-13]，它将空间中的3维物体采样离散为许多的点，物光波视为这些点发出光波的叠加。点源法具有原理简单、操作灵活的优势，而且通过点源法生成的计算全息图有着较好的重建质量，所以点源法有着巨大的潜力。但是，为了得到高质量的重建像，需要对3维物体采集大量的点数据，并进行大量运算，普通计算机很难达到生成计算全息图的实时计算的要求。

为了提高点源法的计算速度，LUCENT等人提出了查找表法(look-up table, LUT)^[14]。LUT方法预先计算每个可能位置点源的干涉条纹图样并储存起来，实时计算时只需读取图样并进行叠加，极大地缩短了线上的运算时间，但是预先计算的数据表需要庞大的内存空间；基于点源模型计算全息图的另一种方法是波前记录平面法^[12]，其核心在于在物体附近定义一个平面，该平面与全息面平行且等大，计算每个点源在该平面上贡献的复振幅，而不需要计算在全息面的复振幅叠加。该算法通过降低计算机全息图(computer-generated hologram, CGH)的计算复杂度来大幅提高了计算速度，但是其缺点是不能记录大于全息图尺寸的物体。

随着计算机技术的快速发展，提高计算速度的方法不再局限于算法的改进，将高性能硬件与算法结合成为了广大学者更优的选择。日本学者使用可编程逻辑器件构建了专门用于全息计算的硬件系统^[15]，使全息图的计算速度有了巨大的提升，但是由于计算全息专用硬件系统的成本过高，所以没有被广泛地应用，相比之下，图像处理单元(graphic processing unit, GPU)低成本、高性能^[16-20]，因此成为了许多研究人员的首选。

AHRENBERG等人采用OpenGL对GPU编程，有效地提高了计算全息图的生成速度^[21]。TAKADA等人使用多GPU系统计算生成全息图，使得计算速度得到极大的提升^[22]。但是目前利用GPU生成计算全息图都是使中央处理器(central processing unit, CPU)和GPU以串行方式工作，CPU与GPU总是只会有一个在工作状态，另一个处于等待状态，硬件得不到充分利用，导致计算速度减慢。为了高效地利用CPU和GPU异构系统计算全息图的计算性能，本课题组先期已经做了初步的报道^[23]，但在这篇文献中只是简单地实现了CPU和GPU异构系统并行计算全息图，并未对该系统进行优化处理，存在着不足。本文中为了进一步优化CPU和GPU异构系统并行计算全息图的性能，提出了数据处理与任务调度重叠并行的计算方法, 然后基于CPU和GPU异构系统的重叠并行的计算方法，进行全息图计算公式简化、任务分配、共享内存优化等, 设计并实现了CPU-GPU异构系统用于快速生成计算全息图。

5. 结论

在点源模型计算全息图的原理上，设计了基于多核CPU和GPU的异构并行系统，并对该系统使用了全息图计算公式简化、任务分配、共享内存等优化方法，提高了CPU-GPU异构系统的计算性能，实验验证了系统的可行性。在本文中的硬件条件下，实验结果表明, 计算全息图的加速比较GPU系统的加速比至少提高4倍左右，说明设计的系统能有效地解决点源模型计算全息图缓慢的问题。

在今后的实验中，本文作者准备采用更高性能的GPU和CPU，从而达到全息图的实时计算。但实际上，在GPU性能有大幅提升的情况下，CPU的性能也应同步提升，否则在GPU性能远远超过CPU性能时，使用异构系统反而会让运算速度较GPU系统的运算速度更慢。这是因为高性能的GPU能快速地将高并发的大量任务在短时间内完成，而数据的传输任务只能靠CPU串行执行，当CPU性能过差时，就会导致在传输数据上花费过多时间，并且能分配给CPU执行的任务量也较少。因此，提升CPU性能是很有必要的，它不仅能帮GPU承担更多的任务量，而且能及时将处理完成的数据传走，并喂进新的数据，让GPU处于满负荷的工作状态, 使该异构系统的计算速度得到很大的提升。

Reference (23)

[1]	SAHIN E, STOYKOVA E, MKINEN J. computer-generated holograms for 3D imaging: A survey[J]. ACM Computing Surveys, 2020, 53(2): 1-35.
[2]	ATHANASIA S, DAVID B, PETER S. Color computer-generated holography for point clouds utilizing the Phong illumination model[J]. Optics Express, 2018, 26(8): 10282-10298. doi: 10.1364/OE.26.010282
[3]	曾胜财, 甘亮勤. 编码法制彩色动态全息[J]. 激光技术, 2021, 45(2): 229-232.	ZENG Sh C, GAN L Q. Making color dynamic holograms by the coding method[J]. Laser Technology, 2021, 45(2): 229-232.
[4]	刘柳, 姚燕, 蔡晋辉. 基于数字全息的甲烷-氧气预混火焰温度场研究[J]. 激光技术, 2022, 46(3): 408-414.	LIU L, YAO Y, CAI J H. Study on methane-oxygen premixed flame temperature field based on digital holography[J]. Laser Technology, 2022, 46(3): 408-414.
[5]	REN N, KOHEI S, YOSHIKI M. Real-time gradation-expressible amplitude-modulationtype electroholography based on binary-weighted computer-generated hologram[J]. Chinese Optics Letters, 2021, 19(11): 110501-. doi: 10.3788/COL202119.110501
[6]	金晓宇, 桂进斌, 刘超. 基于点源模型计算全息图快速生成算法的研究进展[J]. 激光与光电子学进展, 2018, 55(10): 100005-.	JIN X Y, GUI J B, LIU Ch. Research progress of fast algorithm for cgh generation based on point source model[J]. Laser & Opto-electronics Progress, 2018, 55(10): 100005-.
[7]	ZHAO Y, SHI C X, KWON K C. Optics communications fast calculation method of computer-generated hologram using a depth camera with point cloud gridding[J]. Optics Communications, 2018, 411(): 166-169. doi: 10.1016/j.optcom.2017.11.040
[8]	MATSUSHIMA K, NAKAHARA S. New techniques for wave-field rendering of polygon-based high-definition CGHs[J]. Proceedings of the SPIE, 2011, 7957(): 79571A-. doi: 10.1117/12.876362
[9]	PAN Y J, WANG Y T, LIU J. Fast polygon-based method for calculating computer-generated holograms in three-dimensional display[J]. Applied Optics, 2013, A52(1): 290-299.
[10]	KIM S C, KIM E S. Effective generation of digital holograms of three-dimensional objects using a novel look-up table method[J]. Applied Optics, 2008, D47(19): 55-62.
[11]	JIA J, WANG Y, LIU J. Reducing the memory usage for e-ffective computer-generated hologram calculation using compressed look-up table in full-color holographic display[J]. Applied Optics, 2013, 52(7): 1404-1412. doi: 10.1364/AO.52.001404
[12]	SHIMOBABA T, MASUDA N, ITO T. Simple and fast calculation algorithm for computer-generated hologram with wavefront recording plane[J]. Optics Letters, 2009, 34(20): 3133-3135. doi: 10.1364/OL.34.003133
[13]	吴凯. 基于点源调控的视窗全息动态3维显示方法研究[D]. 苏州: 苏州大学, 2020.	WU K. Research on dynamic 3D display method of window holography based on point source control[D]. Soochow: Soochow University, 2020(in Chinese).
[14]	LUCENT E, MARK E. Interactive computation of holograms using a look-up table[J]. Journal of Electronic Imaging, 1993, 2(1): 28-34. doi: 10.1117/12.133376
[15]	OKADA N, HIRAI D, ICHIHASHI Y, et al. Special-purpose computer HORN-7 with FPGA technology for phase modulation type electro-holography[C]//International Display Workshops. New York, USA: IEEE, 2012: 1284-1287.
[16]	TOMOYOSHI S, TOMOYOSHI I, NOBUYUKI M. Fast calculation of computer-generated-hologram on AMD HD5000 series GPU and OpenCL[J]. Optics Express, 2010, 18(10): 9955-9960. doi: 10.1364/OE.18.009955
[17]	TAKADA N, SHIMOBABA T, NAKAYAMA H. Fast high-resolution computer-generated hologram computation using multiple graphics processing unit cluster system[J]. Applied Optics, 2012, 51(30): 7303-7307. doi: 10.1364/AO.51.007303
[18]	蒋晓瑜, 丛彬, 裴闯. 一种基于新型查表方法的统一计算设备架构并行计算全息算法[J]. 光学学报, 2015, 35(2): 0209001-.	JIANG X Y, CONG B, PEI Ch. A parallel algorithm based on a new table lookup method for unified computing device architecture[J]. Acta Optica Sinica, 2015, 35(2): 0209001-.
[19]	PAN Y, XU X, SOLANKI S. Fast CGH computation using S-LUT on GPU[J]. Optics Express, 2009, 17(21): 18543-18555. doi: 10.1364/OE.17.018543
[20]	许可, 王星儿, 范旭浩. 超表面全息术: 从概念到实现[J]. 光电工程, 2022, 49(10): 220183-.	XU K, WANG X E, FAN X H. Meta-holography: From concept to realization[J]. Opto-Electron Engineering, 2022, 49(10): 220183-.
[21]	AHRENBERG L, BENZIE P, MAGNOR M. Computer gene-rated holography using parallel commodity graphics hardware[J]. Optics Express, 2006, 14(17): 7636-7641. doi: 10.1364/OE.14.007636
[22]	TAKADA N, SHIMOBABA T, NAKAYAMA H. Fast high-resolution computer-generated hologram computation using multiple graphics processing unit cluster system[J]. Applied Optics, 2012, 51(30): 7303-7307. doi: 10.1364/AO.51.007303
[23]	JIN X Y, GUI J B, JIANG Zh X. Fast calculation of computer generated hologram using multi-core CPUs and GPU system[J]. Proceedings of the SPIE, 2018, 1117(): 10818-.

number of points	2354	4708	9416	11770	21174	42348
time spent by CPU-GPU system/ms	347	622	1138	1415	2521	4962
time spent by GPU system/ms	1417	2753	5366	6608	11840	23539
speedup ratio	4.08	4.43	4.72	4.67	4.70	4.74

number of points	101168	202336	404672	1018748	2037496
time spent by CPU-GPU system/ms	11763	23912	47558	120979	242568
time spent by GPU system/ms	56545	113783	227851	575493	1151167
speedup ratio	4.81	4.76	4.79	4.76	4.75

Fast generation of CGH using multi-core CPU-GPU heterogeneous system

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Proportional views

Fast generation of CGH using multi-core CPU-GPU heterogeneous system

Corresponding author: GUI Jinbin, jinbingui@163.com;

HTML

3.1. 简化计算公式

3.2. 任务调度

4.1. 结果分析

4.2. 实验验证

Catalog

Fast generation of CGH using multi-core CPU-GPU heterogeneous system

Abstract

References

Proportional views

通讯作者: 陈斌, bchen63@163.com

Proportional views

Fast generation of CGH using multi-core CPU-GPU heterogeneous system

Corresponding author: GUI Jinbin, jinbingui@163.com;

HTML

3.1. 简化计算公式

3.2. 任务调度

4.1. 结果分析

4.2. 实验验证

Catalog

Export File

Citation

Format

Content