Institutional Repository of the Institute of Optics and Electronics, Chinese Academy of Sciences (IOE OpenIR)
Title: Bayer image parallel decoding based on GPU
Authors: Hu, Rihui (1,2); Xu, Zhiyong (1); Wei, Yuxing (1); Sun, Shaohua (3)
Publication date: 2012
Conference name: Proceedings of SPIE: Optoelectronic Imaging and Multimedia Technology II
Conference date: 2012
DOI: 10.1117/12.999431
Corresponding author: Hu, R. (ustc_hui@126.com)
Abstract: In photoelectrical tracking systems, Bayer images are traditionally decompressed on the CPU. This is too slow for large images, for example 2K x 2K x 16-bit. To accelerate Bayer image decoding, this paper introduces a parallel speedup method for NVIDIA Graphics Processing Units (GPUs) that support the CUDA architecture. The decoding procedure is divided into three parts: a serial part, a task-parallel part, and a data-parallel part comprising inverse quantization, the inverse discrete wavelet transform (IDWT), and image post-processing. To reduce execution time, the task-parallel part is optimized with OpenMP, and the data-parallel part runs on the GPU as a CUDA program. The optimization techniques include instruction optimization, shared memory access optimization, coalesced memory access optimization, and texture memory optimization. In particular, rewriting the serial 2D (two-dimensional) IDWT as a parallel 1D IDWT speeds up the IDWT significantly. In experiments with a 1K x 1K x 16-bit Bayer image, the data-parallel part is more than 10 times faster than the CPU-based implementation. Finally, a CPU+GPU heterogeneous decompression system was designed. Experimental results show that it achieves a 3 to 5 times speedup over the CPU serial method. © Copyright SPIE.
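The abstract's remark about rewriting the serial 2D IDWT as a parallel 1D IDWT can be illustrated with a minimal sketch. It is not the authors' implementation: the record does not name the wavelet, so a one-level Haar transform is assumed here purely for brevity, and all function names are hypothetical. The point it shows is that a separable 2D inverse transform reduces to independent 1D inverse transforms over columns and then rows.

```python
# Illustrative sketch only. The Haar wavelet and every name below are
# assumptions; the record does not specify the wavelet used in the paper.

def haar_forward_1d(x):
    """One-level forward Haar: returns [averages..., details...]."""
    half = len(x) // 2
    avg = [(x[2 * i] + x[2 * i + 1]) / 2 for i in range(half)]
    det = [(x[2 * i] - x[2 * i + 1]) / 2 for i in range(half)]
    return avg + det

def haar_inverse_1d(c):
    """One-level inverse Haar for a vector laid out as [averages, details]."""
    half = len(c) // 2
    out = [0.0] * len(c)
    for i in range(half):
        a, d = c[i], c[half + i]
        out[2 * i] = a + d
        out[2 * i + 1] = a - d
    return out

def transpose(m):
    return [list(col) for col in zip(*m)]

def dwt2d(image):
    """Forward 2D transform: rows first, then columns."""
    rows = [haar_forward_1d(r) for r in image]
    cols = [haar_forward_1d(c) for c in transpose(rows)]
    return transpose(cols)

def idwt2d_as_1d_passes(coeffs):
    """Inverse 2D transform written as two passes of independent 1D inverses."""
    # Column pass: no column depends on any other column.
    cols = [haar_inverse_1d(c) for c in transpose(coeffs)]
    # Row pass: no row depends on any other row.
    return [haar_inverse_1d(r) for r in transpose(cols)]

image = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
restored = idwt2d_as_1d_passes(dwt2d(image))
```

Because the 1D calls within each pass share no data, a GPU port could plausibly assign each row or column to its own thread or thread block; how the paper actually maps the work is not stated in this record.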
Indexed by: Ei
Language: English
Volume: 8558
ISSN: 0277-786X
Article type: Conference paper
Pages: 85581T
Content type: Conference paper
URI: http://ir.ioe.ac.cn/handle/181551/7695
Appears in Collections: Optoelectronic Detection and Signal Processing Laboratory (Lab 5)_Conference papers

Files in This Item:
File Name / File Size | Content Type | Version | Access | License
2012-2168.pdf (224KB) | Conference paper | -- | Restricted access |

Author affiliations: 1. Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu, Sichuan, 610209, China
2. Graduate School, Chinese Academy of Sciences, Beijing, 100039, China
3. 750 Test Field of China Shipbuilding Industry Corporation, Kunming, Yunnan, 650051, China

Recommended Citation:
Hu, Rihui, Xu, Zhiyong, Wei, Yuxing, et al. Bayer image parallel decoding based on GPU[C]. In: Proceedings of SPIE: Optoelectronic Imaging and Multimedia Technology II. 2012.
File name: 2012-2168.pdf
Format: Adobe PDF
Items in IR are protected by copyright, with all rights reserved, unless otherwise indicated.

 

 

Copyright © 2007-2016 Institute of Optics and Electronics, Chinese Academy of Sciences