Improving performance of floating point division on GPU and MIC
Huang, Kun ; Chen, Yifeng
2015
Abstract: Floating-point computing capability is an important concern in high-performance scientific applications and engineering computing. Although it is a fundamental operation, floating-point division (or reciprocal) has long been much less efficient than addition and multiplication. Architectures such as GPU and MIC do not even provide an instruction for such division at the instruction level. This paper proposes a fast approximation algorithm, built on existing instructions, that estimates the quotient of floating-point numbers in IEEE 754 format and is in most cases accurate enough for practical computing. Like most iterative numerical algorithms, it consists of a predicting step and an iterating step. The predicting step exploits the properties of the IEEE 754 format to compute an estimate with only one integer subtraction instruction. The iterating step improves the accuracy through fast iterations that take about ten instructions. The new algorithm is extremely easy to implement and performs well in practical experiments. © Springer International Publishing Switzerland 2015. EI; 691-703; 9529
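The predict-then-iterate idea in the abstract can be illustrated with a small sketch. It is not the paper's implementation: the abstract does not give the constant used in the integer subtraction or the exact form of the iterating step. The sketch assumes the commonly quoted binary32 constant 0x7EF311C7 and uses Newton-Raphson refinement, x <- x * (2 - d * x), as a stand-in. All function names are hypothetical, and only positive, finite, normal divisors of moderate magnitude are handled.

```python
import math
import struct

# Assumed magic constant for IEEE 754 binary32 (not taken from the paper).
MAGIC = 0x7EF311C7


def float_to_bits(x: float) -> int:
    """Reinterpret a value rounded to binary32 as an unsigned 32-bit integer."""
    return struct.unpack("<I", struct.pack("<f", x))[0]


def bits_to_float(b: int) -> float:
    """Reinterpret an unsigned 32-bit integer as a binary32 value."""
    return struct.unpack("<f", struct.pack("<I", b))[0]


def round_f32(x: float) -> float:
    """Round a Python float (binary64) to binary32 to mimic single precision."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def predict_reciprocal(d: float) -> float:
    """Predicting step: one integer subtraction on the bit pattern of d.

    Subtracting the bits negates the exponent and roughly inverts the
    mantissa, which gives a coarse estimate of 1/d.
    """
    if not (d > 0.0 and math.isfinite(d)):
        raise ValueError("sketch handles positive finite divisors only")
    return bits_to_float((MAGIC - float_to_bits(d)) & 0xFFFFFFFF)


def refine(d: float, x: float, steps: int = 3) -> float:
    """Iterating step (Newton-Raphson stand-in): x <- x * (2 - d * x).

    Each pass roughly squares the relative error of the estimate.
    """
    for _ in range(steps):
        x = round_f32(x * round_f32(2.0 - round_f32(d * x)))
    return x


def approx_div(a: float, d: float, steps: int = 3) -> float:
    """Approximate a / d as a * (1 / d) using the two steps above."""
    return round_f32(a * refine(d, predict_reciprocal(d), steps))


if __name__ == "__main__":
    for a, d in [(1.0, 3.0), (10.0, 4.0), (22.0, 7.0)]:
        print(a, d, predict_reciprocal(d), approx_div(a, d), a / d)
```

Under these assumptions the predicted reciprocal is only a coarse estimate, and a few refinement passes bring it close to single-precision accuracy, since each Newton-Raphson pass roughly squares the relative error. On a GPU the same two steps would map to one integer subtraction followed by a short sequence of multiply and subtract (or fused multiply-add) instructions.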
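Language: English
Source: 15th International Conference on Algorithms and Architectures for Parallel Processing, ICA3PP 2015
DOI: 10.1007/978-3-319-27122-4_48
Content type: Other
Source URL: [http://ir.pku.edu.cn/handle/20.500.11897/436881]
Collection: School of Electronics Engineering and Computer Science (信息科学技术学院)
Recommended citation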
GB/T 7714
Huang, Kun, Chen, Yifeng. Improving performance of floating point division on GPU and MIC. 2015-01-01.