范东睿  研究员  

高通量计算机研究中心主任

研究方向:众核处理器设计;高通量处理器设计;数据流处理器设计

所属部门:高通量计算机研究中心、计算机体系结构国家重点实验室

导师类别:博导计算机系统结构

联系方式:fandr@ict.ac.cn

个人网页:

简       历:

范东睿,中科院特聘研究员(骨干人才),博士生导师,中科院计算所高通量计算机研究中心主任。在国内外期刊、会议上发表论文120余篇,包括MICROHPCADACHotChipsPPoPPCGOPACT等领域顶级会议以及IEEE MicroTPDSTCTACO等领域顶级期刊。近年来在国内外应邀作学术报告30余次,已获授权/受理发明专利60余项,其中国际专利9项。担任过HPCAMICRO等顶级会议的程序委员会委员,以及ICPPIGCC等国际会议主席。 

主持和参加了十多项国家级项目,包括“973”“863”、核高基、国家自然科学基金重点项目、中科院先导A、先导C、国家重点研发计划、欧盟第七框架项目等。 

 研究工作经历(按时间倒排序) 

 1. 2017/1 — 至今,中科院计算所,研究员,博士生导师,高通量计算机研究中心主任 

 2. 2013/09 – 2016/12,中科院计算所,研究员,博士生导师,高性能计算机研究中心副主任,处理器结构实验室主任 

 3. 2010/11 – 2013/09,中科院计算所,计算机体系结构国家重点实验室,副研究员,博士生导师,处理器结构实验室主任 

 4. 2011/12 – 2012/06,美国 IBM 研究院,青年高级访问学者 

 5. 2006/07  – 2010/10,中科院计算所,前瞻研究实验室,副研究员,硕士生导师,微体系结构研究组组长 

 6. 2007/06 – 2007/12,美国特拉华大学,访问学者 

 7. 2005/07 – 2006/06,中科院计算所,微处理器研究中心,助理研究员 

最新更新参见国科大教师主页:  http://people.ucas.ac.cn/~fandongrui 

主要论著:

期刊文章: 

  1. Xiaochun Ye, Taoran Xiang, Xu Tan, Yujing Feng, Haibin Wu, Meng Wu, Dongrui Fan. Applying CNN on a Scientific Application Accelerator Based on Dataflow Architecture. CCF Transaction on High Performance Computing (CCF THPC). December 2019, Volume 1, Issue 3-4, pp 177-195.
  2. Xiaochun Ye, Xu Tan, Meng Wu, Yujing Feng, Da Wang, Hao Zhang, Songwen Pei, Dongrui Fan. An Efficient Dataflow Accelerator for Scientific Applications. Future Generation Computer Systems (FGCS), 2020.
  3. Dongrui Fan, Hao Zhang, Da Wang, Xiaochun Ye, Fenglong Song, Guojie Li, Ninghui Sun. Godson-T: An Efficient Many-Core Processor Exploring Thread-Level Parallelism. IEEE Micro 32(2): 38-47, 2012
  4. Dongrui Fan, Xiaowei Li, Guojie Li. New Methodologies for Parallel Architecture. Journal of Computer Science and Technology. 2011, 26(4): 578-587.
  5. Dongrui Fan, Nan Yuan, Junchao Zhang, Yongbin Zhou, Wei Lin, Fenglong Song, Xiaochun Ye, He Huang, Lei Yu, Guoping Long, and Hao Zhang. Godson-T: A Many-Core Processor for Efficient Multithreaded Program Executions. Journal of Computer Science and Technology (JCST), 2009, 24(6): 1061-1073.
  6. Xiaolong Xie, Yun Liang, Xiuhong Li, Yudong Wu, Guangyu Sun, Tao Wang, Dongrui Fan: CRAT: Enabling Coordinated Register Allocation and Thread-Level Parallelism Optimization for GPUs. IEEE Trans. Computers 67(6): 890-897 (2018)
  7. Hafiz Fahad SheikhIshfaq AhmadDongrui Fan: An Evolutionary Technique for Performance-Energy-Temperature Optimized Scheduling of Parallel Tasks on Multi-Core Processors. IEEE Trans. Parallel Distrib. Syst. 27(3): 668-681 (2016)
  8. Mingyu Yan, Zhaodong Chen, Lei Deng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, and Yuan Xie. Characterizing and Understanding GCNs on GPU.  IEEE Computer Architecture Letters (CAL). 2020
  9. Guangming Tang, Peyao Qu, Xiangyu Zheng, Jiahong Yang, Xiaochun Ye, Dongrui Fan, Ninghui Sun. Bit-Slice Butterfly Processing Units for 64-Point RSFQ FFT Processors. IEEE Trans. on Applied Superconductivity, 30(1) 2020.
  10. Ninghui Sun, Yungang Bao, Dongrui Fan. The rise of high-throughput computing. Frontiers Inf. Technol. Electron. Eng., 2018, 19(10): 1245-1250.
  11. Guangming Tang, Peiyao Qu, Xiaochun Ye, Dongrui Fan, Ninghui Sun. 32-Bit 4×4 Bit-Slice RSFQ Matrix Multiplier, IEEE Transactions on Applied Superconductivity, 28(7), October 2018.
  12. Guangming Tang, Peiyao Qu, Xiaochun Ye, Dongrui Fan. Logic Design of a 16-bit Bit-Slice Arithmetic Logic Unit for 32-/64-bit RSFQ Microprocessors. IEEE Transactions on Applied Superconductivity, 28(4), June 2018.
  13. Xu Tan, Xiaowei Shen, Xiaochun Ye, Da Wang, Dongrui Fan, Lunkai Zhang, Wenming Li, Zhimin Zhang, Zhimin Tang. A Non-Stop Double Buffering Mechanism for Dataflow Architecture. Journal of Computer Science and Technology (JCST), 33(1):145-157, Jan. 2018.
  14. Xu Tan, Xiaochun Ye, Xiaowei Shen, Yuanchao Xu, Da Wang, Lunkai Zhang, Wenming Li, Dongrui Fan, Zhimin Tang. A Pipelining Loop Optimization Method for Dataflow Architecture. Journal of Computer Science and Technology (JCST), 33(1):116-130, Jan. 2018.
  15. Xiaowei Shen, Xiaochun Ye, Xu Tan, Da Wang, Lunkai Zhang, Wenming Li, Zhimin Zhang, Dongrui Fan, Ninghui Sun. An Efficient Network-on-Chip Router for Dataflow Architecture. Journal of Computer Science and Technology (JCST), 32(1):1-15 Jan. 2017.
  16. Peng, Liu; Tan, Guangming; Kalia, Rajiv K.; Nakano, Aiichiro; Vashishta, Priya; Fan, Dongrui; Zhang, Hao; Song, Fenglong. Scalability study of molecular dynamics simulation on Godson-T many-core architecture. Journal of Parallel and Distributed Computing (JPDC), 2013, 73(11): 1469-1482.
  17. Haitao Wei, Mingkang Qin, Junqing Yu, Dongrui Fan and Guang R. Gao.” StreamTMC: Stream Compilation for Tiled Multi-core Architectures” Elsevier Journal of Parallel and Distributed Computing (JPDC), Volume 73, Issue 4, April 2013, Pages 484-494.
  18. Huimin CuiJingling XueLei WangYang YangXiaobing Feng, Dongrui Fan: Extendable pattern-oriented optimization directives. TACO 9(3): 14 (2012).
  19. Huimin Cui, Lei Wang, Dongrui Fan, Xiaobing Feng: Landing Stencil Code on Godson-T. Journal of Computer Science and Technology. 25(4): 886-894 (2010).

 

会议文章: 

  1. Mingyu Yan, Lei Deng, Xing Hu, Ling Liang, Yujing Feng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie. HyGCN: A GCN Accelerator with Hybrid Architecture. In the 26th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2020.
  2. Mingyu Yan, Xing Hu, Shuangchen Li, Abanti Basak, Han Li, Itir Akgun, Xin Ma, Yujing Feng, Peng Gu, Lei Deng, Xiaochun Ye, Zhimin Zhang, Dongrui Fan, Yuan Xie. Alleviating Irregularity in Graph Analytics Acceleration: a Hardware/Software Co-Design Approach. In The 52nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO). October 12-16, 2019. 
  3. Dongrui Fan, Wenming Li, Xiaochun Ye, Da Wang, Hao Zhang, Zhimin Tang, Ninghui Sun. SmarCo: An Efficient Many-Core Processor for High-Throughput Applications in Datacenters. In the 24th IEEE International Symposium on High-Performance Computer Architecture (HPCA), 2018.
  4. Yan Gao, Boxiao Liu, Nan Guo, Xiaochun Ye, Fang Wan, Haihang You, Dongrui Fan. C-MIDN: Coupled Multiple Instance Detection Network with Segmentation Guidance for Weakly Supervised Object Detection. ICCV 2019.
  5. Farzaneh ZokaeeMingzhe ZhangXiaochun YeDongrui FanLei Jiang: Magma: A Monolithic 3D Vertical Heterogeneous ReRAM-based Main Memory Architecture. DAC 2019: 115
  6. Xiaolong Xie, Yun Liang, Xiuhong Li, Yudong Wu, Guangyu Sun, Tao Wang, Dongrui Fan: Enabling coordinated register allocation and thread-level parallelism optimization for GPUs. MICRO 2015: 395-406.
  7. Guoping Long, Diana Franklin, Susmit Biswas, Pablo Ortiz, Jason Oberg, Dongrui Fan, Frederic T. Chong. Minimal Multi-Threading: Finding and Removing Redundant Instructions in Multi-Threaded Processors. In the proceedings of 43rd International Symposium on Microarchitecture (MICRO 43), Atlanta, Georgia, USA, Dec. 04~08, 2010.
  8. Lunkai Zhang; Dmitri Strukov; Hebatallah Saadeldeen; Dongrui Fan; Mingzhe Zhang; Diana Franklin. SpongeDirectory: Flexible Sparse Directories Utilizing Multi-Level Memristors. Parallelism in Architecture and Computing Techniques Conference (PACT),2014, Edmonton, Alberta, Canada.
  9. Dongrui Fan, Hao Zhang, Da Wang, Xiaochun Ye, Fenglong Song, Junchao Zhang, and Lingjun Fan. High-Efficient Architecture of Godson-T Many-Core Processor, In Proceedings of 23rd Symposium on Hot Chips, August 2011.
  10. Xiaochun Ye, Dongrui Fan, et al., High Performance Comparison-Based Sorting Algorithm on Many-Core GPUs, In Proceedings of IEEE International Parallel and Distributed Processing Symposium (IPDPS 2010), Apr., 2010.
  11. Huimin Cui, Jingling Xue, Lei Wang, Xiaobing Feng, Yang Yang, and Dongrui Fan. Extendable Pattern-Oriented Directives. 9th Annual IEEE/ACM International Symposium on Code Generation and Optimization (CGO). Chamonix, France, 2011.

科研项目:

中国科学院C类先导科技专项,XDC05000000,处理器与基础软件关键技术专项-高通量处理器关键技术课题,2020/01/01-2021/12/31,课题负责人 

[1]中科院A类先导科技专项:超导计算机研发专项-超导计算机系统集成技术、2018/01/01-2022/12/31、项目负责人; 

[2]国家自然科学基金重点项目:61732018、后E级时代的新型高能效处理器体系结构、2018/01/01- 2022/12/31、项目负责人; 

国家重点研发计划,2017YFC0803400,监管场所智能监控、预警防范关键技 

术研发与示范,2017/7-2020/12,课题负责人 

[3]国家973重点基础研究发展规划项目基金:2011CB302500、高通量计算系统的构建原理、支撑技术及云服务应用、2011/01-2015/08、课题一技术负责人; 

[4]核高基重大专项:千线程并行的众核CPU 体系结构和支撑技术研究、2014/01-2015/12、超级计算机处理器研发子课题负责人; 

[5]国家自然科学基金面上项目:61173007、众核体系结构中的渗透式延迟容忍方法研究、2012/01-2015/12、项目负责人; 

[6]中科院青年促进会基金:众核处理器设计、2012/01-2015/12、课题负责人; 

[7](横向)华为技术有限公司:20125152、高通量服务器研究项目、2012/02-2013/03、已结题、课题负责人; 

[8]核高基重大专项:2011ZX01028-001-002、超高性能CPU新型架构研究、2011/01-2011/12、分课题负责人; 

[9]北京市科技新星计划:2010B058、高通量众核处理器设计关键技术研究、2010/12-2013/12、已结题、课题负责人; 

[10]国家自然科学基金创新研究群体科学基金:60921002、超并行高效能计算机体系结构与设计方法研究、2010/01-2012/12、课题负责人; 

[11]国家高技术研究发展计划(863计划)智能感知与先进计算技术项目:2009AA01Z103、结合众核特征运行时系统关键技术研究、2009/01-2010/12、课题负责人; 

[12]北京市自然科学基金:4092044、适用于生物信息处理的众核结构设计方法研究、2009/01-2011/12、已结题、项目负责人; 

[13]欧盟第七框架项目( EUROPEAN COMMISSION 7th Framework Programme on Research, Technological Development and Demonstration )FP7-216693Multi-Objective Design Space Exploration of Multi-Processor SOC Architectures for Embedded Multimedia Applications2008/01-2010/12、已结题、课题负责人; 

[14]国家自然科学基金重点项目:60736012、高性能片上存储系统、2008/01-2011/12、已结题、子课题负责人; 

[15]国家973重点基础研究发展规划项目基金:2005CB321600、延长摩尔定律的微处理芯片新原理、新结构与新方法研究、2005/09-2010/12、已结题、课题一子课题负责人; 

获奖及荣誉:

曾获2013 年度北京市科学技术奖二等奖、2014 年度中科院卓越青年科学家奖、2017 年度北京市科学技术二等奖、2018 年度首都科技领军人才、2018 CCF-IEEE CS 青年科学家奖、2018 年共青团中央全国向上向善好青年、2019 年度北京市技术发明一等奖、2019 年中国科学院青年科学家奖等荣誉