郭崎  研究员  

研究方向:

所属部门:智能处理器研究中心、处理器芯片重点实验室

导师类别:博导计算机系统结构

联系方式:guoqi@ict.ac.cn

个人网页:http://novel.ict.ac.cn/qguo

简       历:

郭崎,中国科学院计算技术研究所研究员,长期从事处理器芯片及人工智能交叉研究。发表ACM/IEEE Trans.、《中国科学》等期刊及ISCA、MICRO、ASPLOS、HPCA、OSDI、PPoPP、AAAI、IJCAI、ICML、NeurIPS等会议论文百余篇,授权国内外发明专利百余项。获中国青年五四奖章、中国科学院青年科学家奖、中国计算机学会青年科技奖等。作为主要完成人,获国家自然科学二等奖、中国科学院杰出科技成就奖、世界互联网大会领先科技奖等。

2019年09月 — 今:中国科学院计算技术研究所,研究员

2015年11月 — 2019年09月:中国科学院计算技术研究所,副研究员

2014年04月 — 2015年11月:CMU,博士后

2012年07月 — 2014年04月:IBM中国研究院,研究员

2007年09月— 2012年07月:中国科学院计算技术研究所,硕博连读

2002年09月— 2007年07月:同济大学,计算机系,本科生

主要论著:

会议文章:

[1] Tianbo Liu, Xinkai Song, Zhifei Yue, Rui Wen, Xing Hu, Zhuoran Song, Yuanbo Wen, Yifan Hao, Wei Li, Zidong Du, Rui Zhang, Jiaming Guo, Di Huang, Shaohui Peng, Guangzhong Sun, Qi Guo, Tianshi Chen. Cambricon-SR: An Accelerator for Neural Scene Representation with Sparse Encoding Table. In: Proceedings of the International Symposium on Computer Architecture (ISCA), 2025 (CCF-A)

[2] Jianxing Xu, Yuanbo Wen, Zikang Liu, Ruibai Xu, Tingfeng Ruan, Jun Bi, Rui Zhang, Di Huang, Xinkai Song, Yifan Hao, Xing Hu, Zidong Du, Chongqing Zhao, Jiang Jie, Qi Guo. Mosaic: Exploiting Instruction-Level Parallelism on Deep Learning Accelerators with iTex Tessellation. In: Proceedings of International Conference on Architectural Support of for Programming Language and Operating Systems (ASPLOS), 2025. (CCF-A)

[3] Shuyao Cheng, Rui Zhang, Wenkai He, Pengwei Jin, Chongxiao Li, Zidong Du, Xing Hu, Yifan Hao, Guanglin Xu, Yuanbo Wen, Ling Li, Qi Guo, Yunji Chen. Automated Superscalar Processor Design by Learning Data Dependencies. In: Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), 2025. (CCF-A)

[4] Yi Chen, Yongwei Zhao, Yifan Hao, Yuanbo Wen, Yuntao Dai, Xiaqing Li, Yang Liu, Rui Zhang, Mo Zou, Xinkai Song, Xing Hu, Zidong Du, Huaping Chen, Qi Guo, Tianshi Chen. Cambricon-C: Efficient 4-Bit Matrix Unit via Primitivization. In: Proceedings of International Symposium on Microarchitecture (MICRO), 2024. (CCF-A)

[5] Qirui Zhou, Yuanbo Wen, Ruizhi Chen, Ke Gao, Weiqiang Xiong, Ling Li, Qi Guo, Yanjun Wu, Yunji Chen. QiMeng-GEMM: Automatically Generating High-Performance Matrix Multiplication Code by Exploiting Large Language Models. In: Proceedings of the Annual AAAI Conference on Artificial Intelligence (AAAI), 2025. (CCF-A)

[6] Shuyao Cheng, Pengwei Jin, Qi Guo, Zidong Du, Rui Zhang, Xing Hu, Yongwei Zhao, Yifan Hao, Xiangtao Guan, Husheng Han, Zhengyue Zhao, Ximing Liu, Xishan Zhang, Yuejie Chu, Weilong Mao, Tianshi Chen, Yunji Chen. Automated CPU Design by Learning from Input-Output Examples. In: Proceedings of International Joint Conference on Artificial Intelligence (IJCAI), 2024. (CCF-A)

[7] Huilai Chen, Yuanbo Wen, Limin Cheng, Shouxu Kuang, Yumeng Liu, Weijia Li, Ling Li, Rui Zhang, Xinkai Song, Wei Li, Qi Guo, Yunji Chen. AutoOS: Make Your OS More Powerful by Exploiting Large Language Models. In: Proceedings of the International Conference on Machine Learning (ICML), 2024. (CCF-A)

[8] Shuyao Cheng, Chongxiao Li, Zidong Du, Rui Zhang, Xing Hu, Xiaqing Li, Guanglin Xu, Yuanbo Wen, Qi Guo. Revisiting Automatic Pipelining: Gate-level Forwarding and Speculation. In: Proceedings of the Design Automation Conference (DAC), 2024. (CCF-A)

[9] Jun Bi, Qi Guo, Xiaqing Li, Yongwei Zhao, Yuanbo Wen, Yuxuan Guo, Enshuai Zhou, Xing Hu, Zidong Du, Ling Li, Huaping Chen, Tianshi Chen. Automatically constrained high-performance library generation for deep learning accelerators. In: Proceedings of International Conference on Architectural Support of for Programming Language and Operating Systems (ASPLOS), 2023. (CCF-A)

[10] Yifan Hao, Yongwei Zhao, Chenxiao Liu, Shuyao Cheng, Xiaqing Li, Xing Hu, Zidong Du, Qi Guo, Zhiwei Xu, Tianshi Chen. Cambricon-P: A bitflow architecture for arbitrary precision computing. In: Proceedings of International Symposium on Microarchitecture (MICRO), 2022. (CCF-A, Best Paper Runner-Up Award)

期刊文章:

[1] Xiyue Yu, Jun Bi, Yuanbo Wen, Jianxing Xu, Di Huang, Jiaming Guo, Wei Li, Zidong Du, Jing Li, Tianshi Chen, Qi Guo. Swift: High Parallelism Program Generation of Tensor Operators for Accelerating Deep Learning Inference. ACM Transactions on Architecture and Code Optimization (ACM TACO), 22(4): 1-26 (2025) (CCF-A)

[2] Ximing Liu, Yongwei Zhao, Mo Zou, Yang Liu, Yifan Hao, Xiaqing Li, Rui Zhang, Yuanbo Wen, Xing Hu, Zidong Du, Qi Guo, Tianshi Chen. VariPar: Variation-Aware Workload Partitioning in Chiplet-Based DNN Accelerators. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD) 44(12): 4643-4656 (2025) (CCF-A)

[3] Pengwei Jin, Zhe Fan, Yongwei Zhao, Zidong Du, Hongrui Guo, Ziyuan Nan, Yifan Hao, Chongxiao Li, Tianyun Ma, Zhenxing Zhang, Xiaqing Li, Wei Li, Xing Hu, Qi Guo, Zhiwei Xu, Tianshi Chen. SaaP: Rearchitect SoC-as-a-Processor to Orchestrate Hardware Heterogeneity. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD) 44(10): 3962-3975 (2025) (CCF-A)

[4] Jun Bi, Yuanbo Wen, Xiaqing Li, Yongwei Zhao, Yuxuan Guo, Enshuai Zhou, Xing Hu, Zidong Du, Ling Li, Huaping Chen, Tianshi Chen, Qi Guo. Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators. IEEE Transactions on Computers (IEEE TC) 74(1): 155-169 (2025) (CCF-A)

[5] Xiaqing Li, Qi Guo, Guangyan Zhang, Siwei Ye, Guanhua He, Yiheng Yao, Rui Zhang, Yifan Hao, Zidong Du, Weimin Zheng. FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning with Partitioning and Parallelism of Search Space. IEEE Transactions on Parallel and Distributed Systems (IEEE TPDS) 35(7): 1174-1188 (2024) (CCF-A)

[6] Zidong Du, Qi Guo, Yongwei Zhao, Xi Zeng, Ling Li, Limin Cheng, Zhiwei Xu, Ninghui Sun, Yunji Chen. Breaking the Interaction Wall: A DLPU-Centric Deep Learning Computing System. IEEE Transactions on Computers (IEEE TC) 71(1): 209-222 (2022) (CCF-A)

[7] Yuanbo Wen, Qi Guo, Zidong Du, Jianxing Xu, Zhenxing Zhang, Xing Hu, Wei Li, Rui Zhang, Chao Wang, Xuehai Zhou, Tianshi Chen. Enabling One-Size-Fits-All Compilation Optimization for Inference Across Machine Learning Computers. IEEE Transactions on Computers (IEEE TC) 71(9): 2313-2326 (2022) (CCF-A)

[8] Xinkai Song, Tian Zhi, Zhe Fan, Zhenxing Zhang, Xi Zeng, Wei Li, Xing Hu, Zidong Du, Qi Guo, Yunji Chen. Cambricon-G: A polyvalent energy-efficient accelerator for dynamic graph neural networks. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD) 41(1): 116-128 (2022) (CCF-A)

[9] Zidong Du, Qi Guo, Tian Zhi, Yongwei Zhao, Yunji Chen, and Zhiwei Xu. Self-aware Neural Network Systems: A Survey and New Perspective. Proceedings of IEEE (PIEEE) 108(7): 1047-1067 (2020) (CCF-A)

[10] Shengyuan Zhou, Qi Guo, Zidong Du, Dao-Fu Liu, Tianshi Chen, Ling Li, Shaoli Liu, Jinhong Zhou, Olivier Temam, Xiaobing Feng, Xuehai Zhou, Yunji Chen. ParaML: A Polyvalent Multicore Accelerator for Machine Learning. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (IEEE TCAD) 39(9): 1764-1777 (2020) (CCF-A)

科研项目:

获奖及荣誉:

2025年 中国计算机学会青年科技奖

2025年 中国青年五四奖章

2024年 MICRO会议名人堂

2022年 中国科学院青年科学家奖

2022年 MICRO Best Paper Runner-Up Award

2022年 中国科学院青年创新促进会优秀会员

2020年 国家自然科学二等奖

2019年 中国科学院杰出科技成就奖

2018年 国家级青年人才

2018年 中国科学院青年创新促进会会员

2015年 中国科协青年人才托举计划暨CCF青年人才发展计划