Yuanwei (Kevin) Fang
Biography
Yuanwei currently works at Modular, a startup focused on AI infrastructure. Previously, he was an AI systems researcher at Alibaba DAMO Academy. He obtained his Ph.D. from the Computer Science Department of The University of Chicago in 2019 under the supervision of Prof. Andrew A. Chien. He is interested in the entire computing stack, including computer architecture, compilers, systems, databases, and AI. At Alibaba, he mainly worked on AI software (compilers and runtime systems) and LLM optimization. During his Ph.D., Yuanwei worked on accelerating data analysis in the data lake paradigm using both software and hardware techniques. He is the recipient of the 2016 Qualcomm Roberto Padovani Award in recognition of his innovative contributions. He also received the CERES Research Award from the CS Department of The University of Chicago. He received his B.S. in Microelectronics from Fudan University.
News
11/2023 I joined Modular, beginning my startup journey!
9/2019 I joined Alibaba DAMO Academy as a research scientist.
6/2019 The ACCORDA paper was accepted to VLDB'19.
6/2019 I earned my Ph.D. from The University of Chicago!
9/2018 I completed my internship at Google (rated "superb").
6/2018 I worked with the Google Datacenter platform team and the YouTube data warehouse team (Procella) to speed up YouTube's OLAP data analytics system (host: Jichuan, co-host: Biswa).
Publications
[arxiv 2023] Y. Fang, Z. Liu, Y. Lu, J. Liu, J. Li, Y. Jin, J. Chen, Y. Chen, H. Zheng, Y. Xie, "NPS: A Framework for Accurate Program Sampling Using Graph Neural Network", arXiv:2304.08880, 2023.
[ISCA 2022] S. Li, D. Niu, Y. Wang, W. Han, Z. Zhang, T. Guan, Y. Guan, H. Liu, L. Huang, Z. Du, F. Xue, Y. Fang, H. Zheng, Y. Xie, "Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network", in Proc. 49th Annual International Symposium on Computer Architecture.
[ISSCC 2022] D. Niu, S. Li, Y. Wang, W. Han, Z. Zhang, Y. Guan, T. Guan, F. Sun, F. Xue, L. Duan, Y. Fang, H. Zheng, X. Jiang, S. Wang, F. Zuo, Y. Wang, B. Yu, Q. Ren, Y. Xie, "184QPS/W 64Mb/mm² 3D Logic-to-DRAM Hybrid Bonding with Process-Near-Memory Engine for Recommendation System", 2022 IEEE International Solid-State Circuits Conference.
[PVLDB 2019] Yuanwei Fang, Chen Zou, and Andrew A. Chien. "Accelerating Raw Data Analysis with the ACCORDA Software and Hardware Architecture", in Proc. 45th International Conference on Very Large Data Bases, August 2019. [Paper]
[HCW 2019] Arjun Rawal, Yuanwei Fang, and Andrew A. Chien. "Programmable Acceleration for Sparse Matrices in a Data-movement Limited World", in Heterogeneous Computing Workshop 2019 affiliated with IPDPS'19. [Paper]
[MICRO 2017] Yuanwei Fang, Chen Zou, Aaron J. Elmore, and Andrew A. Chien. "UDP: A Programmable Accelerator for Extract-Transform-Load Workloads and More", in Proc. 50th Annual IEEE/ACM International Symposium on Microarchitecture, October 2017 [acceptance rate: 61/327 = 18.7%]. [Paper]
[TR 2017] Yuanwei Fang and Andrew A. Chien. "UDP System Interface and Lane ISA Definition", in The University of Chicago Technical Report, TR-2017-05, Aug 2017. [Spec]
[IMTC 2016] Yuanwei Fang, Andrew A. Chien, Andrew Lehane, and Lee Barford, "Performance of Parallel Prefix Circuit Transition Localization of Pulsed Waveforms", in Proc. 2016 IEEE International Instrumentation and Measurement Technology Conference. [Paper]
[MICRO 2015] Yuanwei Fang, Tung T. Hoang, Michela Becchi, and Andrew A. Chien. "Fast support for unstructured data processing: the unified automata processor", in Proc. 48th Annual IEEE/ACM International Symposium on Microarchitecture, December 2015 [acceptance rate: 61/283 = 21.6%]. [Paper]
[CAN 2015] Andrew A. Chien, Tung T. Hoang, Dilip Vasudevan, Yuanwei Fang, and Amir Shambayati, "10x10: A case study for federated heterogeneous computing", in ACM SIGARCH Computer Architecture News, Volume 43 Issue 3, May 2015. [Paper]
[TR 2015] Yuanwei Fang, Andrew Lehane, and Andrew A. Chien, "EffCLiP: Efficient coupled-linear packing for finite automata", in The University of Chicago Technical Report, TR-2015-05, May 2015. [Paper]
[ASBD 2014] Yuanwei Fang, Raihan ur Rasool, Dilip Vasudevan, and Andrew A. Chien, "Generalized pattern matching micro-engine", in 4th Workshop on Architectures and Systems for Big Data (ASBD) held with ISCA'14. [Paper]
[APCCAS 2012] Yang Hong, Qingqing Yang, Yuanwei Fang, Xiaofang Zhou, Gerald E. Sobelman. "A Novel Hardware-oriented Decoding Algorithm for Non-binary LDPC Codes", in Proc. 2012 IEEE Asia Pacific Conference on Circuits and Systems.