Ph.D. Candidate
Ant Group (蚂蚁集团)
Email: shihao.wsh at antgroup.com
Biography
2017-now Mobile AI Engine - xNN Team, Ant Group, Hangzhou, China
2014-2017 Ph.D. Candidate in Optimization Technology Lab (Prof. Yoshimura Lab), Waseda University, Japan
2012-2014 M. Eng. in Prof. Goto Lab, Waseda University, Japan
2008–2012 B. Eng. in Electronic Engineering, Shanghai Jiao Tong University, China
2002–2008 The Affiliated Middle School of Henan Normal University, China
Mobile AI Algorithms and Applications
ASIC design for 8K UHDTV video codec for HEVC standard, CNN accelerator
Energy-efficient Scheduling Method with Cross-loop Model for customized CNN Accelerators,
Kaiyi Yang, Shihao Wang, Jianbin Zhou and Takeshi Yoshimura,
IEEE International Symposium on Circuit and Systems (ISCAS), 2017, accepted.
Chain-NN: An Energy-Efficient 1D Chain Architecture for Accelerating Deep Convolutional Neural Networks,
Shihao Wang, Dajiang Zhou, Xushen Han and Takeshi Yoshimura,
Design, Automation and Test in Europe, (DATE), 2017. [Link]
CNN-MERP: An FPGA-Based Memory-Efficient Reconfigurable Processor for Forward and Backward Propagation of Convolutional Neural Network,
Xushen Han, Dajiang Zhou, Shihao Wang and Shinji Kimura,
IEEE International Conference on Computer Design (ICCD), 2017. [Link]
An 8K H.265/HEVC Video Decoder Chip with a New System Pipeline Design,”
Dajiang Zhou, Shihao Wang, Heming Sun, Jianbin Zhou, Jiayi Zhu, Yijin Zhao, Jinjia Zhou, Shuping Zhang, Shinji Kimura, Takeshi Yoshimura, and Satoshi Goto, “
IEEE Journal of Solid State Circuits (JSSC), Jan. 2017. [Link]
VLSI implementation of HEVC motion compensation with distance biased direct cache mapping for 8K UHDTV applications,
Shihao Wang, Dajiang Zhou, Jianbin Zhou, Takeshi Yoshimura, and Satoshi Goto,
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2016. [Link]
A Dual-Clock VLSI Design of H.265 Sample Adaptive Offset Estimation for 8K Ultra-HD TV Encoding,
Jianbin Zhou, Dajiang Zhou, Shihao Wang, Shuping Zhang, Takeshi Yoshimura, and Satoshi Goto,
IEEE Transactions on Very Large Scale Integration Systems (TVLSI), 2016. [Link]
A 4Gpixel/s 8/10b H.265/HEVC video decoder chip for 8K Ultra HD applications,
Dajiang Zhou, Shihao Wang, Heming Sun, Jianbin Zhou, Jiayi Zhu, Yijin Zhao, Jinjia Zhou, Shuping Zhang, Shinji Kimura, Takeshi Yoshimura, and Satoshi Goto,
IEEE International Solid-State Circuits Conference (ISSCC), San Francisco, USA, pp. 266-267, February 2016. [Link]
High performance VLSI architecture of H.265/HEVC intra prediction for 8K UHDTV video decoder,
Jianbin Zhou, Dajiang Zhou, Shihao Wang, Takeshi Yoshimura, and Satoshi Goto,
IEICE Transactions on Fundamentals, Vol. E98-A, No. 12, pp. 2519-2527, December 2015. [Link]
Unified parameter decoder architecture for H.265/HEVC motion vector and boundary strength decoding,
Shihao Wang, Dajiang Zhou, Jianbin Zhou, Takeshi Yoshimura, and Satoshi Goto,
IEICE Transactions on Fundamentals, Vol. E98-A, No. 7, pp. 1356-1365, July 2015. [Link]
Unified VLSI Architecture of Motion Vector and Boundary Strength Parameter Decoder for 8K UHDTV HEVC Decoder,
Shihao Wang, Dajiang Zhou, Jianbin Zhou, Takeshi Yoshimura and Satoshi Goto,
Pacific Rim Conference on Multimedia (PCM), Kuching, Malaysia, December, 2014.. [Link]
Motion compensation architecture for 8K UHDTV HEVC decoder,
Shihao Wang, Dajiang Zhou, and Satoshi Goto,
IEEE International Conference on Multimedia and Expo (ICME), Chengdu, China, July, 2014. [Link]
Academic Experience:
2016 – Now | Deep Convolutional Neural Network (CNN) Accelerator
• Developed a novel 1D chain architecture based chip-level CNN accelerator, named Chain-NN. This work has been accepted by DATE 2017, a top design automation conference. Its energy efficiency and throughput performance surpass all the existing state-of-the-art works.
• Developed an FPGA-based demonstration system of Chain-NN.
• Participated in researching on optimizing CNN algorithms and architectures for FPGA platforms.
2012 – 2015 | HEVC Decoder Chip for 8K Ultra High Definition TV
• Core participant (the 2nd in contribution) for the world first HEVC 8K decoder chip. The chip was presented at the No.1 integrated circuits conference ISSCC and won the conference’s highest honored Outstanding Far East Paper Award.
• Conducted algorithm- and architecture- level development of key components of the chip including cache, motion compensation and parameter decoder.
• Conducted system-level FPGA designs for chip’s verification and real-time demonstration.