Kihyuk Sohn

I am a Research Scientist at Google Cloud AI in Sunnyvale, CA. Prior to joining Google, I was a researcher in Media Analytics group of NEC Laboratories America. I completed my Ph.D. at University of Michigan under the supervision by professor Honglak Lee. I have broad interest in machine learning and computer vision. Specifically, my research focuses on supervised and unsupervised deep representation learning with applications to computer vision, audio recognition, and text processing, using graphical models that are invariant to many factors of variation for robust perception from complex and multimodal data.

Experience

Curriculum Vitae [pdf]

July 2019 ~

Research Scientist, Google Cloud AI

July 2015 ~ July 2019

Researcher, NEC Laboratories America

September 2008 ~ June 2015

Ph.D. in Electrical Engineering: Systems, University of Michigan, Ann Arbor

Thesis advisor : Professor Honglak Lee

March 2003 ~ February 2008

Korea Advanced Institute of Science and Technology (KAIST), Daejeon, Korea

B.S. in Electrical Engineering and Computer Science and Mathematical Science

Contact information

Email:

kihyuks [at] google [dot] com

kihyuk.sohn [at] gmail [dot] com

Publications

[25] Domain Adaptation for Structured Output via Discriminative Patch Representations.

Yi-Hsuan Tsai, Kihyuk Sohn, Samuel Schulter and Manmohan Chandraker

To appear in Proceedings of the IEEE International Conference on Computer Vision (ICCV), 2019 (oral presentation). [pdf][arXiv]

[24] Gotta Adapt ’Em All: Joint Pixel and Feature-Level Domain Adaptation for Recognition in the Wild.

Luan Tran, Kihyuk Sohn, Xiang Yu, Xiaoming Liu and Manmohan Chandraker

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. [pdf][supp][arXiv]

[23] Feature Transfer Learning for Face Recognition with Under-Represented Data.

Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu and Manmohan Chandraker

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019. [pdf][supp][arXiv]

[22] Unsupervised Domain Adaptation for Distance Metric Learning.

Kihyuk Sohn, Wenling Shang, Xiang Yu and Manmohan Chandraker

In International Conference on Learning Representations (ICLR), 2019. [pdf]

[21] Attentive Conditional Channel-Recurrent Autoencoding for Attribute-Conditioned Face Synthesis.

Wenling Shang and Kihyuk Sohn

In Winter Conference on Applications of Computer Vision (WACV), 2019. [pdf][code]

[20] Learning to Adapt Structured Output Space for Semantic Segmentation.

Yi-Hsuan Tsai*, Wei-Chih Hung*, Samuel Schulter, Kihyuk Sohn, Ming-Hsuan Yang and Manmohan Chandraker

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018 (spotlight) (* indicates equal contribution). [pdf][project page]

[19] Channel-Recurrent Autoencoding for Image Modeling.

Wenling Shang, Kihyuk Sohn, Yuandong Tian

In Winter Conference on Applications of Computer Vision (WACV), 2018. [pdf][arXiv][code]

[18] Unsupervised Domain Adaptation for Face Recognition in Unlabeled Videos.

Kihyuk Sohn, Sifei Liu, Guangyu Zhong, Xiang Yu, Ming-Hsuan Yang, Manmohan Chandraker

In International Conference on Computer Vision (ICCV), 2017. [pdf][arXiv]

[17] Towards Large-Pose Face Frontalization.

Xi Yin, Xiang Yu, Kihyuk Sohn, Xiaoming Liu, Manmohan Chandraker

In International Conference on Computer Vision (ICCV), 2017. [pdf][arXiv]

[16] Feature Reconstruction Disentangling for Pose-invariant Face Recognition.

Xi Peng, Xiang Yu, Kihyuk Sohn, Dimitris N. Metaxas, Manmohan Chandraker

In International Conference on Computer Vision (ICCV), 2017. [pdf][arXiv]

[15] Exploring Normalization in Deep Residual Networks with Concatenated Rectified Linear Units.

Wenling Shang, Justin Chiu, Kihyuk Sohn

In Association for the Advancement of Artificial Intelligence (AAAI), 2017. [pdf]

[14] Improved Deep Metric Learning with Multi-class N-pair Loss Objective.

Kihyuk Sohn

In Advances in Neural Information Processing Systems (NIPS), 2016. [pdf][bib]

[13] Attribute2Image: Conditional Image Generation from Visual Attributes.

Xinchen Yan, Jimei Yang, Kihyuk Sohn, Honglak Lee

In European Conference on Computer Vision (ECCV), 2016. [pdf][arXiv][code]

[12] Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units.

Wenling Shang, Kihyuk Sohn, Diogo Almeida, Honglak Lee

In Proceedings of the 33rd International Conference on Machine Learning (ICML), 2016. [pdf][arXiv]

[11] Discriminative Training of Structured Dictionaries via Block Orthogonal Matching Pursuit.

Wenling Shang, Kihyuk Sohn, Honglak Lee, Anna Gilbert

In SIAM International Conference on Data Mining (SDM), 2016 [pdf]

[10] Learning Structured Output Representation using Deep Conditional Generative Models.

Kihyuk Sohn, Xinchen Yan and Honglak Lee.

In Advances in Neural Information Processing Systems (NIPS), 2015 [pdf][supp][bib][code]

[9] Improving Object Detection with Deep Convolutional Networks via Bayesian Optimization and Structured Prediction.

Yuting Zhang, Kihyuk Sohn, Ruben Villegas, Gang Pan and Honglak Lee

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015 (oral presentation). [pdf][supp][tech report][code]

OpenCV People’s Vote Winning Paper [link]

[8] Improved Multimodal Deep Learning with Variation of Information.

Kihyuk Sohn, Wenling Shang and Honglak Lee

In Advances in Neural Information Processing Systems (NIPS), 2014 [pdf][pdf (full)][bib][github]

[7] Learning to Disentangle Factors of Variation with Manifold Interaction.

Scott Reed, Kihyuk Sohn, Yuting Zhang and Honglak Lee

In Proceedings of the 31st International Conference on Machine Learning (ICML), 2014. [pdf][bib][code]

[6] Augmenting CRFs with Boltzmann Machine Shape Priors for Image Labeling.

Kihyuk Sohn*, Andrew Kae*, Honglak Lee and Erik Learned-Miller.

In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2013. [pdf][bib][project page][code] (* indicates equal contribution.)

[5] Learning and Selecting Features Jointly with Point-wise Gated Boltzmann Machines.

Kihyuk Sohn, Guanyu Zhou, Chansoo Lee, and Honglak Lee.

In Proceedings of the 30th International Conference on Machine Learning (ICML), 2013. [pdf][bib][supp][project page][code]

(A previous version was presented at the NIPS Workshop on Deep Learning and Unsupervised Feature Learning, 2012.)

[4] Learning Invariant Representations with Local Transformations.

Kihyuk Sohn and Honglak Lee.

In Proceedings of the 29th International Conference on Machine Learning (ICML), 2012. [pdf][bib][github]

[3] Online Incremental Feature Learning with Denoising Autoencoders.

Guanyu Zhou, Kihyuk Sohn, and Honglak Lee.

In Proceedings of the 15th International Conference on Artificial Intelligence and Statistics (AISTATS), JMLR W&CP 22, 2012. [pdf][bib][supp] (oral presentation)

(A previous version was presented at the NIPS Workshop on Deep Learning and Unsupervised Feature Learning, 2011.)

[2] An interpretation of the Cover and Leung capacity region for the MAC with feedback through stochastic control.

Achilleas Anastasopoulos and Kihyuk Sohn.

In Proceedings of IEEE International Conference on Communications (ICC), 2012. [pdf][bib]

[1] Efficient Learning of Sparse, Distributed, Convolutional Feature Representations for Object Recognition.

Kihyuk Sohn, Dae Yon Jung, Honglak Lee, and Alfred Hero III.

In Proceedings of 13th International Conference on Computer Vision (ICCV), 2011. [pdf][bib]

Software

  • Github (last updated: 2019.06.24)
    • Learning Structured Output Representation using Deep Conditional Generative Models. In NIPS 2015. [pdf][supp][bib][code]
    • Improved Multimodal Deep Learning with Variation of Information. In NIPS 2014. [pdf][pdf (full)][bib][github]
    • Learning Invariant Representations with Local Transformations. In ICML 2012. [pdf][bib][github]
  • Disentangling Boltzmann machine
    • Learning to Disentangle Factors of Variation with Manifold Interaction. In ICML 2014. [pdf][bib]
  • GLOC (last updated: 2013.07.24)
    • related publication: Augmenting CRFs with Boltzmann Machine Shape Priors for Image Labeling. In CVPR 2013. [pdf][bib][project page]
  • Point-wise Gated Boltzmann machine (last updated: 2013.06.18)
    • related publication: Learning and Selecting Features Jointly with Point-wise Gated Boltzmann Machines. In ICML 2013. [pdf][bib][supp][project page]