Kota Yamaguchi
Principal Research Scientist at CyberAgent AI Lab
Google Scholar / Semantic Scholar / Github / X / LinkedIn
Principal Research Scientist at CyberAgent AI Lab
Google Scholar / Semantic Scholar / Github / X / LinkedIn
Kota Yamaguchi is a principal research scientist at CyberAgent, where he is leading the research organization and also working on computer vision and machine learning for graphic design. Before joining CyberAgent, he was an assistant professor at Tohoku University from 2014 to 2017. He received a Ph.D. degree in Computer Science from Stony Brook University in 2014. He received an MS in 2008 and a BE in 2006, both from University of Tokyo.
Tomoyuki Suzuki, Kangjun Liu, Naoto Inoue, Kota Yamaguchi, "LayerD: Decomposing Raster Graphic Designs into Layers", ICCV 2025.
Jan Zdenek, Wataru Shimoda, Kota Yamaguchi, "OTR: Synthesizing Overlay Text Dataset for Text Removal", ACMMM Dataset Track 2025.
Kotaro Kikuchi, Ukyo Honda, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi, "Multimodal Markup Document Models for Graphic Design Completion", ACMMM 2025. [arXiv] [Project]
Daichi Haraguchi, Wataru Shimoda, Kota Yamaguchi, Seiichi Uchida, "Total Disentanglement of Font Images into Style and Character Class Features", ICDAR 2025. [arXiv]
Wataru Shimoda, Naoto Inoue, Daichi Haraguchi, Hayato Mitani, Seichi Uchida, Kota Yamaguchi, "Type-R: Automatically Retouching Typos for Text-to-Image Generation", CVPR 2025. [arXiv]
Daichi Haraguchi, Naoto Inoue, Wataru Shimoda, Hayato Mitani, Seiichi Uchida, Kota Yamaguchi, "Can GPTs Evaluate Graphic Design Based on Design Principles?" SIGGRAPH Asia Technical Communications 2024. [arXiv]
Tomoyuki Suzuki, Kotaro Kikuchi, Kota Yamaguchi, "Fast Sprite Decomposition from Animated Graphics", ECCV 2024. [arXiv] [Project]
Naoto Inoue, Kento Masui, Wataru Shimoda, Kota Yamaguchi, "OpenCOLE: Towards Reproducible Automatic Graphic Design Generation", CVPR Workshops 2024. [arXiv] [Project]
Daichi Horita, Naoto Inoue, Kotaro Kikuchi, Kota Yamaguchi, Kiyoharu Aizawa, "Retrieval-Augmented Layout Transformer for Content-Aware Layout Generation", CVPR 2024. [arXiv] [Project]
Shoko Sawada, Tomoyuki Suzuki, Kota Yamaguchi, Masashi Toyoda, "Visual Explanation for Advertising Creative Workflow", CHI EA 2024. [ACM]
Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi, "Towards Diverse and Consistent Typography Generation", WACV 2024. [arXiv]
Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi, “LayoutDM: Discrete Diffusion Model for Controllable Layout Generation”, CVPR 2023. [arXiv] [Project]
Naoto Inoue, Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi, “Towards Flexible Multi-modal Document Models”, CVPR 2023. [arXiv] [Project]
Kotaro Kikuchi, Naoto Inoue, Mayu Otani, Edgar Simo-Serra, Kota Yamaguchi, "Generative Colorization of Structured Mobile Web Pages", WACV 2023. [arXiv] [Project]
Kotaro Kikuchi, Mayu Otani, Kota Yamaguchi, Edgar Simo-Serra, "Modeling Visual Containment for Web Page Layout Optimization", Pacific Graphics 2021. [PDF] [Project]
Wataru Shimoda, Daichi Haraguchi, Seiichi Uchida, Kota Yamaguchi, "De-rendering Stylized Texts", ICCV 2021. [arXiv] [Github]
Kota Yamaguchi, "CanvasVAE: Learning to Generate Vector Graphic Documents", ICCV 2021. [arXiv] [Github]
Kotaro Kikuchi, Edgar Simo-Serra, Mayu Otani, Kota Yamaguchi, "Constrained Graphic Layout Generation via Latent Optimization", ACMMM 2021. [arXiv] [Github] [Project]
Kotaro Kikuchi, Kota Yamaguchi, Edgar Simo-Serra, Tetsunori Kobayashi, "Regularized Adversarial Training for Single-Shot Virtual Try-On", ICCV Workshops 2019. [CVF]
Yuto Shinahara, Takuro Karamatsu, Daisuke Harada, Kota Yamaguchi, Seiichi Uchida, "Serif or Sans: Visual Font Analytics on Book Covers and Online Advertisements", ICDAR 2019. [arXiv]
Tianlu Wang, Kota Yamaguchi, Vicente Ordonez, "Feedback-prop: Convolutional Neural Network Inference under Partial Evidence", CVPR 2018. [arXiv] [Github]
Kota Yamaguchi, Takayuki Okatani, Takayuki Umeda, Kazuhiko Murasaki, Kyoko Sudo, "End-to-end learning potentials for structured attribute prediction", arXiv preprint arXiv:1708.01892. [arXiv]
Shuohao Li, Kota Yamaguchi, Takayuki Okatani, "Attention to Describe Products with Attributes", MVA 2017. [IEEE]
Pongsate Tangseng, Kota Yamaguchi, Takayuki Okatani, "Recommending outfits from personal closet", ICCV Workshops 2017. [arXiv] [project]
Takuya Yashima, Naoaki Okazaki, Kentaro Inui, Kota Yamaguchi, Takayuki Okatani, "Learning to Describe E-Commerce Images from Noisy Online Data", ACCV 2016. [Springer]
Masayasu Muraoka, Sumit Maharjan, Masaki Saito, Kota Yamaguchi, Naoaki Okazaki, Takayuki Okatani, Kentaro Inui, "Recognizing Open-Vocabulary Relations between Objects in Images", PACLIC 2016. [PDF]
Pongsate Tangseng, Zhipeng Wu, Kota Yamaguchi, "Looking at Outfit to Parse Clothing", arXiv preprint arXiv:1703.01386. [arXiv] [project]
Sirion Vittayakorn, Takayuki Umeda, Kazuhiko Murasaki, Kyoko Sudo, Takayuki Okatani, Kota Yamaguchi, "Automatic Attribute Discovery with Neural Activations", ECCV 2016. [arXiv]
Vicente Ordonez, Xufeng Han, Polina Kuznetsova, Girish Kulkarni, Margaret Mitchell, Kota Yamaguchi, Karl Stratos, Amit Goyal, Jesse Dodge, Alyssa Mensch, Hal Daume III, Alexander C Berg, Yejin Choi, Tamara L Berg, "Large Scale Retrieval and Generation of Image Descriptions", IJCV 2015. [Springer]
Kota Yamaguchi, Takayuki Okatani, Kyoko Sudo, Kazuhiko Murasaki, Yukinobu Taniguchi, "Mix and Match: Joint Model for Clothing and Attribute Recognition", BMVC 2015. [BMVA]
Sirion Vittayakorn, Kota Yamaguchi, Alexander C Berg, Tamara L Berg, "Runway to Realway: Visual Analysis of Fashion", WACV 2015. [IEEE]
Kota Yamaguchi, Tamara L Berg, Luis E Ortiz, "Chic or Social: Visual Popularity Analysis in Online Fashion Networks", ACM Multimedia 2014. [ACM]
M Hadi Kiapour, Kota Yamaguchi, Alexander C Berg, Tamara L Berg, "Hipster Wars: Discovering Elements of Fashion Styles", ECCV 2014. [Springer]
Kota Yamaguchi, M Hadi Kiapour, Luis E Ortiz, Tamara L Berg, "Retrieving Similar Styles to Parse Clothing", TPAMI 2014. [IEEE]
Kota Yamaguchi, M Hadi Kiapour, Tamara L Berg, "Paper Doll Parsing: Retrieving Similar Styles to Parse Clothing Items", ICCV 2013. [IEEE] [Github]
Kota Yamaguchi, M Hadi Kiapour, Luis E Ortiz, Tamara L Berg, "Parsing Clothing in Fashion Photographs", CVPR 2012. [IEEE]
Alexander C Berg, Tamara L Berg, Hal Daume III, Jesse Dodge, Amit Goyal, Xufeng Han, Alyssa Mensch, Margaret Mitchell, Aneesh Sood, Karl Stratos, Kota Yamaguchi, "Understanding and Predicting Importance in Images", CVPR 2012. [IEEE]
Jesse Dodge, Amit Goyal, Xufeng Han, Alyssa Mensch, Margaret Mitchell, Karl Stratos, Kota Yamaguchi, Yejin Choi, Hal Daume III, Alexander C Berg, Tamara L Berg, "Detecting visual text", NAACL 2012. [ACM]
Margaret Mitchell, Jesse Dodge, Amit Goyal, Kota Yamaguchi, Karl Stratos, Xufeng Han, Alyssa Mensch, Alexander C Berg, Tamara L Berg, Hal Daume III, "Midge: Generating Image Descriptions From Computer Vision Detections", European Chapter of the Association for Computational Linguistics, EACL 2012. [PDF]
Kota Yamaguchi, Alexander C Berg, Luis E Ortiz, Tamara L Berg, "Who are you with and where are you going?", CVPR 2011. [IEEE]
Kota Yamaguchi, Takashi Komuro, Masatoshi Ishikawa, "PTZ Control with Head Tracking for Video Chat", ACM CHI 2009 Extended Abstracts.
Kota Yamaguchi, Yoshihiro Watanabe, Takashi Komuro, Masatoshi Ishikawa, "Interleaved Pixel Lookup for Embedded Computer Vision", Fourth IEEE Workshop on Embedded Computer Vision (ECVW).
Kota Yamaguchi, Yoshihiro Watanabe, Takashi Komuro, Masatoshi Ishikawa, "Design of a Massively Parallel Vision Processor based on Multi-SIMD Architecture", 2007 IEEE International Symposium on Circuits and Systems (ISCAS2007), pp.3498-3501.
ACCV AC (2020, 2022), Tutorial Chair (2024), Financial Chair (2026)
ICCV AC (2023)
CVPR GDUG workshop organizer (2024)
MIRU program co-chair (2015), AC (2016-2018), financial chair (2022), industry chair (2024)