Win Pa pa
She got B.Sc.(Maths) degree from Mandalay University, M.I.Sc (Master of Information Science) and PhD(Information Technology) from University of Computer Studies, Yangon, Myanmar in 2001, 2004 and 2009. She is working as an Professor and have been doing research at Natural Language Processing lab of UCSY since August, 2009. Her Ph.D thesis was on "Myanmar Word Segmentation" which is essential for Myanmar NLP and she is still doing research on Word Segmentation for accuracy. Her other research interests are Machine Translation and Speech synthesis. She has been supervising Master and Ph.D thesis on Natural language processing such as Information Retrieval, Morphological Analysis, Part of Speech Tagging, Parsing, and Automatic Speech Recognition. She took part in the project of ASEAN MT, the machine translation project for South East Asian languages. She also participated in the projects of Myanmar Automatic Speech Recognition and HMM Based Myanmar Speech Synthesis (Text to Speech) that were the research collaboration between NICT, Japan and UCSY. She went to Universal Communication Research Institute, NICT, Kyoto, Japan as a visiting Researcher from June 2014 to March 2015.
Contact Information:
Dr. Win Pa Pa, Professor, Natural Language Processing lab, UCSY, Myanmar
Contact Email:
winpapa at ucsy -dot- edu -dot- mm
Participated in Projects
ASEAN MT, ASEAN languages Machine Translation Project, Organized by NECTEC Thailand, 2012-2015
VoiceTra, Multilingual Speech to Speech Translation, Organized by NICT Japan, 2014-2015
ALT, Asian Language Tree Bank Building, Organized by NICT Japan, 2015-2016
uniTRANS, ASEAN languages Speech to Speech Translation Project, Organized by I2R Singapore, 2016-2018
Demo
Myanmar Speech Synthesis (Text-to-Speech)
"Introduction to Multilingual Speech to Speech Translation, VoiceTra", 24th November 2015, UCSY, Myanmar
"ASEAN Machine Translation", Myanmar Software Showcase, 2-3 April 2015, Nay Pyi Taw, Myanmar
"Myanmar Printed Character Recognition", Opening Ceremony of Myanmar Institute of Information Technology, November 2015, Mandalay, Myanmar,
Myanmar Word Segmentation
Her Ph.D Thesis, "Myanmar Word Segmentation" is applying in Myanmar NLP applications by UCSY and it can be used online. It is N-gram based Word Segmentation and used her own corpus (50,000 words) collected manually from Myanmar Text Books, Newspapers, and Journals. The corpus is in both Zawgyi and Unicode encoding.
Activities
Editor of PACLING2017, Springer CCIS Volume 781
Organizer
Reviewer
Students
San Pa Pa Aung (Ph.D) (2019-2022)
Aye Mya Hlaing (Ph.D) (2016-2020)
Hnin Thuzar Aye (Ph.D) (2015-2020)
Aye Nyein Mon (Ph.D) (2015-2019)
Win Lai Lai Phyu (Ph.D) (2017-2024)
Yamin Thu (Ph.D) (2017-)
Hay Man Oo (Ph.D) (2017-)
Lwin Lwin Mar(Ph.D) (2017-)
Nan Kham Htwe (Ph.D) (2020-2024)
Myat Aye Aye Aung (MCSc.) (2016-2018)
Hay Mar Su Aung (MCSc.) (2019-2022)
Thazin Win(M.C.Sc.)(2019-2022)
Publications
2024
Nang Kham Htwe, Win Pa Pa, "Multimodel Generative Model based Text-to-Image Synthesis", International Journal of Intelligent Engineering and Systems, Vol.17, No.2, 2024
2023
Hay Mar Soe Naing, Win Pa Pa, "A Large Vocabulary End-to-End Myanmar Automatic Speech Recognition", M3Oriental Workshop of ACM Multimedia Asia 2023, December 2023, Taiwan
Win Lai Lai Phyu, Hay Mar Soe Naing, Win Pa Pa, "Improving the Performance of Low-resourced Speaker Identification with Data Preprocessing", Journal of ICT Research and Applications, 17(3), 275-291
Myat Aye Aye Aung, Win Pa Pa , "M-Diarization: A Myanmar Speaker Diarization using Multi-scale dynamic weights", In Proceedings of The 21st Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (Oriental COCOSDA 2023), December 2023, Delhi, India
Aye Nyein Mon, Hnin Thida Kyaw, Win Pa Pa , "Improving a Rakhine ASR with Subspace Gaussian Mixture model (SGMM)", Proceedings of ICCR2023, October 2023, Daejeon, Korea
Nwe Nwe Win, Win Pa Pa, "Dependency Annotated Dataset of The Myanmar Language", Proceedings of ICCR2023, October 2023, Daejeon, Korea
Myat Aye Aye Aung, Win Pa Pa, Hay Mar Soe Naing, "Speaker Diarization for Multiple Speaker datasets using a neural diarizer", Proceedings of ICCR2023, October 2023, Daejeon, Korea
Lwin Lwin Mar, Win Pa Pa, Tin Lay Nwe, "Study for Burmese Speech Emotion Recognition", In Proceedings of ICCA2023, February 2023, Myanmar
San Pa Pa Aung, Win Pa Pa, "EfficientNetB7 and Bi-LSTM with GloVe Vector Based Myanmar Image Captioning", International Journal of Intelligent Engineering and Systems, Vol.16, No.1, 2023
2022
Chenchen Ding, Win Pa Pa, Masao Utiyama, Eiichiro Sumita, "Transliteration of Foreign Words in Burmese: Descriptions by a Mortise-and-Tenon Notation". O-COCOSDA 2022
Aye Mya Hlaing, Win Pa Pa, "MyanBERTa: A Pre-trained Language Model For Myanmar", In Proceedings of 2022 International Conference on Communication and Computer Research (ICCR2022), November 2022, Seoul, Republic of Korea
San Pa Pa Aung, Win Pa Pa, "Comparison of NASNetLarge and NASNetMobile as Encoder for Myanmar Image Caption Generation", In Proceedings of 2022 International Conference on Communication and Computer Research (ICCR2022), November 2022, Seoul, Republic of Korea
Nang Kham Htwe, Win Pa Pa, "Generative Adversarial Networks for Myanmar Text to Image Synthesis", In Proceedings of 2022 International Conference on Communication and Computer Research (ICCR2022), November 2022, Seoul, Republic of Korea
Hay Mar Soe Naing, Win Pa Pa, "Improving Myanmar Automatic Speech Recognition with End-to-End Technique", In Proceedings of 2022 International Conference on Communication and Computer Research (ICCR2022), November 2022, Seoul, Republic of Korea
Eaint Thet Hmu Soe, Win Pa Pa , "Utilizing RoBERTa Intermediate Layers and Fine-Tuning for Sentence Classification", In Proceedings of 2022 International Conference on Communication and Computer Research (ICCR2022), November 2022, Seoul, Republic of Korea
2021
Toshiaki Nakazawa, Hideki Nakayama, Chenchen Ding, Raj Dabre, Shohei Higashiyama, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Chenhui Chu, Akiko Eriguchi, Kaori Abe, Yusuke Oda, Sadao Kurohash, "Overview of the 8th workshop on Asian translation", Proceedings of the 8th Workshop on Asian Translation (WAT2021)
Saw Win, Win Pa Pa, "MyanmarBERT: Myanmar Pre-trained Language Model using BERT", In Proceedings of ICCA2021, pp. 402-407, February 2021, Myanmar
San Pa Pa Aung, Win Pa Pa, "Improving Myanmar Image Caption Generation Using NASNetLarge and Bi-directional LSTM", In Proceedings of ICCA2021, February 2021, Myanmar, 164-169
Nan Kham Htwe, Win Pa Pa, "Building Annotated Image Dataset for Myanmar Text to Image Synthesis", In Proceedings of ICCA2021, February 2021, Myanmar, 170-175
Lwin Lwin Mar, Win Pa Pa, Tin Lay Nwe, "BMISEC: Corpus of Burmuse Emotion Speech", In Proceedings of ICCA2021, February 2021, Myanmar, 302-307
2020
San Pa Pa Aung, Win Pa Pa, Tin Lay Nwe, "Automatic Myanmar Image Captioning using CNN and LSTM-Based Language Model", 1st Joint SLTU and CCURL Workshop (SLTU-CCURL 2020), Merseille, pp 139–143, France, 2020 (The proceedings is available here)
Aye Mya Hlaing, Win Pa Pa, "Word Representations for Neural Network Based Myanmar Text-to-Speech System", International Journal of Intelligent Engineering and Systems (INASS), Vol.13, No.2, pp 239-249, April 2020
Chenchen Ding, Sann Su Su Yee, Win Pa Pa, Khin Mar Soe, Masao Utiyama, and Eiichiro Sumita. "A Burmese (Myanmar) Treebank: Guideline and Analysis", ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Vol. 19 Issue 3, Article No. 40, January 2020
Win Lai Lai Phyu, Win Pa Pa, "Building Speaker Identification Dataset for Noisy Conditions", The Proceedings of 18th IEEE International Conference on Computer Application (ICCA2020), 27-28 February 2020, Yangon, Myanmar
Yamin Thu, Win Pa Pa, "Generating Myanmar News Headlines using Recursive Neural Network", The Proceedings of 18th IEEE International Conference on Computer Application(ICCA2020), 27-28 February 2020, Yangon, Myanmar
Myat Aye Aye Aung, Win Pa Pa, "Time Delay Neural Network for Myanmar Automatic Speech Recognition", The Proceedings of 18th IEEE International Conference on Computer Application(ICCA2020), 27-28 February 2020, Yangon, Myanmar
Hay Mar Su Aung, Win Pa Pa, "Analysis of Word Vector Representation Techniques with Machine-Learning Classifiers for Sentiment Analysis of Public Facebook Page’s Comments in Myanmar Text", The Proceedings of 18th IEEE International Conference on Computer Application(ICCA2020), 27-28 February 2020, Yangon, Myanmar
Hay Man Oo, Win Pa Pa, "Myanmar News Retrieval in Vector Space Model using Cosine Similarity Measure", The Proceedings of 18th IEEE International Conference on Computer Application(ICCA2020), 27-28 February 2020, Yangon, Myanmar
2019
Toshiaki Nakazawa, Nobushige Doi, Shohei Higashiyama, Chenchen Ding, Raj Dabre, Hideya Mino, Isao Goto, Win Pa Pa, Anoop Kunchukuttan, Shantipriya Parida, Ondřej Bojar, Sadao Kurohashi, "Overview of the 6th workshop on Asian translation", Proceedings of the 6th Workshop on Asian Translation, November 2019, Hongkong
Yimon Shwe Sin, Win Pa Pa, Khin Mar Soe, "UCSYNLP-Lab Machine Translation Systems for WAT 2019", Proceedings of the 6th Workshop on Asian Translation, November 2019, Hongkong
Aye Mya Hlaing, Win Pa Pa, "Sequence-to-Sequence Models for Grapheme-to-Phoneme Conversion on Large Myanmar Pronunciation Dictionary", In Proceedings of The 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (Oriental COCOSDA 2019), October 2019, Cebu, Philippines
Aye Mya Hlaing, Win Pa Pa, Ye Kyaw Thu, "Enhancing Myanmar Speech Synthesis with Linguistic Information and LSTM-RNN", In Proceedings of 10th ISCA Speech Synthesis Workshop (SSW10), September 2019, Vienna, Austria
Chenchen Ding, Hnin Thu Zar Aye, Win Pa Pa, Khin Thandar Nwet, Khin Mar Soe, Masao Utiyama, Eiichiro Sumita, "Towards Burmese(Myanmar) Morphological Analysis: Syllable-based Tokenization and Part-of-epeech Tagging", ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), Volume 19 Issue 1, July 2019
Lwin Lwin Mar, Win Pa Pa, Tin Lay Nwe, "Dataset for Depresession Detection from Speech Emotion Recogniton", In Proceedings of The 11th International Conference on Future Computer and Communication (ICFCC 2019), 27-28 February 2019, Yangon, Myanmar
Win Lai Lai Phyu, Win Pa Pa "Text Independent Speaker Recognition for Myanmar Speech", The Proceedings of 17th International Conference on Computer Application(ICCA2019), 27-28 February 2019, Yangon, Myanmar
Aye Nyein Mon, Win Pa Pa, Ye Kyaw Thu, "Improving Myanmar Automatic Speech Recognition with Optimization of Convolutional Neural Network Parameters", International Journal on Natural Language Computing (IJNLC), Vol 7, No 6, December, 2018
2018
Aye Mya Hlaing, Win Pa Pa, Ye Kyaw Thu, "DNN-based Myanmar Text-to-Speech", In Proceedings of The 6th international workshop on spoken language technologies for under-resourced languages(SLTU'18), 29-31 August 2018, Gurugram, India
Hnin Thu Zar Aye, Win Pa Pa, and Ye Kyaw Thu, "Unsupervised Dependency Corpus Annotation for Myanmar Language", In Proceedings of The 21st Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (Oriental COCOSDA 2018), May 2018, Miyazaki, Japan
Hay Mar Soe Naing, Win Pa Pa, "Automatic Speech Recognition on Spontaneous Interview Speech", The Proceedings of 16th International Conference on Computer Application(ICCA2018), pages 203-208. Yangon, Myanmar, 22-23 February 2018
Aye Mya Hlaing, Win Pa Pa, and Ye Kyaw Thu, "Word-based Myanmar Text-to-Speech", The Proceedings of 16th International Conference on Computer Application(ICCA2018), pages 185-189. Yangon, Myanmar, 22-23 February 2018
2017
Aye Nyein Mon, Win Pa Pa, Ye Kyaw Thu and Yoshinori Sagisaka, "Developing a Speech Corpus from Web News for Myanmar (Burmese) Language", In Proceedings of The 20th Conference of the Oriental Chapter of the International Coordinating Committee on Speech Databases and Speech I/O Systems and Assessment (Oriental COCOSDA 2017), November 1-3, 2017, Seoul National University Hoam Faculty House, Convention center, Seoul, R.O. Korea, pp. 75-80
Aye Nyein Mon, Win Pa Pa, and Ye Kyaw Thu,"Exploring the Effect of Tones for Myanmar Language Speech Recognition Using Convolutional Neural Network (CNN)", In Proceedings of 15th International Conference of the Pacific Association for Computational Linguistics (PACLING), 2017, Yangon, Myanmar, pp. 334-345
Haymar Soe Naing and Win Pa Pa, "Speaker Adaptation on Myanmar Spontaneous Speech Recognition", In Proceedings of 15th International Conference of the Pacific Association for Computational Linguistics (PACLING), 2017, Yangon, Myanmar.
Aye Mya Hlaing, Win Pa Pa, and Ye Kyaw Thu, "Myanmar Number Normalization for Text-to-Speech", In Proceedings of 15th International Conference of the Pacific Association for Computational Linguistics (PACLING), 2017, Yangon, Myanmar, pp. 346-356.
Chenchen Ding, Win Pa Pa, Masao Utiyama, Eiichiro Sumita, "Burmese (Myanmar) Name Romanization: A Sub-syllabic Segmentation Scheme for Statistical Solutions", In Proceedings of 15th International Conference of the Pacific Association for Computational Linguistics (PACLING), 2017, Yangon, Myanmar
Khin War War Htike, Ye Kyaw Thu, Zuping Zhang, Win Pa Pa, Yoshinori Sagisaka and Naoto Iwahashi, "Comparison of Six POS Tagging Methods on 10K Sentences Myanmar Language (Burmese) POS Tagged Corpus", at 18th International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2017), April 17~23, 2017, Budapest, Hungary. [Poster] [myPOS Corpus]
Hnin Thu Zar Aye, Chenchen Ding, Win Pa Pa, Khin Thandar Nwet, Masao Utiyama, Eiichiro Sumita, "English-to-Myanmar Statistical Machine Translation using Lanuguage Model on Part-of-Speech in Decoding", The Proceedings of 15th International Conference on Computer Application (ICCA2017), pages 409-414. Yangon, Myanmar, 16-17 February
Aye Nyein Mon, Win Pa Pa, Ye Kyaw Thu, "Building HMM-SGMM Continuous Automatic Speech Recognition on Myanmar Web News". The Proceedings of 15th International Conference on Computer Application(ICCA2017), pages 446-453. Yangon, Myanmar, 16-17 February
2016
Ye Kyaw Thu, Win Pa Pa, Yoshinori Sagisaka, Naoto Iwahashi, "Comparison of Grapheme-to-Phoneme conversion Methods on a Myanmar Pronunciation Dictionary", The Proceedings of the 6th Workshop on South and Southeast Asian Natural Language Processing (WSSANLP2016), pages 11–22, Osaka, Japan, December 11-17 2016
Haymar Hnin, Win Pa Pa, Ye Kyaw Thu, "Back-Propagation Neural Network to Myanmar Part-of-Speech Tagging", "The 10th International Conference on Genetic and Evolutionary Computing (ICGEC 2016)", 7-9 November 2016, China
Ye Kyaw Thu, Win Pa Pa, Masao Utiyama, Andrew Finch, and Eiichiro Sumita, "Introducing the Asian Language Treebank (ALT)", The 10th edition of the Language Resources and Evaluation Conference, 23-28 May 2016, Portorož (Slovenia)
Win Pa Pa, Ye Kyaw Thu, Andrew Finch, and Eiichiro Sumita, "A Study of Statistical Machine Translation Methods for Under Resourced Languages", 5th Workshop on Spoken Language Technologies for Under-resourced Languages (SLTU Workshop), 09-12 May, 2016, Yogyakarta, Indonesia
Ye Kyaw Thu, Andrew Finch, Win Pa Pa, Khin War War Htike and Eiichiro Sumita, "String to Tree and Tree to String Statistical Machine Translation for Myanmar Language", In Proceedings of ICCA2016, February 25-26, 2016, Yangon, Myanmar, pp.249-257
Ye Kyaw Thu, Andrew Finch, Win Pa Pa, and Eiichiro Sumita, "A Large-scale Study of Statistical Machine Translation Methods for Myanmar Language", In Proceedings of SNLP2016, February 10-12, 2016, Phranakhon Si Ayutthaya, Thailand. [Paper]
2015
Hay Mar Soe Naing, Aye Mya Hlaing, Win Pa Pa, Xinhui Hu, Ye Kyaw Thu, Chiori Hori and Hisashi Kawai, "A Myanmar Large Vocabulary Continuous Speech Recognition System", In Proceedings of APSIPA Annual Summit and Conference (APSIPA ASC 2015), December 16–19, 2015, Hong Kong, pp. 320-327
Ye Kyaw Thu, Win Pa Pa, Jinfu Ni, Yoshinori Shiga, Andrew Finch, Chiori Hori, Hisashi Kawai, Eiichiro Sumita, "HMM Based Myanmar Text to Speech System", In Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015), September 6-10, 2015, Dresden, Germany, pp. 2237-2241
Win Pa Pa, Ye Kyaw Thu, Andrew Finch, Eiichiro Sumita, "Word Boundary Identification for Myanmar Text Using Conditional Random Fields", In Proceedings of the Ninth International Conference on Genetic and Evolutionary Computing (ICGEC 2015), August 26-28, 2015, Yangon, Myanmar, pp. 447-456.
Ye Kyaw Thu, Win Pa Pa, Andrew Finch, Jinfu Ni, Eiichiro Sumita and Chiori Hori, 2015, "The Application of Phrase Based Statistical Machine Translation Techniques to Myanmar Grapheme to Phoneme Conversion", In Proceedings of the Pacific Association for Computational Linguistics Conference (PACLING 2015), May 19~21, 2015, Legian, Bali, Indonesia, pp. 170-176. Springer Communication in Computer and Information Science (CCIS), ISSN:1865-0929, pp. 238-250
Hay Mar Soe Naing, Ye Kyaw Thu, Win Pa Pa, Hiroaki Kato, Andrew Finch, Eiichiro Sumita, Chiori Hori. "Rule Based Katakana to Myanmar Transliteration for Post-editing Machine Translation", 言語処理学会第21回年次大会 発表論文集, March 16~21, 2015, Kyoto, Japan, pp. 257-260.
Ye Kyaw Thu, Win Pa Pa, Andrew Finch, Aye Mya Hlaing, Hay Mar Soe Naing, Eiichiro Sumita and Chiori Hori, "Syllable Pronunciation Features for Myanmar Grapheme to Phoneme Conversion", In Proceedings of the 13th International Conference on Computer Applications (ICCA 2015), February 5~6, 2015, Yangon, Myanmar, pp. 161-167. [Best Paper Award]
2009
Win Pa Pa, Ni Lar Thein, "Disambiguation in Myanmar Word Segmentation" , Proceedings of 7th International Conference on Computer Applications, 2009, Yangon
Win Pa Pa, Ni Lar Thein, "Myanmar Word Segmentation using a Combined Model " , e-Case 2009, January, 2009
2008
Win Pa Pa, Ni Lar Thein, "Myanmar Word Segmentation using Hybrid Approach" , Proceedings of 6th International Conference on Computer Applications (ICCA), 2008, Yangon, pp-166-170
Workshops
2016
"Comparison of different approaches for Myanmar Grapheme to Phoneme conversion", The Sixth Workshop on Natural Language Processing, ICCA2016, February 25-26, 2016, Yangon, Myanmar.
2012
"Myanmar Information Retrieval", The Second Joint Workshop of Natural Language Processing and Speech Recognition between UCSY, Myanmar and Waseda University, Japan, ICCA2012, Yangon
2011
"Overview of Myanmar Language Analysis", Joint Workshop of Natural Language Processing and Speech Recognition between UCSY, Myanmar and Waseda University, Japan, May 2011, ICCA2011, Yangon, Myanmar
2010
"Natural Language Processing from the Perspective of Myanmar Language", 2010, at NLP Workshop, MCF, Yangon
Local Conferences
2015
Win Pa Pa, “Developing Myanmar Automatic Speech Recognition and Text to Speech Systems”, MCPA Developer Conference 2015, 21-22 November 2015.
2013
Thet Thet Tun, Win Pa Pa, " Myanmar Morphological Analyzer " , Proceedings of The seventh Conference on Parallel and Soft Computing, December 2013, Yangon, UCSY
2012
Thuzar Tun, Win Pa Pa, " Myanmar Information Retrieval with Genetic Algorithms " , Proceedings of The sixth Conference on Parallel and Soft Computing, December 2012, Yangon, UCSY
2011
Shally Soe Than, Win Pa Pa, " Automatic Mining of Verb Affixes for Myanmar Language with unsupervised learning by using N-Grams " , Proceedings of The fifth Conference on Parallel and Soft Computing, December 2011, Yangon, UCSY, p-283
Win Pa Pa, “Myanmar Word Segmentation”, MCPA Developer Conference, 20th August, 2011.