Scientific publications

Information about who has cited these papers can be found at my Google Scholar page.

Theses

  • Vesa Siivola: "Language models for automatic speech recognition: construction and complexity control", Ph.D. thesis, Helsinki University of Technology, 2007. (link)

Journal articles

  • Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraclar and Andreas Stolcke, "Morph-Based Speech Recognition and Modeling of Out-of-Vocabulary Words Across Languages", ACM Transactions on Speech and Language Processing, 5(1):Article No. 3, 2007 (link to publisher's site)
  • Vesa Siivola and Teemu Hirsimäki and Sami Virpioja, "On Growing and Pruning Kneser-Ney Smoothed N-Gram Models"IEEE Transactions on Speech, Audio and Language Processing, 15(5):1617-1624, 2007. ©2007 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. (pdf)
  • Teemu Hirsimäki, Mathias Creutz, Vesa Siivola, Mikko Kurimo, Sami Virpioja and Janne Pylkkönen: "Unlimited vocabulary speech recognition with morph language models applied to Finnish"Computer Speech and Language 20(4):515--541, 2006. (pdf)

Published conference papers

  • Vesa Siivola, Bryan Pellom and Meagan Sills: "Language Identification for Text Chats", Proceedings of Interspeech'11, 2011. (pdf)
  • Vesa Siivola, Mathias Creutz and Mikko Kurimo: "Morfessor and VariKN machine learning tools for speech and language technology", Proceedings of the 8th International Conference on Speech Communication and Technology (INTERSPEECH'07), 2007. (pdf)
  • Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo, Antti Puurula, Janne Pylkkönen, Vesa Siivola, Matti Varjokallio, Ebru Arisoy, Murat Saraclar and Andreas Stoclke, "Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages", In Proceedings of Human Language Technologies / The Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL-HLT'07), Rochester, NY, USA, 23-25 April, pages 380-387, 2007. (pdf)
  • Mikko Kurimo, Antti Puurula, Ebru Arisoy, Vesa Siivola, Teemu Hirsimäki, Janne Pylkkönen, Tanel Alumae and Murat Saraclar. "Unlimited vocabulary speech recognition for agglutinative languages", In Human Language Technology, Conference of the North American Chapter of the Association for Computational Linguistics (HLT-NAACL'06), 2006. (pdf)
  • Vesa Siivola and Bryan Pellom: "Growing an n-gram model", Proceedings of the 9th European Conference on Speech Communication and Technology (INTERSPEECH'05), pp. 1309-1312, 2005. (pdf)
  • Teemu Hirsimäki, Mathias Creutz, Vesa Siivola and Mikko Kurimo: "Morphologically Motivated Language Models in Speech Recognition", International and Interdisciplinary conference on adaptive knowledge representation and reasoning, 2005. (pdf)
  • Vesa Siivola: "Building compact language models incrementally", Second Baltic Conference on Human Language Technologies, pp. 183-188, 2005. (pdf)
  • Vesa Siivola and Antti Honkela: "A State-Space Method for Language Modeling", IEEE Workshop on Automatic Speech Recognition and Understanding, pages 548-553, 2003. ©2003 IEEE. Personal use of this material is permitted. However, permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the IEEE. (pdf)
  • Vesa Siivola, Teemu Hirsimäki, Mathias Creutz and Mikko Kurimo: "Unlimited Vocabulary Speech Recognition Based on Morphs Discovered in an Unsupervised Manner", Proceedings of the 8th European Conference on Speech Communication and Technology (EUROSPEECH'03), pp. 2293-2296, 2003. (pdf)
  • Vesa Siivola, Teemu Hirsimäki and Mikko Kurimo: "Äännemallien vertailua jatkuvassa suuren sanaston puheentunnistuksessa", Fonetiikan päivät, pp. 75-82, 2002. (pdf)
  • Vesa Siivola, Mikko Kurimo and Krista Lagus: "Large Vocabulary Statistical Language Modeling for Continuous Speech Recognition in Finnish", Proceedings of the 7th European Conference on Speech Communication and Technology (EUROSPEECH'01), pp. 737-740, 2001. (pdf)

Technical reports

  • Vesa Siivola: "Language modeling based on neural clustering of words" (ps), Technical report IDIAP-COM 00-02, IDIAP, Martigny, Switzerland, 2000 (made while visiting IDIAP, January-March 2000)

Some works from the courses I've taken that may be of use

  • Vesa Siivola: "Speech Synthesis by Concatenating Maximally Fitting Phones", project work for the seminar course on Sound synthesis, 2002 (short version ps)
  • Vesa Siivola: "A survey of methods for the synthesis of the singing voice", paper for the seminar course on Sound synthesis, 2002 (ps)
  • Vesa Siivola: "Segmentation of an audio signal into sentences by temporal features", excercise for the seminar course on Audio mining, 2002 (ps)
  • Vesa Siivola: "Puheäänityksen jako foneemeihin käsikirjoituksen perusteella", 2001 (ps)