PhD Dissertation
Alexandros Lazaridis, "Prosody modelling using machine learning techniques for neutral and emotional speech synthesis", 2011.
Publications in Journals
Theodoros Theodorou, Iosif Mporas, Alexandros Lazaridis, and Nikos Fakotakis. Data-driven audio feature space clustering for automatic sound recognition in radio broadcast news. International Journal on Artificial Intelligence Tools, Vol. 26, No. 2 (2017). [ bib | pdf]
Milos Cernak, Štefan Beňuš, and Alexandros Lazaridis. Speech vocoding for laboratory phonology. Computer Speech and Language, 42: 100-121 (2017). [ bib | .pdf ]
Milos Cernak, Alexandros Lazaridis, Afsaneh Asaei and Philip N. Garner, Composition of Deep and Spiking Neural Networks for Very Low Bit Rate Speech Coding. in: IEEE/ACM Trans. on Audio, Speech and Language Processing, 2016 (pdf).
Milos Cernak, Philip N. Garner, Alexandros Lazaridis, Petr Motlicek and Xingyu Na, Incremental Syllable-Context Phonetic Vocoding. In: IEEE/ACM Transactions on Audio Speech and Language Processing, 23(6), June 2015
Alexandros Lazaridis and Iosif Mporas, "Evaluation of Hidden semi Markov Models Training Methods for Greek Emotional Text-to-Speech Synthesis", International Journal of Information Technology and Computer Science (IJITCS), vol. 5, no. 4, March 2013, pp. 23-29.
Alexandros Lazaridis, Iosif Mporas and Todor Ganchev, "Phone Duration Modeling of Affective Speech using Support Vector Regression", International Journal of Intelligent Systems and Applications (IJISA), vol. 4, no. 8, July 2012, pp. 1-9.
Alexandros Lazaridis, Todor Ganchev, Iosif Mporas, Evaggelos Dermatas, Nikos Fakotakis, "Two-Stage Phone Duration Modelling with Feature Construction and Feature Vector Extension for the Needs of Speech Synthesis", Computer Speech & Language, Vol 26:4, August 2012, pp. 274-292. (pdf)
Alexandros Lazaridis, Iosif Mporas, Todor Ganchev, George Kokkinakis, Nikos Fakotakis, "Improving Phone Duration Modelling using Support Vector Regression Fusion", Speech Communication 53(1), 85-97, 2011. (pdf)
Alexandros Lazaridis, Todor Ganchev, Theodoros Kostoulas, Iosif Mporas, Nikos Fakotakis, "Phone Duration Modeling: Overview of Techniques and Performance Optimization via Feature Selection in the context of Emotional Speech", International Journal of Speech Technology, vol. 13, no. 3, pp. 175-188, 2010. (pdf)
Alexandros Lazaridis, Basiliki Bourna and Nikos Fakotakis, "Comparative Evaluation of Phone Duration Models for Greek Emotional Speech", Journal of Computer Science 6 (3): 341-349, 2010. (pdf)
Publications in Book Chapters
Malo Grisard, Qingran Zhan, Petr Motlicek, Wissem Allouchi, Michael Baeriswyl, and Alexandros Lazaridis. Spoken language identification using language bottleneck features. In Text, speech and dialogue, TSD 2019, Lecture Notes in Computer Science. Springer Berlin Heidelberg, 2019.
Alexandros Lazaridis, Blaise Potard and Philip N. Garner, DNN-based Speech Synthesis: Importance of input features and training data, in: International Conference on Speech and Computer, SPECOM, pages 193-200, Springer Berlin Heidelberg, 2015 (pdf)
Theodoros Kostoulas, Todor Ganchev, Alexandros Lazaridis, Nikos Fakotakis, "Enhancing Emotion Recognition from Speech through Feature Selection", TSD 2010, Lecture Notes in Computer Science, Springer Berlin / Heidelberg, pp. 338-344, 2010. (pdf)
Alexandros Lazaridis, Todor Ganchev, Iosif Mporas, Theodoros Kostoulas and Nikos Fakotakis, "Feature Selection for Improved Phone Duration Modeling of Greek Emotional Speech", SETN 2010, Advances in Artificial Intelligence, Lecture Notes in Computer Science, Springer Berlin / Heidelberg, pp. 357-362. (pdf)
Todor Ganchev, Alexandros Lazaridis, Iosif Mporas, Nikos Fakotakis, "Performance Evaluation for Voice Conversion Systems", Text, Speech and Dialogue 2008, Lecture Notes in Computer Science, Springer Berlin/ Heidelberg, pp. 317-324. (pdf)
Theodoros Kostoulas, Iosif Mporas, Todor Ganchev, Nikos Katsaounos, Alexandros Lazaridis, Stavros Ntalampiras, Nikos Fakotakis, "LOGOS: A Multimodal Dialogue System for Controlling Smart Appliances", KES IIMSS 2008, New Directions in Intelligent Interactive Multimedia, Studies in Computational Intelligence (142), pp. 585-594. (pdf)
Publications in Conferences
An Evaluation Benchmark for Automatic Speech Recognition of German-English Code-Switching, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, 2021
Learning to Translate Low-Resourced Swiss German Dialectal Speech into Standard German Text, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: IEEE Automatic Speech Recognition and Understanding Workshop, Cartagena, Colombia, IEEE, 2021
Modeling Dialectal Variation for Swiss German Automatic Speech Recognition, Abbas Khosravani, Philip N. Garner and Alexandros Lazaridis, in: Proceedings of Interspeech, 2021
Comparison of Subword Segmentation Methods for Open-vocabulary ASR using a Difficulty Metric, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis
Comparison of Subword Segmentation Methods for Open-Vocabulary End-to-End Speech Recognition, Abbas Khosravani, Claudiu Musat, Philip N. Garner and Alexandros Lazaridis, Idiap-RR-34-2020
Lorenzo Tarantino, Philip N. Garner, and Alexandros Lazaridis. Self-attention for speech emotion recognition. In Proceedings of Interspeech, 2019
Gaetan Ramet, Philip N. Garner, Michael Baeriswyl, and Alexandros Lazaridis. Context-aware attention mechanism for speech emotion recognition. In 2018 IEEE Spoken Language Technology Workshop, SLT 2018, Athens, Greece, December 18-21, 2018, pages 126–131, 2018.
Alexandros Lazaridis, Ivan Himawan, Petr Motlicek, Iosif Mporas, and Philip N. Garner. Investigating cross-lingual multi-level adaptive networks: The importance of the correlation of source and target languages. In Proceedings of International Workshop on Spoken Language Translation, 2016. [ bib ]
Alexandros Lazaridis, Milos Cernak, Pierre-Edouard Honnet, Philip N. Garner, Investigating Spectral Amplitude Modulation Phase Hierarchy Features in Speech Synthesis, in: 9th ISCA Speech Synthesis Workshop (SSW9), 2016 (pdf).
Jean-Philippe Goldman, Pierre-Edouard Honnet, Rob Clark, Philip N. Garner, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Tiago Macedo, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli and Junichi Yamagishi, The SIWIS database: a multilingual speech database with acted emphasis, in: Proceedings of Interspeech, San Francisco, USA, 2016 (pdf).
Alexandros Lazaridis, Milos Cernak and Philip N. Garner, Probabilistic Amplitude Demodulation Features in Speech Synthesis for Improving Prosody, in: Proceedings of Interspeech, San Francisco, USA, 2016 (pdf).
Serife Kucur Ergunay, Elie Khoury, Alexandros Lazaridis and Sébastien Marcel. On the vulnerability of speaker verification to realistic voice spoofing. In Proceedings of IEEE International Conference on Biometrics: Theory, Applications and Systems, September 2015. (pdf)
Milos Cernak, Alexandros Lazaridis, Philip N. Garner, and Petr Motlicek. Stress and accent transmission in HMM-based syllable-context very low bit rate speech coding. In Proceedings of Interspeech, Singapore, September 2014. (pdf)
Alexandros Lazaridis, Elie Khoury, Jean-Philippe Goldman, Mathieu Avanzi, Sébastien Marcel, and Philip N. Garner. Swiss French regional accent identification. In Proceedings of Odyssey 2014: The Speaker and Language Recognition Workshop, Joensuu, Finland, June 2014. (pdf)
Alexandros Lazaridis, Pierre-Edouard Honnet, and Philip N. Garner. SVR vs MLP for phone duration modelling in HMM-based speech synthesis. In Proceedings of the 7th Speech Prosody Conference, Dublin, Ireland, May 2014. (pdf)
Pierre-Edouard Honnet, Alexandros Lazaridis, Jean-Philippe Goldman, and Philip N. Garner. Prosody in Swiss French accents: Investigation using analysis by synthesis. In Proceedings of the 7th Speech Prosody Conference, Dublin, Ireland, May 2014. (pdf)
Alexandros Lazaridis and Philip N. Garner. Syllable-based regional Swiss French accent identification using prosodic features. Nouveaux cahiers de linguistique française, 31, 2014. 3rd Swiss Workshop on Prosody, Geneva, September 2014. (pdf)
Philip N. Garner, Rob Clark, Jean-Philippe Goldman, Pierre-Edouard Honnet, Maria Ivanova, Alexandros Lazaridis, Hui Liang, Beat Pfister, Manuel Sam Ribeiro, Eric Wehrli, and Junichi Yamagishi. Translation and prosody in Swiss languages. Nouveaux cahiers de linguistique française, 31, 2014. 3rd Swiss Workshop on Prosody, Geneva, September 2014. (pdf)
Alexandros Lazaridis, Iosif Mporas, Todor Ganchev, Nikos Fakotakis, "Support Vector Regression Fusion Scheme in Phone Duration Modeling", 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4732-4735, 22-27 May 2011. (pdf)
Alexandros Lazaridis, Theodoros Kostoulas, Todor Ganchev, Iosif Mporas, Nikos Fakotakis, "VERGINA: A modern Greek speech database for speech synthesis", LREC-2010, Malta, May 19-21, 2010, pp. 117-121. (pdf)
Dimitrios P. Lyras, George Kokkinakis, Alexandros Lazaridis, Kyriakos Sgarbas, Nikos Fakotakis, "A Large Greek-English Dictionary with Incorporated Speech and Language Processing Tools", Interspeech 2009, 10th Annual Conference of the International Speech Communication Association, Brighton, UK, 6-10 September 2009, pp. 1891-1894. (pdf)
Iosif Mporas, Alexandros Lazaridis, Todor Ganchev, Nikos Fakotakis, "Using Hybrid HMM-based Speech Segmentation to improve Synthetic Speech Quality", In Proceedings of the 13th Pan-Hellenic Conference on Informatics, PCI 2009, Corfu, Greece, pp. 118-122. (pdf)
Vasiliki Bourna, Alexandros Lazaridis, Nikos Fakotakis, "Phone Duration Modeling for Greek Emotional Speech Synthesis", In Proceedings of the 13th International Conference "SPEECH and COMPUTER", SPECOM 2009, Saint-Petersburg, Russia, 21-25 June, pp. 190-195, 2009. (pdf)
Todor Ganchev, Alexandros Lazaridis, Iosif Mporas, Nikos Fakotakis, "Average Performance Loss Measure for Assessment of Voice Conversion Systems", In Proceedings of the 13th International Conference "SPEECH and COMPUTER", SPECOM 2009, Saint-Petersburg, Russia, 21-25 June, pp. 275-280, 2009. (pdf)
Alexandros Lazaridis, Theodoros Kostoulas, Iosif Mporas, Todor Ganchev, Nikos Katsaounos, Stavros Ntalampiras, Nikos Fakotakis, "Human Evaluation of the LOGOS Spoken Dialogue System", In Proceedings of the 1st International Conference on Pervasive Technologies Related to Assistive Environments, PETRA 2008, Athens, Greece, 15-19 July, 2008. (pdf)
Todor Ganchev, Otilia Kocsis, Nikos Katsaounos, Iosif Mporas, Alexandros Lazaridis, Theodoros Kostoulas, Stavros Ntalampiras, Dimitrios Lyras, George Papadopoulos, Kyriakos Sgarbas, Nikos Fakotakis, "Natural Spoken Dialogue Interaction: Technology, Tools, Resources and Applications", System Demonstrations, 18th European Conference on Artificial Intelligence, ECAI 2008, July 2008, pp. 33-34. (pdf)
Alexandros Lazaridis, Panagiotis Zervas, Nikos Fakotakis, George Kokkinakis, "A CART approach for Duration Modeling of Greek Phonemes", In Proceedings of the 12th International Conference "Speech and Computer", Moscow, Russia, October 2007, pp. 287-292. (pdf)
Alexandros Lazaridis, Panagiotis Zervas, George Kokkinakis, "Segmental duration modeling for Greek Speech Synthesis", In Proceedings of the 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI), Patras, Greece, 2007, pp. 518-521. (pdf)
Technical Reports
SIWIS (Swiss NSF), Spoken Interaction With Interpretation in Switzerland. Annual reports, 2013, 2014, 2015, 2016.
Alexandros Lazaridis, Nikos Katsaounos, "MoveOn deliverable D5.10.2: Report on the MoveOn TTS component", October 2007.
Thomas Winkler, Todor Ganchev, Theodoros Kostoulas, Iosif Mporas, Alexandros Lazaridis, Stavros Ntalampiras, Atta Badii, Rick Adderley, Christian Bonkowski, "MoveOn Deliverable D.5: Report on Audio databases, Noise processing environment, ASR and TTS modules", December 2007.