Michael's Publications

2019

Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition, Interspeech 2019 Graz Austria

Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon Challenging the Boundaries of Speech Recognition: The MALACH Corpus Interspeech 2019 Graz Austria

Xiaodong Cui, Michael Picheny Acoustic Model Optimization Based on Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition Interspeech 2019 Graz Austria

Zakaria Aldeneh, Mimansa Jaiswal, Michael Picheny, Melvin G. McInnis, Emily Mower Provost Identifying Mood Episodes Using Dialogue Features from Clinical Interviews Interspeech 2019 Graz Austria

Kartik Audhkhasi, George Saon, Zoltán Tüske, Brian Kingsbury, Michael Picheny Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition Interspeech 2019 Graz Austria

Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition, Interspeech 2019 Graz Austria

Samuel Thomas, Kartik Audhkhasi, Zoltán Tüske, Yinghui Huang, Michael Picheny Detection and Recovery of OOVs for Improved English Broadcast News Captioning Interspeech 2019 Graz Austria

S. Settle, K. Audhkhasi, K. Livescu and M. Picheny, "Acoustically Grounded Word Embeddings for Improved Acoustics-to-word Speech Recognition," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 5641-5645.

URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8682903&isnumber=8682151

S. Thomas et al., "English Broadcast News Speech Recognition by Humans and Machines," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 6455-6459.

URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8683211&isnumber=8682151

L. Sarı, S. Thomas, M. Hasegawa-Johnson and M. Picheny, "Pre-training of Speaker Embeddings for Low-latency Speaker Change Detection in Broadcast News," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 6286-6290.

URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8683612&isnumber=8682151

W. Zhang et al., "Distributed Deep Learning Strategies for Automatic Speech Recognition," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 5706-5710.

URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8682888&isnumber=8682151

2018

Xiaodong Cui, Wei Zhang, Zoltan Tuske, and Michael Picheny. Evolutionary Stochastic Gradient Descent for Optimizationof Deep Neural Networks Advances in Neural Information Processing Systems, pp. 6051-6061. 2018.

Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny Building Competitive DirectAcoustics-to-word models for English Conversational Speech Recognition 2018 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP)

2017

English Conversational Telephone Speech Recognition by Humans and Machines

Saon, George and Kurata, Gakuto and Sercu, Tom and Audhkhasi, Kartik and Thomas, Samuel and Dimitriadis, Dimitrios and Cui, Xiaodong and Ramabhadran, Bhuvana and Picheny, Michael and Lim, Lynn-Li and others

arXiv preprint arXiv:1703.02136, 2017

Kernel Approximation Methods for Speech Recognition

May, Avner and Garakani, Alireza Bagheri and Lu, Zhiyun and Guo, Dong and Liu, Kuan and Bellet, Aur{\'e}lien and Fan, Linxi and Collins, Michael and Hsu, Daniel and Kingsbury, Brian and others

arXiv preprint arXiv:1701.03577, 2017

Abstract

2016

A comparison between deep neural nets and kernel acoustic models for speech recognition

Lu, Zhiyun and Quo, Dong and Garakani, Alireza Bagheri and Liu, Kuan and May, Avner and Bellet, Aur{\'e}lien and Fan, Linxi and Collins, Michael and Kingsbury, Brian and Picheny, Michael and others

Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, pp. 5070--5074

Abstract

Language Modeling/Pronunciation Modeling

Picheny, Michael and Ramabhadran, Bhuvana and Chen, Stanley F and Nussbaum-Thom, Markus

2016 - ee.columbia.edu

Abstract

Advanced Language Modeling

Picheny, Michael and Ramabhadran, Bhuvana and Chen, Stanley F and Nussbaum-Thom, Markus

2016 - ee.columbia.edu

Abstract

Parallel deep neural network training for big data on blue gene/Q

Chung, I-Hsin and Sainath, Tara N and Ramabhadran, Bhuvana and Picheny, Michael and Gunnels, John and Austel, Vernon and Chauhari, Upendra and Kingsbury, Brian

IEEE Transactions on Parallel and Distributed Systems, IEEE, 2016

Abstract

On the importance of event detection for ASR

Haws, David and Dimitriadis, Dimitrios and Saon, George and Thomas, Samuel and Picheny, Michael

Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, pp. 5705--5709

Abstract

Training variance and performance evaluation of neural networks in speech

Berg, Ewout van den and Ramabhadran, Bhuvana and Picheny, Michael

arXiv preprint arXiv:1606.04521, 2016

Abstract

2015

Order-free spoken term detection

Mangu, Lidia and Saon, George and Picheny, Michael and Kingsbury, Brian

Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, pp. 5331--5335

Abstract

Multilingual representations for low resource speech recognition and keyword search

Cui, Jia and Kingsbury, Brian and Ramabhadran, Bhuvana and Sethy, Abhinav and Audhkhasi, Kartik and Cui, Xiaodong and Kislal, Ellen and Mangu, Lidia and Nussbaum-Thom, Markus and Picheny, Michael and others

Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on, pp. 259--266

Abstract

The IBM 2015 english conversational telephone speech recognition system

Saon, George and Kuo, Hong-Kwang J and Rennie, Steven and Picheny, Michael

arXiv preprint arXiv:1505.05899, 2015

Abstract

2014

Efficient spoken term detection using confusion networks

Mangu, Lidia and Kingsbury, Brian and Soltau, Hagen and Kuo, Hong-Kwang and Picheny, Michael

Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 7844--7848

Abstract

How to scale up kernel methods to be as good as deep neural nets

Lu, Zhiyun and May, Avner and Liu, Kuan and Garakani, Alireza Bagheri and Guo, Dong and Bellet, Aur{\'e}lien and Fan, Linxi and Collins, Michael and Kingsbury, Brian and Picheny, Michael and others

arXiv preprint arXiv:1411.4000, 2014

Abstract

Parallel deep neural network training for LVCSR tasks using blue gene/Q.

Sainath, Tara N and Chung, I-hsin and Ramabhadran, Bhuvana and Picheny, Michael and Gunnels, John A and Kingsbury, Brian and Saon, George and Austel, Vernon and Chaudhari, Upendra V

INTERSPEECH, pp. 1048--1052, 2014

Abstract

Unfolded recurrent neural networks for speech recognition.

Saon, George and Soltau, Hagen and Emami, Ahmad and Picheny, Michael

Interspeech, pp. 343--347, 2014

Abstract

2013

A high-performance Cantonese keyword search system

Kingsbury, Brian and Cui, Jia and Cui, Xiaodong and Gales, Mark JF and Knill, Kate and Mamou, Jonathan and Mangu, Lidia and Nolden, David and Picheny, Michael and Ramabhadran, Bhuvana and others

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8277--8281

Abstract

System combination and score normalization for spoken term detection

Mamou, Jonathan and Cui, Jia and Cui, Xiaodong and Gales, Mark JF and Kingsbury, Brian and Knill, Kate and Mangu, Lidia and Nolden, David and Picheny, Michael and Ramabhadran, Bhuvana and others

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8272--8276

Abstract

Developing keyword search under the IARPA Babel program

Mamou, Jonathan and Cui, Jia and Cui, Xiaodong and Gales, Mark JF and Kingsbury, Brian and Knill, Kate and Mangu, Lidia and Nolden, David and Picheny, Michael and Ramabhadran, Bhuvana and others

Proc. Afeka Speech Processing Conference, 2013

Abstract

Speaker adaptation of neural network acoustic models using i-vectors.

Saon, George and Soltau, Hagen and Nahamoo, David and Picheny, Michael

ASRU, pp. 55--59, 2013

Abstract

Developing speech recognition systems for corpus indexing under the IARPA Babel program

Cui, Jia and Cui, Xiaodong and Ramabhadran, Bhuvana and Kim, Janice and Kingsbury, Brian and Mamou, Jonathan and Mangu, Lidia and Picheny, Michael and Sainath, Tara N and Sethy, Abhinav

Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 6753--6757

Abstract

2011

Trends and advances in speech recognition

Michael Picheny, David Nahamoo, Vaibhava Goel, Brian Kingsbury, Bhuvana Ramabhadran, Steven J Rennie, George Saon

IBM Journal of Research and Development 55(5), 2--1, IBM, 2011

Deep belief networks using discriminative features for phone recognition

A.R. Mohamed, T.N. Sainath, G. Dahl, B. Ramabhadran, G.E. Hinton, M.A. Picheny

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pp. 5060--5063

2009

Effects of real-time transcription on non-native speaker's comprehension in computer-mediated communications

Y. Pan, D. Jiang, M. Picheny, Y. Qin

Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009

An exploration of large vocabulary tools for small vocabulary phonetic recognition

T.N. Sainath, B. Ramabhadran, M. Picheny

Automatic Speech Recognition \& Understanding, 2009. ASRU 2009. IEEE Workshop on, pp. 359--364

Cultural voice markers in speech-to-speech machine translation systems

O Stewart, M Picheny, D Lubensky, B Ramabhadran

Proceeding of the 2009 international workshop on Intercultural collaboration, pp. 313--316, ACM New York, NY, USA

Effects of real-time transcription on non-native speaker's comprehension in computer-mediated communications

Y.X. Pan, D.N. Jiang, M Picheny, Y Qin

ACM CHI, Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009

2007

Improvements in phone based audio search via constrained match with high order confusion estimates

U.V. Chaudhari, M. Picheny

Automatic Speech Recognition \& Understanding, 2007. ASRU. IEEE Workshop on, pp. 665--670

Voice-melody transcription under a speech recognition framework

D. Jiang, M. Picheny, Y. Qin

Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--617

Lattice-based viterbi decoding techniques for speech translation

G. Saon, M. Picheny

Automatic Speech Recognition \& Understanding, 2007. ASRU. IEEE Workshop on, pp. 386--389

2006

The 2006 TC-STAR evaluation of the IBM text-to-speech synthesis system

R. Fernandez, R. Bakis, E. Eide, W. Hamza, J. Pitrelli, M. Picheny

TC-STAR Workshop on Speech-to-Speech Translation (Barcelona, Spain), pp. 175--180, 2006

The IBM submission to the 2006 Blizzard text-to-speech challenge

E. Eide, R. Fernandez, R. Hoory, W. Hamza, Z. Kons, M. Picheny, A. Sagi, S. Shechtman, Z.W. Shuang

Blizzard Workshop, 2006

Concept-based speech-to-speech translation using maximum entropy models for statistical natural concept generation

L. Gu, Y. Gao, F.H. Liu, M. Picheny

Audio, Speech, and Language Processing, IEEE Transactions on 14(2), 377--392, IEEE, 2006

Towards pooled-speaker concatenative text-to-speech

E.M. Eide, MA Picheny

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, pp. I--I

Cross-language access to recorded speech in the MALACH project

D. Oard, D. Demner-Fushman, J. Haji\v{c}, B. Ramabhadran, S. Gustman, W. Byrne, D. Soergel, B. Dorr, P. Resnik, M. Picheny

Text, Speech and Dialogue, pp. 197--212, 2006

The IBM expressive text-to-speech synthesis system for American English

J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, M.A. Picheny

Audio, Speech, and Language Processing, IEEE Transactions on 14(4), 1099--1108, IEEE, 2006

2005

Semantic confidence measurement for spoken dialog systems

R Sarikaya, Y Gao, M Picheny, H Erdogan

IEEE transactions on speech and audio processing 13(4), 534--545, 2005

Toward Multiple-Language TTS: Experiments in English and Mandarin

R. Fernandez, W. Zhang, E. Eide, R. Bakis, W. Hamza, Y. Liu, M. Picheny, J.F. Pitrelli, Y. Qing, Z.W. Shuang, others

Interspeech, Lisbon, Portugal, 2005

Using semantic analysis to improve speech recognition performance

H. Erdogan, R. Sarikaya, S.F. Chen, Y. Gao, M. Picheny

Computer Speech \& Language 19(3), 321--343, Elsevier, 2005

2004

Automatic recognition of spontaneous speech for access to multilingual oral history archives

M Picheny, J Psutka, B Ramabhadran, D Soergel, T …

Speech and Audio ..., 2004 - ieeexplore.ieee.org

A comparison of rule-based and statistical methods for semantic language modeling and confidence measurement

R. Sarikaya, Y. Gao, M. Picheny

Proceedings of HLT-NAACL 2004: Short Papers, pp. 65--68

A corpus-based approach to< ahem/> expressive speech synthesis

E Eide, A Aaron, R Bakis, W Hamza, M Picheny, J Pitrelli

Proccedings of 5th ISSW, 79--84, Citeseer, 2004

Applications of Language Modeling in Speech-To-Speech Translation

F H Liu, L Gu, Y Gao, M Picheny

International Journal of Speech Technology 7(2), 221--229, Springer, 2004

A corpus-based approach to expressive speech synthesis

E. Eide, A. Aaron, R. Bakis, W. Hamza, M. Picheny, J. Pitrelli

Fifth ISCA Workshop on Speech Synthesis, 2004

The IBM expressive speech synthesis system

W. Hamza, R. Bakis, E.M. Eide, M.A. Picheny, J.F. Pitrelli

Proc. ICSLP, pp. 2577--2580, 2004

2003

Forward-backward modeling in statistical natural concept generation for interlingua-based speech-to-speech translation

L Gu, Y Gao, M Picheny

IEEE Workshop on Automatic Speech Recognition and Understanding, 2003

SPEECH-P8. 6: RECENT IMPROVEMENTS TO THE IBM TRAINABLE SPEECH SYNTHESIS SYSTEM

E Eide, A Aaron, R Bakis, P Cohen, R Donovan, W Hamza, T Mathes, M Picheny, M Polkosky, M Smith

IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 2003

Information Access in Large Spoken Archives

M Franz, B Ramabhadran, T Ward, M Picheny

ISCA Workshop on Multilingual Spoken Document Retrieval, 2003

Improving statistical natural concept generation in interlingua-based speech-to-speech translation

L. Gu, Y. Gao, M. Picheny

Eighth European Conference on Speech Communication and Technology, 2003

A hand-held speech-to-speech translation system

B. Zhou, Y. Gao, J. Sorensen, D. D\'echelotte, M. Picheny

Automatic Speech Recognition and Understanding, 2003. ASRU'03. 2003 IEEE Workshop on, pp. 664--669

Noise robustness in speech to speech translation

F. Liu, Y. Gao, L. Gu, M. Picheny

Eighth European Conference on Speech Communication and Technology, 2003

Automated transcription and topic segmentation of large spoken archives

M. Franz, B. Ramabhadran, T. Ward, M. Picheny

Proceedings of Eurospeech (Geneva, Switzerland, pp. 953--956, 2003

Use of statistical N-gram models in natural language generation for machine translation

F.H. Liu, L. Gu, Y. Gao, M. Picheny

Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--636

Toward domain-independent conversational speech recognition.

Brian Kingsbury, Lidia Mangu, George Saon, Geoffrey Zweig, Scott Axelrod, Vaibhava Goel, Karthik Visweswariah, Michael Picheny

INTERSPEECH, 2003

Towards automatic transcription of large spoken archives-english ASR for the MALACH project

B. Ramabhadran, J. Huang, M. Picheny

Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--216

Word level confidence measurement using semantic features

R. Sarikaya, Y. Gao, M. Picheny

Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--604

Recent improvements to the IBM trainable speech synthesis system

E. Eide, A. Aaron, R. Bakis, R. Cohen, R. Donovan, W. Hamza, T. Mathes, M. Picheny, M. Polkosky, M. Smith, others

Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--708

2002

Statistical natural language generation for speech-to-speech machine translation

B. Zhou, Y. Gao, J. Sorensen, Z. Diao, M. Picheny

ICSLP2002, pp. 1897--1900

MARS: A statistical semantic parsing and generation-based multilingual automatic translation system

Y. Gao, B. Zhou, Z. Diao, J. Sorensen, M. Picheny

Machine Translation 17(3), 185--212, Springer, 2002

Large-Vocabulary Speech Recognition Algorithms

M Padmanabhan, M Picheny

COMPUTER, 2002 - doi.ieeecomputersociety.org

A trainable approach for multi-lingual speech-to-speech translation system

Y. Gao, J. Sorensen, H. Erdogan, R. Sarikaya, F. Liu, M. Picheny, B. Zhou, Z. Diao

Proceedings of the second international conference on Human Language Technology Research, pp. 231--234, Morgan Kaufmann Publishers Inc., 2002

Supporting access to large digital oral history archives

S. Gustman, D. Soergel, D. Oard, W. Byrne, M. Picheny, B. Ramabhadran, D. Greenberg

Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pp. 18--27, 2002

Semantic structured language models

H. Erdogan, R. Sarikaya, Y. Gao, M. Picheny

ICSLP 2002

Statistical natural language generation for speech-to-speech machine translation systems

B Zhou, Y Gao, J Sorensen, Z Diao, M Picheny

Seventh International Conference on Spoken Language Processing, 2002

Turn-based language modeling for spoken dialog systems

R. Sarikaya, Y. Gao, H. Erdogan, M. Picheny

Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. I--781

2001

Rapid adaptation using penalized-likelihood methods

H Erdogan, Y Gao, M Picheny

IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 2001

Recent advances in speech recognition system for ibm darpa communicator

Y. Gao, H. Erdogan, Y. Li, V. Goel, M. Picheny

SMALL 20(17.0), 16--2, Citeseer, 2001

Innovative approaches for large vocabulary name recognition

Y. Gao, B. Ramabhadran, J. Chen, H. Erdogan, M. Picheny

Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, pp. 53--56

Current status of the IBM trainable speech synthesis system

R. Donovan, A. Ittycheriah, M. Franz, B. Ramabhadran, E. Eide, M. Viswanathan, R. Bakis, W. Hamza, M. Picheny, P. Gleason, others

4th ISCA Tutorial and Research Workshop (ITRW) on Speech Synthesis, 2001

Speech recognition for DARPA communicator

Andrew Aaron, S Chen, P Cohen, Satya Dharanipragada, Ellen Eide, Martin Franz, J-M Leroux, X Luo, Beno\^\it Maison, Lidia Mangu, others

Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, pp. 489--492

2000

Impact of Bucketing on Performance of Linearly Interpolated Language Models

K Visweswariah, H Printz, M Picheny

... International Conference on ..., 2000 - cpfd.cnki.com.cn

Maximal rank likelihood as an optimization function for speech recognition

Y Gao, Y Li, M Picheny

The Proceedings of the 6\~{}(th) International Conference on Spoken Language Processing (Volume Ⅳ), 2000

1998

A new confidence measure based on rank-ordering subphone scores

Q. Lin, S. Das, D. Lubensky, M. Picheny

Proc. ICSLP, pp. 3249--3252, 1998

1996

Speaker clustering and transformation for speaker adaptation in large-vocabulary speech recognition systems

M. Padmanabhan, L.R. Bahl, D. Nahamoo, M.A. Picheny

Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on, pp. 701--704

ISSUES IN PRACTICAL LARGE VOCABULARY ISOLATED WORD RECOGNITION: THE IBM

SK Das, MA Picheny

Automatic Speech and Speaker Recognition: Advanced Topics355, 457, Springer, 1996

A Continuous Speaker-Independent Putonghua Dictation System

CJ Chen, RA Gopinath, MD Monkowski, MA Picheny, …

第四届全国人机语音通讯 ..., 1996 - cpfd.cnki.com.cn

1995

Performance of the IBM LVCSR System on the Switchboard Corpus

FH Liu, MD Monkowski, M. Novak, M. Padmanabhan, MA Picheny, PS Rao

Proceedings of Speech Research Symposium, pp. 189, 1995

IBM Switchboard progress and evaluation site report

F. Liu, MD Monkowski, M. Novak, M. Padmanabhan, MA Picheny, PS Rao

LVCSR Workshop, 1995

Robust speech recognition in noise---performance of the IBM continuous speech recogniser on the ARPA noise spoke task

RA Gopinath, MJF Gales, PS Gopalakrishnan, S. Balakrishnan Aiyer, MA Picheny

Proceedings of the ARPA Spoken Language Systems Technology Workshop, pp. 127--130, 1995

1994

Adaptation techniques for ambience and microphone compensation in the IBM Tangora speech recognition system

S. Das, A. Nadas, D. Nahamoo, M. Picheny

Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, pp. I--21

1993

Word lookahead scheme for cross-word right context models in a stack decoder

LR Bahl, P V Souza, PS Gopalakrishnan, D Nahamoo, M Picheny

Third European Conference on Speech Communication and Technology, 1993

A method for the construction of acoustic Markov models for words

LR Bahl, PF Brown, PV De Souza, RL Mercer, MA Picheny

Speech and Audio Processing, IEEE Transactions on 1(4), 443--452, IEEE, 1993

1992

Adaptation of large vocabulary recognition system parameters

L. Bahl, PV de Souza, D. Nahamoo, MA Picheny, S. Roukos

Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on, pp. 477--480

1991

Context dependent modeling of phones in continuous speech using decision trees

LR Bahl, PV De Souza, PS Gopalakrishnan, D. Nahamoo, MA Picheny

Proceedings DARPA Speech and Natural Language Processing Workshop, pp. 264--270, 1991

An iterativeflip-flop'approximation of the most informative split in the construction of decision trees

A. N\'adas, D. Nahamoo, M.A. Picheny, J. Powell

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 565--568

Automatic Phonetic Baseform Determination

LR Bahl, S. Das, PV Desouza, M. Epstein, RL Mercer, B. Merialdo, D. Nahamoo, MA Picheny, J. Powell

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 173--176

Decision trees for phonological rules in continuous speech

L.R. Bahl, PV deSouza, PS Gopalakrishnan, D. Nahamoo, MA Picheny

Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 185--188

1989

Speaking clearly for the hard of hearing III: An attempt to determine the contribution of speaking rate to differences in intelligibility between clear and conversational speech

M.A. Picheny, N.I. Durlach, L.D. Braida

Journal of Speech, Language and Hearing Research 32(3), 600, ASHA, 1989

Large vocabulary natural language continuous speech recognition

LR Bahl, R. Bakis, J. Bellegarda, PF Brown, D. Burshtein, SK Das, PV De Souza, PS Gopalakrishnan, F. Jelinek, D. Kanevsky, others

Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 465--467

Speech recognition using noise-adaptive prototypes

A. N\'adas, D. Nahamoo, M.A. Picheny

Acoustics, Speech and Signal Processing, IEEE Transactions on 37(10), 1495--1503, IEEE, 1989

1988

Acoustic Markov models used in the Tangora speech recognition system

LR Bahl, PF Brown, PV De Souza, MA Picheny

Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 497--500

1987

Automatic construction of acoustic markov models for words

LR Bahl, PF Brown, PV de Souza, RL Mercer, MA Picheny

1st IASTED International Symposium on Signal Processing and its Applications, pp. 565, 1987

1986

A real-time IBM PC based large-vocabulary isolated-word speech recognizer

M. Picheny, others

Pinner, England, Voice Processing, Online Publ, 1986

Use of articulatory signals in automatic speech recognition

LD Braida, MA Picheny, JR Cohen, WM Rabinowitz, JS Perkell

The Journal of the Acoustical Society of America 80(S1), S18--S18, Acoustical Society of America, 1986

Speaking clearly for the hard of hearing II: Acoustic characteristics of clear and conversational speech

M.A. Picheny, N.I. Durlach, L.D. Braida

Journal of Speech, Language and Hearing Research 29(4), 434, ASHA, 1986

1985

Speaking clearly for the hard of hearing I: Intelligibility differences between clear and conversational speech

M.A. Picheny, N.I. Durlach, L.D. Braida

Journal of Speech, Language and Hearing Research 28(1), 96, ASHA, 1985

1984

Some experiments with large-vocabulary isolated-word sentence recognition

L. Bahl, S. Das, P. de Souza, F. Jelinek, S. Katz, R. Mercer, M. Picheny

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'84., pp. 395--396, 1984

1983

Recognition of isolated-word sentences from a 5000-word vocabulary office correspondence task

L. Bahl, A. Cole, F. Jelinek, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny

Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'83., pp. 1065--1067, 1983

Page updated

Report abuse