Michael's Publications
2019
Khoi-Nguyen C. Mac, Xiaodong Cui, Wei Zhang, Michael Picheny Large-Scale Mixed-Bandwidth Deep Neural Network Acoustic Modeling for Automatic Speech Recognition, Interspeech 2019 Graz Austria
Michael Picheny, Zoltán Tüske, Brian Kingsbury, Kartik Audhkhasi, Xiaodong Cui, George Saon Challenging the Boundaries of Speech Recognition: The MALACH Corpus Interspeech 2019 Graz Austria
Xiaodong Cui, Michael Picheny Acoustic Model Optimization Based on Evolutionary Stochastic Gradient Descent with Anchors for Automatic Speech Recognition Interspeech 2019 Graz Austria
Zakaria Aldeneh, Mimansa Jaiswal, Michael Picheny, Melvin G. McInnis, Emily Mower Provost Identifying Mood Episodes Using Dialogue Features from Clinical Interviews Interspeech 2019 Graz Austria
Kartik Audhkhasi, George Saon, Zoltán Tüske, Brian Kingsbury, Michael Picheny Forget a Bit to Learn Better: Soft Forgetting for CTC-Based Automatic Speech Recognition Interspeech 2019 Graz Austria
Wei Zhang, Xiaodong Cui, Ulrich Finkler, George Saon, Abdullah Kayi, Alper Buyuktosunoglu, Brian Kingsbury, David Kung, Michael Picheny A Highly Efficient Distributed Deep Learning System for Automatic Speech Recognition, Interspeech 2019 Graz Austria
Samuel Thomas, Kartik Audhkhasi, Zoltán Tüske, Yinghui Huang, Michael Picheny Detection and Recovery of OOVs for Improved English Broadcast News Captioning Interspeech 2019 Graz Austria
S. Settle, K. Audhkhasi, K. Livescu and M. Picheny, "Acoustically Grounded Word Embeddings for Improved Acoustics-to-word Speech Recognition," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 5641-5645.
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8682903&isnumber=8682151
S. Thomas et al., "English Broadcast News Speech Recognition by Humans and Machines," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 6455-6459.
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8683211&isnumber=8682151
L. Sarı, S. Thomas, M. Hasegawa-Johnson and M. Picheny, "Pre-training of Speaker Embeddings for Low-latency Speaker Change Detection in Broadcast News," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 6286-6290.
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8683612&isnumber=8682151
W. Zhang et al., "Distributed Deep Learning Strategies for Automatic Speech Recognition," ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Brighton, United Kingdom, 2019, pp. 5706-5710.
URL: http://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnumber=8682888&isnumber=8682151
2018
Xiaodong Cui, Wei Zhang, Zoltan Tuske, and Michael Picheny. Evolutionary Stochastic Gradient Descent for Optimizationof Deep Neural Networks Advances in Neural Information Processing Systems, pp. 6051-6061. 2018.
Kartik Audhkhasi, Brian Kingsbury, Bhuvana Ramabhadran, George Saon, Michael Picheny Building Competitive DirectAcoustics-to-word models for English Conversational Speech Recognition 2018 IEEE International Conference on Acoustics,Speech and Signal Processing (ICASSP)
2017
English Conversational Telephone Speech Recognition by Humans and Machines
Saon, George and Kurata, Gakuto and Sercu, Tom and Audhkhasi, Kartik and Thomas, Samuel and Dimitriadis, Dimitrios and Cui, Xiaodong and Ramabhadran, Bhuvana and Picheny, Michael and Lim, Lynn-Li and others
arXiv preprint arXiv:1703.02136, 2017
Kernel Approximation Methods for Speech Recognition
May, Avner and Garakani, Alireza Bagheri and Lu, Zhiyun and Guo, Dong and Liu, Kuan and Bellet, Aur{\'e}lien and Fan, Linxi and Collins, Michael and Hsu, Daniel and Kingsbury, Brian and others
arXiv preprint arXiv:1701.03577, 2017
2016
A comparison between deep neural nets and kernel acoustic models for speech recognition
Lu, Zhiyun and Quo, Dong and Garakani, Alireza Bagheri and Liu, Kuan and May, Avner and Bellet, Aur{\'e}lien and Fan, Linxi and Collins, Michael and Kingsbury, Brian and Picheny, Michael and others
Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, pp. 5070--5074
Language Modeling/Pronunciation Modeling
Picheny, Michael and Ramabhadran, Bhuvana and Chen, Stanley F and Nussbaum-Thom, Markus
2016 - ee.columbia.edu
Picheny, Michael and Ramabhadran, Bhuvana and Chen, Stanley F and Nussbaum-Thom, Markus
2016 - ee.columbia.edu
Parallel deep neural network training for big data on blue gene/Q
Chung, I-Hsin and Sainath, Tara N and Ramabhadran, Bhuvana and Picheny, Michael and Gunnels, John and Austel, Vernon and Chauhari, Upendra and Kingsbury, Brian
IEEE Transactions on Parallel and Distributed Systems, IEEE, 2016
On the importance of event detection for ASR
Haws, David and Dimitriadis, Dimitrios and Saon, George and Thomas, Samuel and Picheny, Michael
Acoustics, Speech and Signal Processing (ICASSP), 2016 IEEE International Conference on, pp. 5705--5709
Training variance and performance evaluation of neural networks in speech
Berg, Ewout van den and Ramabhadran, Bhuvana and Picheny, Michael
arXiv preprint arXiv:1606.04521, 2016
2015
Order-free spoken term detection
Mangu, Lidia and Saon, George and Picheny, Michael and Kingsbury, Brian
Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, pp. 5331--5335
Multilingual representations for low resource speech recognition and keyword search
Cui, Jia and Kingsbury, Brian and Ramabhadran, Bhuvana and Sethy, Abhinav and Audhkhasi, Kartik and Cui, Xiaodong and Kislal, Ellen and Mangu, Lidia and Nussbaum-Thom, Markus and Picheny, Michael and others
Automatic Speech Recognition and Understanding (ASRU), 2015 IEEE Workshop on, pp. 259--266
The IBM 2015 english conversational telephone speech recognition system
Saon, George and Kuo, Hong-Kwang J and Rennie, Steven and Picheny, Michael
arXiv preprint arXiv:1505.05899, 2015
2014
Efficient spoken term detection using confusion networks
Mangu, Lidia and Kingsbury, Brian and Soltau, Hagen and Kuo, Hong-Kwang and Picheny, Michael
Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on, pp. 7844--7848
How to scale up kernel methods to be as good as deep neural nets
Lu, Zhiyun and May, Avner and Liu, Kuan and Garakani, Alireza Bagheri and Guo, Dong and Bellet, Aur{\'e}lien and Fan, Linxi and Collins, Michael and Kingsbury, Brian and Picheny, Michael and others
arXiv preprint arXiv:1411.4000, 2014
Parallel deep neural network training for LVCSR tasks using blue gene/Q.
Sainath, Tara N and Chung, I-hsin and Ramabhadran, Bhuvana and Picheny, Michael and Gunnels, John A and Kingsbury, Brian and Saon, George and Austel, Vernon and Chaudhari, Upendra V
INTERSPEECH, pp. 1048--1052, 2014
Unfolded recurrent neural networks for speech recognition.
Saon, George and Soltau, Hagen and Emami, Ahmad and Picheny, Michael
Interspeech, pp. 343--347, 2014
2013
A high-performance Cantonese keyword search system
Kingsbury, Brian and Cui, Jia and Cui, Xiaodong and Gales, Mark JF and Knill, Kate and Mamou, Jonathan and Mangu, Lidia and Nolden, David and Picheny, Michael and Ramabhadran, Bhuvana and others
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8277--8281
System combination and score normalization for spoken term detection
Mamou, Jonathan and Cui, Jia and Cui, Xiaodong and Gales, Mark JF and Kingsbury, Brian and Knill, Kate and Mangu, Lidia and Nolden, David and Picheny, Michael and Ramabhadran, Bhuvana and others
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 8272--8276
Developing keyword search under the IARPA Babel program
Mamou, Jonathan and Cui, Jia and Cui, Xiaodong and Gales, Mark JF and Kingsbury, Brian and Knill, Kate and Mangu, Lidia and Nolden, David and Picheny, Michael and Ramabhadran, Bhuvana and others
Proc. Afeka Speech Processing Conference, 2013
Speaker adaptation of neural network acoustic models using i-vectors.
Saon, George and Soltau, Hagen and Nahamoo, David and Picheny, Michael
ASRU, pp. 55--59, 2013
Developing speech recognition systems for corpus indexing under the IARPA Babel program
Cui, Jia and Cui, Xiaodong and Ramabhadran, Bhuvana and Kim, Janice and Kingsbury, Brian and Mamou, Jonathan and Mangu, Lidia and Picheny, Michael and Sainath, Tara N and Sethy, Abhinav
Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on, pp. 6753--6757
2011
Trends and advances in speech recognition
Michael Picheny, David Nahamoo, Vaibhava Goel, Brian Kingsbury, Bhuvana Ramabhadran, Steven J Rennie, George Saon
IBM Journal of Research and Development 55(5), 2--1, IBM, 2011
Deep belief networks using discriminative features for phone recognition
A.R. Mohamed, T.N. Sainath, G. Dahl, B. Ramabhadran, G.E. Hinton, M.A. Picheny
Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on, pp. 5060--5063
2009
Y. Pan, D. Jiang, M. Picheny, Y. Qin
Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009
An exploration of large vocabulary tools for small vocabulary phonetic recognition
T.N. Sainath, B. Ramabhadran, M. Picheny
Automatic Speech Recognition \& Understanding, 2009. ASRU 2009. IEEE Workshop on, pp. 359--364
Cultural voice markers in speech-to-speech machine translation systems
O Stewart, M Picheny, D Lubensky, B Ramabhadran
Proceeding of the 2009 international workshop on Intercultural collaboration, pp. 313--316, ACM New York, NY, USA
Y.X. Pan, D.N. Jiang, M Picheny, Y Qin
ACM CHI, Proceedings of the 27th international conference on Human factors in computing systems, pp. 2353--2356, 2009
2007
Improvements in phone based audio search via constrained match with high order confusion estimates
U.V. Chaudhari, M. Picheny
Automatic Speech Recognition \& Understanding, 2007. ASRU. IEEE Workshop on, pp. 665--670
Voice-melody transcription under a speech recognition framework
D. Jiang, M. Picheny, Y. Qin
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on, pp. IV--617
Lattice-based viterbi decoding techniques for speech translation
Automatic Speech Recognition \& Understanding, 2007. ASRU. IEEE Workshop on, pp. 386--389
2006
The 2006 TC-STAR evaluation of the IBM text-to-speech synthesis system
R. Fernandez, R. Bakis, E. Eide, W. Hamza, J. Pitrelli, M. Picheny
TC-STAR Workshop on Speech-to-Speech Translation (Barcelona, Spain), pp. 175--180, 2006
The IBM submission to the 2006 Blizzard text-to-speech challenge
E. Eide, R. Fernandez, R. Hoory, W. Hamza, Z. Kons, M. Picheny, A. Sagi, S. Shechtman, Z.W. Shuang
Blizzard Workshop, 2006
L. Gu, Y. Gao, F.H. Liu, M. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(2), 377--392, IEEE, 2006
Towards pooled-speaker concatenative text-to-speech
E.M. Eide, MA Picheny
Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on, pp. I--I
Cross-language access to recorded speech in the MALACH project
D. Oard, D. Demner-Fushman, J. Haji\v{c}, B. Ramabhadran, S. Gustman, W. Byrne, D. Soergel, B. Dorr, P. Resnik, M. Picheny
Text, Speech and Dialogue, pp. 197--212, 2006
The IBM expressive text-to-speech synthesis system for American English
J.F. Pitrelli, R. Bakis, E.M. Eide, R. Fernandez, W. Hamza, M.A. Picheny
Audio, Speech, and Language Processing, IEEE Transactions on 14(4), 1099--1108, IEEE, 2006
2005
Semantic confidence measurement for spoken dialog systems
R Sarikaya, Y Gao, M Picheny, H Erdogan
IEEE transactions on speech and audio processing 13(4), 534--545, 2005
Toward Multiple-Language TTS: Experiments in English and Mandarin
R. Fernandez, W. Zhang, E. Eide, R. Bakis, W. Hamza, Y. Liu, M. Picheny, J.F. Pitrelli, Y. Qing, Z.W. Shuang, others
Interspeech, Lisbon, Portugal, 2005
Using semantic analysis to improve speech recognition performance
H. Erdogan, R. Sarikaya, S.F. Chen, Y. Gao, M. Picheny
Computer Speech \& Language 19(3), 321--343, Elsevier, 2005
2004
Automatic recognition of spontaneous speech for access to multilingual oral history archives
M Picheny, J Psutka, B Ramabhadran, D Soergel, T …
Speech and Audio ..., 2004 - ieeexplore.ieee.org
R. Sarikaya, Y. Gao, M. Picheny
Proceedings of HLT-NAACL 2004: Short Papers, pp. 65--68
A corpus-based approach to< ahem/> expressive speech synthesis
E Eide, A Aaron, R Bakis, W Hamza, M Picheny, J Pitrelli
Proccedings of 5th ISSW, 79--84, Citeseer, 2004
Applications of Language Modeling in Speech-To-Speech Translation
F H Liu, L Gu, Y Gao, M Picheny
International Journal of Speech Technology 7(2), 221--229, Springer, 2004
A corpus-based approach to expressive speech synthesis
E. Eide, A. Aaron, R. Bakis, W. Hamza, M. Picheny, J. Pitrelli
Fifth ISCA Workshop on Speech Synthesis, 2004
The IBM expressive speech synthesis system
W. Hamza, R. Bakis, E.M. Eide, M.A. Picheny, J.F. Pitrelli
Proc. ICSLP, pp. 2577--2580, 2004
2003
L Gu, Y Gao, M Picheny
IEEE Workshop on Automatic Speech Recognition and Understanding, 2003
SPEECH-P8. 6: RECENT IMPROVEMENTS TO THE IBM TRAINABLE SPEECH SYNTHESIS SYSTEM
E Eide, A Aaron, R Bakis, P Cohen, R Donovan, W Hamza, T Mathes, M Picheny, M Polkosky, M Smith
IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 2003
Information Access in Large Spoken Archives
M Franz, B Ramabhadran, T Ward, M Picheny
ISCA Workshop on Multilingual Spoken Document Retrieval, 2003
Improving statistical natural concept generation in interlingua-based speech-to-speech translation
L. Gu, Y. Gao, M. Picheny
Eighth European Conference on Speech Communication and Technology, 2003
A hand-held speech-to-speech translation system
B. Zhou, Y. Gao, J. Sorensen, D. D\'echelotte, M. Picheny
Automatic Speech Recognition and Understanding, 2003. ASRU'03. 2003 IEEE Workshop on, pp. 664--669
Noise robustness in speech to speech translation
F. Liu, Y. Gao, L. Gu, M. Picheny
Eighth European Conference on Speech Communication and Technology, 2003
Automated transcription and topic segmentation of large spoken archives
M. Franz, B. Ramabhadran, T. Ward, M. Picheny
Proceedings of Eurospeech (Geneva, Switzerland, pp. 953--956, 2003
Use of statistical N-gram models in natural language generation for machine translation
F.H. Liu, L. Gu, Y. Gao, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--636
Toward domain-independent conversational speech recognition.
Brian Kingsbury, Lidia Mangu, George Saon, Geoffrey Zweig, Scott Axelrod, Vaibhava Goel, Karthik Visweswariah, Michael Picheny
INTERSPEECH, 2003
Towards automatic transcription of large spoken archives-english ASR for the MALACH project
B. Ramabhadran, J. Huang, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--216
Word level confidence measurement using semantic features
R. Sarikaya, Y. Gao, M. Picheny
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--604
Recent improvements to the IBM trainable speech synthesis system
E. Eide, A. Aaron, R. Bakis, R. Cohen, R. Donovan, W. Hamza, T. Mathes, M. Picheny, M. Polkosky, M. Smith, others
Acoustics, Speech, and Signal Processing, 2003. Proceedings.(ICASSP'03). 2003 IEEE International Conference on, pp. I--708
2002
Statistical natural language generation for speech-to-speech machine translation
B. Zhou, Y. Gao, J. Sorensen, Z. Diao, M. Picheny
ICSLP2002, pp. 1897--1900
MARS: A statistical semantic parsing and generation-based multilingual automatic translation system
Y. Gao, B. Zhou, Z. Diao, J. Sorensen, M. Picheny
Machine Translation 17(3), 185--212, Springer, 2002
Large-Vocabulary Speech Recognition Algorithms
M Padmanabhan, M Picheny
COMPUTER, 2002 - doi.ieeecomputersociety.org
A trainable approach for multi-lingual speech-to-speech translation system
Y. Gao, J. Sorensen, H. Erdogan, R. Sarikaya, F. Liu, M. Picheny, B. Zhou, Z. Diao
Proceedings of the second international conference on Human Language Technology Research, pp. 231--234, Morgan Kaufmann Publishers Inc., 2002
Supporting access to large digital oral history archives
S. Gustman, D. Soergel, D. Oard, W. Byrne, M. Picheny, B. Ramabhadran, D. Greenberg
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries, pp. 18--27, 2002
Semantic structured language models
H. Erdogan, R. Sarikaya, Y. Gao, M. Picheny
ICSLP 2002
Statistical natural language generation for speech-to-speech machine translation systems
B Zhou, Y Gao, J Sorensen, Z Diao, M Picheny
Seventh International Conference on Spoken Language Processing, 2002
Turn-based language modeling for spoken dialog systems
R. Sarikaya, Y. Gao, H. Erdogan, M. Picheny
Acoustics, Speech, and Signal Processing (ICASSP), 2002 IEEE International Conference on, pp. I--781
2001
Rapid adaptation using penalized-likelihood methods
H Erdogan, Y Gao, M Picheny
IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 2001
Recent advances in speech recognition system for ibm darpa communicator
Y. Gao, H. Erdogan, Y. Li, V. Goel, M. Picheny
SMALL 20(17.0), 16--2, Citeseer, 2001
Innovative approaches for large vocabulary name recognition
Y. Gao, B. Ramabhadran, J. Chen, H. Erdogan, M. Picheny
Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, pp. 53--56
Current status of the IBM trainable speech synthesis system
R. Donovan, A. Ittycheriah, M. Franz, B. Ramabhadran, E. Eide, M. Viswanathan, R. Bakis, W. Hamza, M. Picheny, P. Gleason, others
4th ISCA Tutorial and Research Workshop (ITRW) on Speech Synthesis, 2001
Speech recognition for DARPA communicator
Andrew Aaron, S Chen, P Cohen, Satya Dharanipragada, Ellen Eide, Martin Franz, J-M Leroux, X Luo, Beno\^\it Maison, Lidia Mangu, others
Acoustics, Speech, and Signal Processing, 2001. Proceedings.(ICASSP'01). 2001 IEEE International Conference on, pp. 489--492
2000
Impact of Bucketing on Performance of Linearly Interpolated Language Models
K Visweswariah, H Printz, M Picheny
... International Conference on ..., 2000 - cpfd.cnki.com.cn
Maximal rank likelihood as an optimization function for speech recognition
Y Gao, Y Li, M Picheny
The Proceedings of the 6\~{}(th) International Conference on Spoken Language Processing (Volume Ⅳ), 2000
1998
A new confidence measure based on rank-ordering subphone scores
Q. Lin, S. Das, D. Lubensky, M. Picheny
Proc. ICSLP, pp. 3249--3252, 1998
1996
M. Padmanabhan, L.R. Bahl, D. Nahamoo, M.A. Picheny
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on, pp. 701--704
ISSUES IN PRACTICAL LARGE VOCABULARY ISOLATED WORD RECOGNITION: THE IBM
SK Das, MA Picheny
Automatic Speech and Speaker Recognition: Advanced Topics355, 457, Springer, 1996
A Continuous Speaker-Independent Putonghua Dictation System
CJ Chen, RA Gopinath, MD Monkowski, MA Picheny, …
第四届全国人机语音通讯 ..., 1996 - cpfd.cnki.com.cn
1995
Performance of the IBM LVCSR System on the Switchboard Corpus
FH Liu, MD Monkowski, M. Novak, M. Padmanabhan, MA Picheny, PS Rao
Proceedings of Speech Research Symposium, pp. 189, 1995
IBM Switchboard progress and evaluation site report
F. Liu, MD Monkowski, M. Novak, M. Padmanabhan, MA Picheny, PS Rao
LVCSR Workshop, 1995
RA Gopinath, MJF Gales, PS Gopalakrishnan, S. Balakrishnan Aiyer, MA Picheny
Proceedings of the ARPA Spoken Language Systems Technology Workshop, pp. 127--130, 1995
1994
S. Das, A. Nadas, D. Nahamoo, M. Picheny
Acoustics, Speech, and Signal Processing, 1994. ICASSP-94., 1994 IEEE International Conference on, pp. I--21
1993
Word lookahead scheme for cross-word right context models in a stack decoder
LR Bahl, P V Souza, PS Gopalakrishnan, D Nahamoo, M Picheny
Third European Conference on Speech Communication and Technology, 1993
A method for the construction of acoustic Markov models for words
LR Bahl, PF Brown, PV De Souza, RL Mercer, MA Picheny
Speech and Audio Processing, IEEE Transactions on 1(4), 443--452, IEEE, 1993
1992
Adaptation of large vocabulary recognition system parameters
L. Bahl, PV de Souza, D. Nahamoo, MA Picheny, S. Roukos
Acoustics, Speech, and Signal Processing, 1992. ICASSP-92., 1992 IEEE International Conference on, pp. 477--480
1991
Context dependent modeling of phones in continuous speech using decision trees
LR Bahl, PV De Souza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Proceedings DARPA Speech and Natural Language Processing Workshop, pp. 264--270, 1991
A. N\'adas, D. Nahamoo, M.A. Picheny, J. Powell
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 565--568
Automatic Phonetic Baseform Determination
LR Bahl, S. Das, PV Desouza, M. Epstein, RL Mercer, B. Merialdo, D. Nahamoo, MA Picheny, J. Powell
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 173--176
Decision trees for phonological rules in continuous speech
L.R. Bahl, PV deSouza, PS Gopalakrishnan, D. Nahamoo, MA Picheny
Acoustics, Speech, and Signal Processing, 1991. ICASSP-91., 1991 International Conference on, pp. 185--188
1989
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 32(3), 600, ASHA, 1989
Large vocabulary natural language continuous speech recognition
LR Bahl, R. Bakis, J. Bellegarda, PF Brown, D. Burshtein, SK Das, PV De Souza, PS Gopalakrishnan, F. Jelinek, D. Kanevsky, others
Acoustics, Speech, and Signal Processing, 1989. ICASSP-89., 1989 International Conference on, pp. 465--467
Speech recognition using noise-adaptive prototypes
A. N\'adas, D. Nahamoo, M.A. Picheny
Acoustics, Speech and Signal Processing, IEEE Transactions on 37(10), 1495--1503, IEEE, 1989
1988
Acoustic Markov models used in the Tangora speech recognition system
LR Bahl, PF Brown, PV De Souza, MA Picheny
Acoustics, Speech, and Signal Processing, 1988. ICASSP-88., 1988 International Conference on, pp. 497--500
1987
Automatic construction of acoustic markov models for words
LR Bahl, PF Brown, PV de Souza, RL Mercer, MA Picheny
1st IASTED International Symposium on Signal Processing and its Applications, pp. 565, 1987
1986
A real-time IBM PC based large-vocabulary isolated-word speech recognizer
M. Picheny, others
Pinner, England, Voice Processing, Online Publ, 1986
Use of articulatory signals in automatic speech recognition
LD Braida, MA Picheny, JR Cohen, WM Rabinowitz, JS Perkell
The Journal of the Acoustical Society of America 80(S1), S18--S18, Acoustical Society of America, 1986
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 29(4), 434, ASHA, 1986
1985
M.A. Picheny, N.I. Durlach, L.D. Braida
Journal of Speech, Language and Hearing Research 28(1), 96, ASHA, 1985
1984
Some experiments with large-vocabulary isolated-word sentence recognition
L. Bahl, S. Das, P. de Souza, F. Jelinek, S. Katz, R. Mercer, M. Picheny
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'84., pp. 395--396, 1984
1983
Recognition of isolated-word sentences from a 5000-word vocabulary office correspondence task
L. Bahl, A. Cole, F. Jelinek, R. Mercer, A. Nadas, D. Nahamoo, M. Picheny
Acoustics, Speech, and Signal Processing, IEEE International Conference on ICASSP'83., pp. 1065--1067, 1983