Peder Olsen



Publications 


PUBLICATIONS - Ph. D. Thesis

[1]
Peder A. Olsen, "Negative Eigenvalues of the Schrödinger Equation: an Approach through Fractional Integration and Morrey Spaces," University of Michigan,, April 1996.


PUBLICATIONS - Journal Articles

[2]
Peder A. Olsen and Kristian Seip, "A Note on Irregular Discrete Wavelet Transforms," IEEE Transactions on Information Theory,38 (2), 861-864, March 1992..
[3]
Peder Olsen, "Fractional Integration, Morrey Spaces and a Schrödinger Equation," Communications in Partial Differential Equations,20 (11/12), 2004-2057 (1995).
[4]
Peder Olsen and Renming Song, "Diffusion of Directed Polymers in a Strong Random Environment," Journal of Statistical Physics,83 (1/2), 727-738, May 1996.
[5]
Joseph Conlon and Peder Olsen, "A Brownian Motion Version of the Directed Polymer Problem," Journal of Statistical Physics,84 (3/4), 415-454, August 1996.
[6]
Joseph Conlon and Peder Olsen, "Estimates on the solution of an elliptic equation related to Brownian motion with drift, II," Revista Matemática Iberoamericana, 13 (3), 567-711 (1997).
[7]
Joseph Conlon and Peder Olsen, "Fluctuations of Brownian Motion with Drift," Publicacions Matematiques, 43, 85-125 (1999).
[8]
Sankar Basu, Charles Micchelli and Peder Olsen, "Power Exponential Densities for the Training and Classification of Acoustic Feature Vectors in Speech Recognition," Journal of Computational and Graphical Statistics,10 (1), 158-192, March 2001.
[9]
Charles A. Micchelli and Peder A. Olsen, "Penalized Maximum Likelihood Estimation, the Baum Welch Algorithm, Diagonal Balancing of Symmetric Matrices and Applications to Training Acoustic Data," Journal of Computational and Applied Mathematics,119 (1-2), 301-331 (2000). Special issue: dedicated to Prof. L. Schumaker on the occasion of his 60th birthday.
[10]
Scott S. Chen, Ellen M. Eide, Mark J. F. Gales, Ramesh A. Gopinath, Dimitri Kanevsky and Peder Olsen, "Automatic Transcription of Broadcast News," Speech Communications, 37 (1-2), 69-87, May 2002. Special issue: Automatic Transcription of Broadcast News.
[11]
Harry Printz and Peder Olsen, "Theory and Practice of Acoustic Confusability," Computer, Speech and Language,16 (1), 131-164, January 2002. Special issue: Advances in Large Vocabulary Speech Recognition.
[12]
Sankar Basu, Mohammad Saif Ullah Khan, Charles A. Micchelli and Peder A. Olsen, "On an optimization problem arising from probability density estimation," Revista de la Real Academia de Ciencias, Serie A. Matemáticas, 96 (2), 139-156, 2002.
[13]
Sabine Deligne, Satya Dharanipragada, Ramesh Gopinath, Benoit Maison, Peder Olsen and Harry Printz, "A Robust High Accuracy Speech Recognition System for Mobile Applications," IEEE Transactions on Speech and Audio Processing, Special issue on automatic speech recognition for mobile and portable devices, 10 (8), 551-561, November 2002.
[14]
Peder Olsen and Ramesh A. Gopinath, "Modeling Inverse Covariance Matrices by Basis Expansion," Transactions in Speech and Audio Processing,12 (1), 37-46, January 2004.
[15]
Scott Axelrod, Vaibhava Goel, Ramesh A. Gopinath, Peder A. Olsen and Karthik Visweswariah, "Subspace Constrained Gaussian Mixture Models for Speech Recognition," Transactions in Speech and Audio Processing,13 (6), 1144-1160, November 2005.
[16]
Scott Axelrod, Vaibhava Goel, Ramesh A. Gopinath, Peder A. Olsen and Karthik Visweswariah, "Discriminative Estimation of Subspace Constrained Gaussian Mixture Models for Speech Recognition," Transactions in Speech and Audio Processing,15 (1), 172-189, January 2007.
[17]
John R. Hershey, Steven Rennie, Peder A. Olsen and Trausti Kristjansson, "Super-human multi-talker speech recognition: A graphical modeling approach," Computer, Speech and Language, 24, 45-66, January 2010. Special issue: Speech Separation and Recognition.
[18]
Steven J. Rennie, John R. Hershey and Peder A. Olsen, "Graphical Models for Single-channel Multi-talker Speech Recognition," Signal Processing Magazine, 27 (6), 66-80. November 2010. Special issue: Graphical Modeling.


PUBLICATIONS - Conference Proceedings

[19]
Lazaros Polymenakos, Peder Olsen, Dimitri Kanvesky, Ramesh A. Gopinath, Ponani S. Gopalakrishnan, Scott Chen and Harry Printz, "IBM's LVCSR System for Transcription of Broadcast News Used in the 1997 Hub-4 English Evaluation," Proceedings of the Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, USA, February 1998.
[20]
Lazaros Polymenakos, Peder Olsen, Dimitri Kanvesky, Ramesh A. Gopinath, Ponani S. Gopalakrishnan, Scott Chen and Harry Printz, "Transcription of Broadcast News Using IBM's LVCSR System," IEEE International Conference on Acoustics, Speech and Signal Processing,2, 901-904, Seattle, Washington, USA, May 1998.
[21]
Scott S. Chen, Ellen M. Eide, Mark J. F. Gales, Ramesh A. Gopinath, Dimitri Kanevsky and Peder Olsen, "Recent Improvements to IBM's Speech Recognition System for Automatic Transcription of Broadcast News," DARPA Broadcast News Workshop, 89-95, Herndon, Virginia, USA, February 1999.
[22]
Scott S. Chen, Ellen M. Eide, Mark J. F. Gales, Ramesh A. Gopinath, Dimitri Kanevsky and Peder Olsen, "Recent Improvements to IBM's Speech Recognition System for Automatic Transcription of Broadcast News," IEEE International Conference on Acoustics, Speech and Signal Processing,1, 37-40, Phoenix, Arizona, USA, March 1999.
[23]
Ellen M. Eide, Benoit Maison, Mark J. F. Gales, Ramesh A. Gopinath, Scott Chen, Dimitri Kanevsky. Miroslav Novak, Lidia L. Mangu and Peder A. Olsen, "IBM's 10X Real-time Broadcast News Transcription System used in the 1999 HUB4 Evaluation," 2000 Speech Transcription Workshop, University of Maryland (2000).
[24]
Ellen M. Eide, Benoit Maison, Dimitri Kanevsky, Peder A. Olsen, Scott Chen, Lidia L. Mangu, Mark J. F. Gales, Mirek Novak and Ramesh A. Gopinath, "Transcription of Broadcast News with a Time Constraint: IBM's 10XRT HUB4 System," Proceedings of the International Conference on Spoken Language Processing, Beijing, China (2000).
[25]
Mark J. F. Gales and Peder Olsen, "Tail Distribution Modelling Using the Richter and Power Exponential Distributions," Eurospeech '99 - 6th European Conference on Speech Communication and Technology,4, 1507-1510, Budapest, Hungary, September 1999.
[26]
Sankar Basu, Charles Micchelli and Peder Olsen, "A Maximum Entropy Criterion for Feature Extraction from Multivariate Data," International Symposium on Wavelets and Applications, Guangzhou, China (1999).
[27]
Sankar Basu, Charles A. Micchelli and Peder Olsen, "Maximum likelihood estimates for exponential type density families," 1999 IEEE International Conference on Acoustics, Speech and Signal Processing,1, 361-364, Phoenix, Arizona, USA, March 1999.
[28]
Phil Fong and Peder Olsen, "The Curse of Dimensionality - Nonparametric Multivariate Density Estimation in Speech Recognition," Joint Mathematics Meetings (2000) Washington D.C.
[29]
Sankar Basu, Charles Micchelli and Peder Olsen, "A Maximum Entropy and Maximum Likelihood Criteria for Feature Selection from Multivariable Data," IEEE International Symposium on Circuits and Systems, III, p. 267-270, Geneva, Switzerland (2000).
[30]
Harry Printz and Peder Olsen, "Theory and practice of acoustic confusability," Automatic Speech Recognition - Challenges for the new millenium, 77-84, Paris, France, September 2000.
[31]
Sabine Deligne, Ellen Eide, Ramesh A. Gopinath, Dimitri Kanevksy, Benoit Maison, Peder Olsen, Harry Printz and Jan Sedivy., "Low-Resource Speech Recognition of 500-word Vocabularies," Proceedings of Eurospeech 2001. .
[32]
Ramesh Gopinath, Vaibhava Goel, Karthik Visweswariah and Peder Olsen, "Adaptation Experiments on the SPINE Database with the Extended Maximum Likelihood Linear Transform (EMLLT) Model," IEEE International Conference on Acoustics, Speech and Signal Processing, I, p. 925-928, May 13-17, 2002 Orlando Florida.
[33]
Peder Olsen and Ramesh Gopinath, "Modeling Inverse Covariance Matrices by Basis Expansion," IEEE International Conference on Acoustics, Speech and Signal ProcessingI, p. 945-948, May 13-17, 2002 Orlando Florida.
[34]
Scott Axelrod, Ramesh Gopinath and Peder Olsen, "Modeling with a Subspace Constraint on Inverse Covariance Matrices," Proceedings of the International Conference on Spoken Language Processing,, 3, p. 2177-2180, September 16-20, 2002, Denver, Colorado.
[35]
Jing Huang, Vaibhava Goel, Ramesh Gopinath, Brian Kingsbury, Peder Olsen and Karthik Visweswariah, "Large Vocabulary Conversational Speech Recognition with the Extended Maximum Likelihood Linear Transformation (EMLLT) Model," Proceedings of the International Conference on Spoken Language Processing,4, p. 2597-2600, September 16-20, 2002, Denver, Colorado.
[36]
Karthik Visweswariah, Peder Olsen, Ramesh Gopinath and Scott Axelrod, "Maximum Likelihood Training of Subspaces for Inverse Covariance Modeling," IEEE International Conference on Acoustics, Speech and Signal Processing,I, p. 896-899, April 6-10, 2003, Hong Kong..
[37]
Scott Axelrod, Ramesh Gopinath, Peder Olsen and Karthik Visweswariah, "Dimensional Reduction, Covariance Modeling and Computational Complexity in ASR Systems,," IEEE International Conference on Acoustics, Speech and Signal Processing,I, p. 912-915, April 6-10, 2003, Hong Kong.
[38]
Vaibhava Goel, Scott Axelrod, Ramesh Gopinath, Peder Olsen and Karthik Visweswariah, "Discriminative Estimation of Subspace Mean and Precision (SPAM) Models," Eurospeech 2003,4, p. 2617-2620, September 1-4, 2003, Geneva Switzerland.
[39]
Peder Olsen and Satya Dharanipragada, "An efficient integrated gender detection scheme and time mediated averaging of gender dependent acoustic models," Eurospeech,4, p. 2509-2512, September 1-4, 2003, Geneva Switzerland.
[40]
Peder A. Olsen, Scott Axelrod, Karthik Visweswariah and Ramesh Gopinath, "Gaussian Mixture Modeling with Volume Preserving Nonlinear Feature Space Transforms," Automatic Speech Recognition and Understanding, 4, p. 285-290, Dec. 1-4, 2003, US Virgin Islands.
[41]
Peder A. Olsen and Karthik Visweswariah, "Fast Clustering of Gaussians and the Virtue of Representing Gaussians in Exponential Model Format," Proceedings of the International Conference on Spoken Language Processing, October, 2004, Jeju, Korea.
[42]
Peder A. Olsen, Karthik Visweswariah and Ramesh Gopinath, "Initializing Subspace Constrained Gaussian Mixture Models," IEEE International Conference on Acoustics, Speech and Signal Processing,I, p. 661-664, vol I, March, 2005, Philadelphia, Pennsylvania, USA.
[43]
Trausti Kristjansson, Sabine Deligne and Peder Olsen, "Voicing Features for Robust Speech Detection," Interspeech 2005, p. 369-372, September 4-8. 2005, Lisbon, Portugal.
[44]
Karthik Visweswariah and Peder Olsen, "Feature adaptation using projection of Gaussian posteriors," Interspeech 2005, p. 1785-1788, September 4-8. 2005, Lisbon, Portugal.
[45]
Steven Rennie, Trausti Kristjansson, Peder Olsen and Ramesh Gopinath, "Dynamic Noise Adaptation," ICASSP 2006,I, p. 1197-1200, vol I, May 14-19, 2006, Toulouse, Franc.
[46]
Trausti Kristjansson, John Hershey, Peder Olsen, Steven Rennie and Ramesh Gopinath, "Super-human multi-talker speech recognition: The IBM 2006 speech separation challenge system," Interspeech 2006 ICSLP, p. 97-100, 17-21 September, 2006, Pittsburgh, Pennsylvania.
[47]
Steven Rennie, Peder Olsen, John Hershey and Trausti Kristjansson, "The Iroquois Model: Using Temporal Dynamics to Separate Speakers," Interspeech 2006 ICSLP, ISCA Tutorial and Research Workshop on Statistical And Perceptual Audition, p. 24-30, September 16 2006, Pittsburgh, Pennsylvania.
[48]
John Hershey, Trausti Kristjansson, Steven Rennie and Peder Olsen, "Single Channel Speech Separation Using Layered Hidden Markov Models," NIPS 2006 .
[49]
John Hershey, Peder Olsen and Ramesh Gopinath, "Variational sampling approaches to word confusability," Information Theory and Applications , February 2007, San Diego, USA.
[50]
John Hershey and Peder Olsen, "Approximating the Kullback Leibler Divergence Between Gaussian Mixture Models, ICASSP 2007, IV, p. 317-320, April 15-20, 2007, Honolulu, Hawaii.
[51]
Jia-Yu Chen, Peder Olsen and John Hershey, "Word Confusability - Measuring Hidden Markov Model Similarity," Interspeech 2007, p. 2089-2092, 27-31 August, 2007, Antwerp, Belgium.
[52]
Peder Olsen and John Hershey, "Bhattacharyya Error and Divergence using Variational Importance Sampling," Interspeech 2007, p. 46-49, 27-31 August, 2007, Antwerp, Belgium.
[53]
John R. Hershey, Peder A. Olsen and Steven J. Rennie, "Variational Kullback-Leibler Divergence for Hidden Markov Models," ASRU 2007 , p. 323-328,December 9-13, Kyoto, Japan.
[54]
Steven J. Rennie, John Hershey and Peder Olsen, "Efficient Model-based Speech Separation and Denoising using Non-negative Subspace Analysis," ICASSP 2008, p. 1833-1836, March 30 - April 4, Las Vegas, Nevada.
[55]
Binit Mohanty, John R. Hershey, Peder A. Olsen, Suleyman S. Kozat and Vaibhava Goel, "Optimizing Speech Recognition Grammars using a Measure of Similarity Between Hidden Markov Models," ICASSP 2008, p. 4593-4596, March 30 - April 4, Las Vegas, Nevada.
[56]
John Hershey and Peder Olsen, "Variational Bhattacharyya Divergence for Hidden Markov Models," ICASSP 2008, p. 4557-4560, March 30 - April 4, Las Vegas, Nevada.
[57]
Jia-Yu Chen, John Hershey, Peder Olsen and Emmanuel Yashchin, "Accelerated Monte Carlo for Kullback-Leibler Divergence between Gaussian Mixture Models," ICASSP 2008, p. 4553-4556, March 30 - April 4, Las Vegas, Nevada.
[58]
Pierre L. Dognin, John R. Hershey, Vaibhava Goel and Peder Olsen, "Refactoring acoustic models using variational density approximation," ICASSP 2009, p. 4473-4476, April 19-24, Taipei, Taiwan.
[59]
Pierre L. Dognin, Vaibhava Goel, Peder A. Olsen and John R. Hershey, "A fast, accurate approximation to log likelihood of Gaussian mixture models," ICASSP 2009, p. 3817-3820, April 19-24, Taipei, Taiwan.
[60]
Steven J. Rennie, John R. Hershey and Peder A. Olsen, "Single-channel speech separation and recognition using loopy belief propagation," ICASSP 2009, p. 3845-3848, April 19-24, Taipei, Taiwan.
[61]
Steven J. Rennie, John R. Hershey and Peder A. Olsen, "Variational Loopy Belief Propagation for Efficient Multi-talker Speech Recognition," Interspeech 2009, p. 1331-1334, September 6-10, Brighton, UK.
[62]
Etienne Marcheret, Jia-Yu Chen, Petr Fousek, Peder Olsen and Vaibhava Goel, "Compacting Discriminative Feature Space Transforms for Embedded Devices," Interspeech 2009, p. 1331-1334, September 6-10, Brighton, UK.
[63]
Vaibhava Goel and Peder Olsen, "Acoustic Modeling Using Exponential Families," Interspeech 2009, p. 1331-1334, September 6-10, Brighton, UK.
[64]
Pierre Dognin, John Hershey, Vaibhava Goel and Peder Olsen, "Refactoring Acoustic Models using Variational Expectation-Maximization," Interspeech 2009, p. 1331-1334, September 6-10, Brighton, UK.
[65]
Etienne Marcheret, Vaibhava Goel and Peder A. Olsen, "Optimal Quantization and Bit Allocation for Compressing Large Discriminative Feature Space Transforms," ASRU 2009, Merano, Italy.
[66]
Steven J. Rennie, John R. Hershey and Peder A. Olsen, "Hierarchical Variational Loopy Belief Propagation for Multi-talker Speech Recognition," ASRU 2009, Merano, Italy.
[67]
Pierre L. Dognin, John R. Hershey, Vaibhava Goel, and Peder A. Olsen, "Restructuring Acoustic Models for Client and Server Based Automatic Speech Recognition," Spoken Query Workshop, ICASSP, Dallas, Texas, March 2010.
[68]
Peder Olsen, Vaibhava Goel, Charles Micchelli, and John Hershey, "Modeling Posterior Probabilities using the Linear Exponential Family," Interspeech 2010, p. 2994-2997, Makuhari, Japan, 26-30 September 2010.
[69]
Pierre L. Dognin, John R. Hershey, Vaibhava Goel, and Peder A. Olsen, "Restructuring Exponential Family Mixture Models," Interspeech 2010, p. 62-65, Makuhari, Japan, 26-30 September 2010.
[70]
Vaibhava Goel, Tara N. Sainath, Bhuvana Ramabhadran, Peder A. Olsen, David Nahamoo and Dimitri Kanevsky, "Incorporating sparse representation phone identification features in automatic speech recognition using exponential families," Interspeech 2010, p. 1345-1348, Makuhari, Japan, 26-30 September 2010.
[71]
John R. Hershey, Peder A. Olsen and Steven J. Rennie, "Signal interaction and the devil function," Interspeech 2010, p. 334-337, Makuhari, Japan, 26-30 September 2010.
[72]
Peder A. Olsen, Vaibhava Goel and Steven J. Rennie, "Discriminative training for full covariance models," ICASSP 2011, p. 5312-5315, Prague, Czech Republic, 22-27 May 2011.
[73]
Shilei Zhang, Peder A. Olsen, Yong Qin, "Rapid feature space MLLR speaker adaptation with bilinear models," ICASSP 2011, p. 4452-4455, Prague, Czech Republic, 22-27 May 2011.
[74]
Jing Huang, Karthik Visweswariah, Peder Olsen, Vaibhav Goel, "Front-end feature transforms with context filtering for speaker adaptation," ICASSP 2011, p. 4440-4443, Prague, Czech Republic, 22-27 May 2011.
[75]
Xin Chen, Xiaodong Cui, Jian Xue, Peder Olsen, John Hersey, Bowen Zhou and Yunxin Zhao, "Clustering of bootstrapped acoustic model with full covariance," ICASSP 2011, p. 4496-4499, Prague, Czech Republic, 22-27 May 2011.
[76]
Dimitri Kanvesky, Tara N. Sainath, David Nahamoo, Bhuvana Ramabhadran, Peder A. Olsen, "A-functions: a generalization of extended Baum-Welch transformations to convex optimization," ICASSP 2011, p. 5164-5167, Prague, Czech Republic, 22-27 May 2011.


PUBLICATIONS - Technical Report
[77]
George Saon and Peder Olsen, "A Lower Bound on the Euclidean Distance for Fast Nearest Neighbor Retrieval in High-dimensional Spaces," IBM Technical Report RC24859, New York, USA, 2009.
[78]
Peder A. Olsen, John Hershey, Steven Rennie and Vaibhava Goel, "A speech recognition solution to an ancient cryptography problem," IBM Technical Report RC25109, New York, USA, 2011.


PUBLICATIONS - Tutorial and Demos

[79]
John R. Heshey, Peder A. Olsen, Steven J. Rennie and Andy Aaron, ""Audio Alchemy: Getting Computers to Understand Overlapping Speech," Scientific American Online Article, April 2011.