Home

Deep Learning. R&D and product ideation on conversational voice AI agents, bringing together deep-learning-based speech recognition and natural language understanding systems.




Contact: vivektyagiibm _at_ gmail _dot_ com

Google Scholar Page

Education

Professional Experience
Please refer to my LinkedIn Page

Recent


V. Tyagi, H. Prasad, T. Faruquie, L. V. Subramaniam, N. Ratha, “Fusing Biographical and Biometric Classifiers for Improved Person Identification”, to appear in the Proc. of IEEE International Conference on Pattern Recognition (ICPR), Nov. 2012, Japan.

V. Tyagi and N. Ratha, “Biometric Score Fusion Through Discriminative Training”, In Proc. of IEEE Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, 2011, Boulder, Colorado, USA.


Vivek Tyagi, "Fepstrum Features: Design and Application to Conversational Speech Recognition", IBM Research Report No. RI 11009, 6th June 2011

B. Srivastava, T. Huan, W. Shang, U. Nambiar, V. Tyagi, S. Kalyanaraman, “Towards a Sustainable Services Ecosystem for Traffic Management,” In Proc. of Service Research and Innovation Institute (SRII) Global Conference, 2011, San Jose, USA.

J. Basak, K. Kate, V. Tyagi, N. Ratha, “A Gradient descent approach for multi-modal Biometric Identification”, In the Proc. of IEEE International Conference on Pattern Recognition (ICPR), 2010, Istanbul, Turkey.

K. Kate, J. Basak, N. Ratha, V. Tyagi, “QPLC: A novel multimodal biometrics score fusion method”, In the Proc. of IEEE Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, 2010, USA.

Vivek Tyagi, Shivkumar Kalyanaraman, Raghu Krishnapuram, "Vehicular Traffic Density State Estimation Based on Cumulative Road Acoustics", to appear in IEEE Trans. on Intelligent Transportation Systems, 2011.

Vivek Tyagi, Herve Bourlard, Christian Wellekens, "On variable-scale piecewise stationary spectral analysis of speech signals for ASR", Speech Communication, Vol. 48 (2006), pages 1182–1191.

Vivek Tyagi, Christian Wellekens, Dirk Slock, "Least squares filtering of speech signals for robust ASR", Speech Communication, Vol. 48 (2006), pages 1528–1544.

M. Benzeghiba, R. De Mori, O. Deroo, S. Dupont, T. Erbes, D. Jouvet, L. Fissore, P. Laface, A. Mertins, C. Ris, R. Rose, V. Tyagi, C. Wellekens, "Automatic speech recognition and speech variability: A review", Speech Communication, Vol. 49 (2007), pages 763–786.
Conferences

Speech Recognition

V. Tyagi, “Tandem Processing of Fepstrum Features,” In the Proc. of Interspeech, 2008, Brisbane, Australia.


V. Tyagi, “Maximum Accept and Reject (MARS) training of HMM-GMM speech recognition systems,” In the Proc. of Interspeech, 2008, Brisbane, Australia.

V. Tyagi, “Fepstrum: An improved modulation spectrum for ASR,” In the Proc. of Interspeech, 2007, Antwerp, Belgium.

V. Tyagi and C. Wellekens, “Fepstrum and Carrier Signal decomposition of Speech Signals through Homomorphic Filtering,” In the special session, “Dealing with intrinsic speech variabilities in ASR”, IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP), 2006, Toulouse, France.

V. Tyagi and C. Wellekens, “Fepstrum Representation of Speech,” In the Proc. of IEEE Automatic Speech Recognition and Understanding (ASRU), November 2005, Puerto Rico, USA.

V. Tyagi, H. Bourlard and C. Wellekens, “On Variable-Scale Piecewise Stationary Spectral Analysis of Speech Signals for ASR,” In the Proc. of Eurospeech, September 2005, Lisbon, Portugal.


V. Tyagi and C. Wellekens, “On Desensitizing the Mel-Cepstrum to Spurious Spectral Components for Robust Speech Recognition,” In the Proc. of IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP), March 2005, Philadelphia, USA.

V. Tyagi, I. McCowan, H. Bourlard, H. Misra, “Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR,” In the Proc. of IEEE Automatic Speech Recognition and Understanding (ASRU), December 2003, St. Thomas, US Virgin Islands.

V. Tyagi, I. McCowan, H. Bourlard, H. Misra, “On Factorizing Spectral Dynamics for Robust Speech Recognition,” In the Proc. of Eurospeech, Sept. 2003, Geneva, Switzerland.

H. Misra, H. Bourlard, V. Tyagi, “Entropy-Based Multi-Stream Combination,” In the Proc. of IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP), 2003, Hong Kong.



Biometrics/Traffic (IBM Smarter Planet Research Theme)

V. Tyagi, H. Prasad, T. Faruquie, L. V. Subramaniam, N. Ratha, “Fusing Biographical and Biometric Classifiers for Improved Person Identification”, to appear in the Proc. of IEEE International Conference on Pattern Recognition (ICPR), Nov. 2012, Japan.

V. Tyagi and N. Ratha, “Biometric Score Fusion Through Discriminative Training”, In Proc. of IEEE Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, 2011, Boulder, Colorado, USA.

B. Srivastava, T. Huan, W. Shang, U. Nambiar, V. Tyagi, S. Kalyanaraman, “Towards a Sustainable Services Ecosystem for Traffic Management,” In Proc. of Service Research and Innovation Institute (SRII) Global Conference, 2011, San Jose, USA.

J. Basak, K. Kate, V. Tyagi, N. Ratha, “A Gradient descent approach for multi-modal Biometric Identification”, In the Proc. of IEEE International Conference on Pattern Recognition (ICPR), 2010, Istanbul, Turkey.

K. Kate, J. Basak, N. Ratha, V. Tyagi, “QPLC: A novel multimodal biometrics score fusion method”, In the Proc. of IEEE Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, 2010, USA.

Patents
Vivek Tyagi, "DNN Hash Coding For Massive Scale Machine Learning", Filed as patent application to USPTO.

Vivek Tyagi, N. Viladkar, Prathosh A. P., "Xerox Real Time Keyword Spotting System in Spoken Conversations", Filed as patent application to USPTO.
 
Prathosh A. P., Vivek Tyagi, "Novel Harmonic Features for Emotion Recognition in Spoken Conversations", Filed as patent application to USPTO.

Vivek Tyagi, Aravind Ganapathiraju, and Felix Immanuel Wyss. "Method and System for Selectively Biased Linear Discriminant Analysis in Automatic Speech Recognition Systems." U.S. Patent Application 13/974,123.
 
Vivek Tyagi, Aravind Ganapathiraju, and Felix Immanuel Wyss. "Method and system for acoustic data selection for training the parameters of an acoustic model." U.S. Patent Application 13/959,171.

Vivek Tyagi, Shivkumar Kalyanaraman, Biplav Srivastava. "Systems and methods for road acoustics and road video-feed based traffic estimation and prediction." U.S. Patent No. 8,723,690. 13 May 2014.

Nambiar, Ullas Balan, Vivek Tyagi et al. "Systems and methods for exploring and utilizing solutions to cyber-physical issues in a sandbox." U.S. Patent No. 8,781,798. 15 Jul. 2014.

“SYSTEM AND COMPUTER PROGRAM PRODUCT FOR PROTECTING AUDIO CONTENT”, Vivek Tyagi et al., USPTO Patent number: 7978853, Issued Jul 12, 2011.

“METHOD FOR PROTECTING AUDIO CONTENT”, Vivek Tyagi et al., USPTO Patent number: 7974411, Issued Jul 5, 2011.

“VEHICULAR TRAFFIC DENSITY ESTIMATION USING CUMULATIVE ROADSIDE ACOUSTICS”, Vivek Tyagi et al., US Patent Application # 20120188102, July 26, 2012.