Vivek Tyagi, Applied Artificial Intelligence (AI) Research & Development

Dear Visitor, welcome to my homepage. I am a hands-on applied AI researcher with wide experience in Conversational Voice AI, Text AI, Natural Language Processing, and Image AI.

I partner with Product and Business leaders to understand their business problems and to develop highly accurate AI solutions in NLP, Speech Recognition, Conversational AI, and Image AI.

Given a business problem, I have the experience and expertise to solve it: I collaborate with the various stakeholders and lead and inspire a research engineering team, working hands-on, to develop the AI product solution.

Email contact: vivektyagiibm _at_ gmail _dot_ com

LinkedIn Profile:

Google Scholar Page:

Education

2006 PhD, School of Computer & Communication Sciences, Swiss Federal Institute of Technology, EPFL, Switzerland

Idiap Research Institute, Eurecom Research Institute

2001 B.Tech./B.S. Electrical Engineering, IIT Kanpur, India

Professional Experience

Please refer to my LinkedIn Page

Awards and Professional Recognition

Co-winner of the International Speech Communication Association (ISCA) Best Paper Award (2009) for the journal article “Automatic speech recognition and speech variability: A review”, Speech Communication, Volume 49, Issues 10-11, 2007.

IEEE Senior Member, Signal Processing Society.

Publications:

Recent (Text AI, Speech AI)

This is a public version of the slides approved for public release by LinkedIn Peer Review, with our absolute internal results redacted. My team and I have now successfully deployed 15 CNN Text spam classifiers in English and international (i18n) languages such as French, German, Dutch, Spanish, and Portuguese, which automatically detect and remove spam content from the LinkedIn Feed.
Business Impact: With our CNN Text classification technology, we delivered to our partner Engineering and Product teams a 30% relative recall gain at 90% precision over the Liblinear and FastText based text classifiers, which were the previous production classifiers. In other words, at 90% precision our CNN Text classifiers detect 30% more spam content than the previous Liblinear/FastText production classifiers.

Mohit Yadav, Vivek Tyagi, "Deep Triphone Embedding Improves Phoneme Recognition", arXiv:1710.07868, 2017
Vivek Tyagi, "HYBRID CONTEXT DEPENDENT CD-DNN-HMM KEYWORD SPOTTING (KWS) IN SPEECH CONVERSATIONS", To appear in Proc. of IEEE Workshop on Machine Learning For Signal Processing (IEEE MLSP 2016).
Vivek Tyagi, et al. "Xerox Conversational AI Agent (XCAI) for Enterprise Knowledgebase Q&A", IEEE ICASSP 2016, China, Show and Tell Session
Vivek Tyagi, "Fepstrum Features: Design and Application to Conversational Speech Recognition", IBM Research Report No. RI 11009, 6th June 2011

Journals

Conferences

Speech Recognition

1. V. Tyagi and C. Wellekens, “On Desensitizing the Mel-Cepstrum to Spurious Spectral Components for Robust Speech Recognition,” In the Proc. of IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP), March 2005, Philadelphia, USA.
2. V. Tyagi, I. McCowan, H. Bourlard, H. Misra, “Mel-Cepstrum Modulation Spectrum (MCMS) Features for Robust ASR,” In the Proc. of IEEE Automatic Speech Recognition and Understanding (ASRU), December 2003, St. Thomas, US Virgin Islands.
3. V. Tyagi, I. McCowan, H. Bourlard, H. Misra, “On Factorizing Spectral Dynamics for Robust Speech Recognition,” In the Proc. of EUROSPEECH, Sept. 2003, Geneva, Switzerland.
4. H. Misra, H. Bourlard, V. Tyagi, “Entropy-Based Multi-Stream Combination,” In the Proc. of IEEE International Conference on Audio, Speech, and Signal Processing (ICASSP), 2003, Hong Kong.

Biometrics/Traffic (IBM Smarter Planet Research Theme)

1. V. Tyagi, H. Prasad, T. Faruquie, L. V. Subramaniam, N. Ratha, “Fusing Biographical and Biometric Classifiers for Improved Person Identification”, To appear in the Proc. of IEEE International Conference on Pattern Recognition (ICPR), Nov. 2012, Japan.
2. V. Tyagi and N. Ratha, “Biometric Score Fusion Through Discriminative Training”, In Proc. of IEEE Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, 2011, Boulder, Colorado, USA.
3. B. Srivastava, T. Huan, W. Shang, U. Nambiar, V. Tyagi, S. Kalyanaraman, “Towards a Sustainable Services Ecosystem for Traffic Management,” In Proc. of Service Research and Innovation Institute (SRII) Global Conference, 2011, San Jose, USA.
4. J. Basak, K. Kate, V. Tyagi, N. Ratha, “A Gradient descent approach for multi-modal Biometric Identification”, In the Proc. of IEEE International Conference on Pattern Recognition (ICPR), 2010, Istanbul, Turkey.
5. K. Kate, J. Basak, N. Ratha, V. Tyagi, “QPLC: A novel multimodal biometrics score fusion method”, In the Proc. of IEEE Computer Vision and Pattern Recognition (CVPR) Biometrics Workshop, 2010, USA.

Patents

Vivek Tyagi, “DNN Hash Coding For Massive Scale Machine Learning”, Filed as patent application to USPTO.
Vivek Tyagi, N. Viladkar, Prathosh A. P., “Xerox Real Time Keyword Spotting System in Spoken Conversations”, Filed as patent application to USPTO.
Prathosh A. P., Vivek Tyagi, “Novel Harmonic Features for Emotion Recognition in Spoken Conversations”, Filed as patent application to USPTO.
Vivek Tyagi, Aravind Ganapathiraju, and Felix Immanuel Wyss. "Method and System for Selectively Biased Linear Discriminant Analysis in Automatic Speech Recognition Systems." U.S. Patent Application 13/974,123.
Vivek Tyagi, Aravind Ganapathiraju, and Felix Immanuel Wyss. "Method and system for acoustic data selection for training the parameters of an acoustic model." U.S. Patent Application 13/959,171.
Vivek Tyagi, Shivkumar Kalyanaraman, Biplav Srivastava. "Systems and methods for road acoustics and road video-feed based traffic estimation and prediction." U.S. Patent No. 8,723,690. 13 May 2014.
Ullas Balan Nambiar, Vivek Tyagi, et al. "Systems and methods for exploring and utilizing solutions to cyber-physical issues in a sandbox." U.S. Patent No. 8,781,798. 15 Jul. 2014.
“SYSTEM AND COMPUTER PROGRAM PRODUCT FOR PROTECTING AUDIO CONTENT”, Vivek Tyagi et al., USPTO Patent number: 7978853, Issued Jul 12, 2011.
“METHOD FOR PROTECTING AUDIO CONTENT”, Vivek Tyagi et al., USPTO Patent number: 7974411, Issued Jul 5, 2011.
“VEHICULAR TRAFFIC DENSITY ESTIMATION USING CUMULATIVE ROADSIDE ACOUSTICS”, Vivek Tyagi et al., US Patent Application # 20120188102, July 26, 2012.