
Phil Shinn's Resume


Software architect and speech scientist with over twenty years of experience in new product development and project management. Founder of the Voice User Interface Design Yahoo Group and principal designer of the Open Source VUID toolkit. Has expertise in linguistics, biometrics, human factors, IVR design and natural language processing:

•    Programming: Python, Java, C, C++, VB, LISP
•    Voice User Interface Design
•    Strategic planning, project estimation, project plan development
•    VXML application and grammar development
•    Custom recognition and TTS lexicon development
•    Speech & IVR application usability assessment
•    Speech recognition and TTS application tuning
•    Speech Application QA
•    RFI & RFP response authoring, SOW authoring
•    Patent development


Morgan Stanley Smith Barney: New York, NY, 04/2010-present
Call Center Engineering
Speech science, usability, UI design and front-end development for call center re-engineering of client-facing brokerage applications and internal-facing call routing applications.

IVR Design Group: New York, NY, 10/09-04/2010
Voice user interface design, speech application prototyping and development.

Genesys Telecommunication Laboratories: Daly City, CA (Seaside Park, NJ), 4/04-10/09
Principal Consultant, Professional Services Speech Group. 
Performed pre-sales and post-sales technical consulting and design and development of speech applications.

Countrywide Financial: Calabasas, CA, 1/03-4/04
VP Application Development. 
Designed self-service voice applications for mortgage insurance servicing, human resources call routing, privacy preferences setting, credit card activation and pay plan enrollment.

HeyAnita: Los Angeles, CA, 3/01-1/03
Director of Speech Science.
Managed team of linguists; created voice portals for Sprint PCS, Verizon Wireless and Vodafone. Developed requirements, call flows, grammars and dictionaries; performed testing and system tuning.

EnglishXchange: Los Angeles, CA, 1/99-1/06, Consultant building ESL tutoring software.

Hughes Research Labs: Malibu, CA, 9/00-3/01
Technical Consultant, Speech Understanding Research Lab
Developed voice-activated applications for corporate clients, including telematics for GM and air traffic control for Raytheon.

Text Analysis International: Laguna Beach, CA, 8/99-3/01. 
Worked on integrated development environment for creating text analysis applications, such as resume reading.

Indicast: Del Mar, CA, 9/99-1/00
Voice user interface designer and developer.

Unisys: Malvern, PA, 1/98-1/00
Technical Consultant
Performed technical pre- and post-sales support for Natural Language Speech Assistant product.

Echobuzz Software: Irvine, CA, 1/98-8/99
Vice President, Research & Development
Developed voice-activated voice mail.

Citigroup, Advanced Development Group: Los Angeles, CA, 90-98.
Systems Architect 96; Computer Scientist 95; Principal, Technical Staff 92; Technical Staff 90.
Worked in biometrics lab; Built patented multi-lingual IVR system; Analyzed & deployed NLP and translation technologies; Implemented ASR systems for consumer and corporate financial services, including auto-attendants & phone banking; Implemented voice input & output for FX trading, patent issued; Assessed and selected vendors of telecom technologies for trading floors; Designed SEI CMM SDM for R&D department.

LSI: Woodland Hills, CA, 90-95, 99.  Speech Consultant.
Developed speech-to-speech translation device for military intelligence.

Speech Systems: Tarzana, CA, 84-89. Director, Contract Research 86; Technical Staff 84.
Coded acoustic phonetic features, dictionaries, grammars and tools; Wrote proposals for and managed activity on government and privately sponsored contract R&D, including programs for DARPA, DOD, FAA, NASA, DOE, & DOT.

Language Systems: Providence, RI, 83-84.  Programmer. 
Worked on a grammar checker for word processing.  Code licensed to Houghton Mifflin and delivered in Microsoft Word.


Ph.D., Linguistics, Brown University, thesis in acoustics of speech.


•    US5794218: Automated multilingual interactive system.
•    US5806050: Electronic transaction terminal for vocalization of transactional data.
•    US6529586: System for secure telephonic transmission of audio data.
•    US6655585: System and method of biometric smart card user authentication.
•    US7376622: System for automated debiting and settling of financial transactions.

Selected Publications & Presentations

“The Impact of Voice Talent Persona Selection on Intelligibility for the Elderly,” SpeechTek, New York, 8/2009
“Improving Alphanumeric Recognition,” SpeechTek, New York, 8/2009

“DTMF and/or Speech,” SpeechTek, New York, 8/2008

“N-Best Skip Lists: A Practical Guide,” Proceedings of the Applied Voice Input Output Society, AVIOS, 1/2006, pp 100-108.

“Ten Criteria for Measuring Effective Voice User Interfaces,” with J. Larson et al., Speech Technology Magazine, 12/2005, www.speechtechmag.com/issues/10_6/cover/12612-1.html

“Call Center Automation with a Human Touch,” SpeechTek, New York, 8/2005

“Designing Language Models for Voice Portal Applications,” with M. Shomphe, M. Lewis, K. Carey and D. Kim, International Journal of Speech Technology, Kluwer Academic Publishers, Vol. 7, pp 93-99, 2004.

“Supporting Multiple Speech Technology Vendors: Challenges & Rewards,” Proceedings of the Applied Voice Input Output Society, San Jose, 4/2003.

“Architecture Eyes Network, Local Speech,” with A. Schwartz, EE Times, 4/2002, p. 68.

“Speaker Verification,” Proceedings of TVUI 99, TMA Associates, Phoenix, 1/1999

“A Continuous Speech Recognizer Using Two-Stage Encoder Neural Nets,” with W.S. Meisel, et al., Proc Int. Joint Conference on Neural Networks, Washington DC, pp 306-306, 1/1990

“Phoneme-based, Speaker Independent, Continuous Speech Recognition,” Presentation at the American Association for Artificial Intelligence Conference, Minneapolis, 8/1988.

“Methods for the Identification of Evoked Response Components in the Frequency and Combined Time/Frequency Domains,” with A. M. Norcia, T. Sato, and J. Mertus, Electroencephalography and Clinical Neurophysiology, 1986, 65, pp. 212-226.

“Limitations on Context-Conditioned Effects in the Perception of [b] and [w],” with S. E. Blumstein and A. Jongman, Perception and Psychophysics, 1985, 38(5), pp. 397-407.
A Cross-language Investigation of the Stop, Affricate and Fricative Manners of Articulation, University Microfilms, Ann Arbor, MI, 1984 (Ph.D. thesis).
“On the Role of the Amplitude Envelope for the Perception of [b] and [w],” with S. E. Blumstein, Journal of the Acoustical Society of America, 1984, Vol. 75, No. 4, pp. 1243-1252.
“Phonetic Disintegration in Aphasia: Acoustic Analysis of Spectral Characteristics for Place of Articulation,” with S. E. Blumstein, Brain and Language, 1983, Vol. 20, pp. 90-114.