Investigates various Speech signal processing schemes for acoustic modeling so that more robust speech recognition can be achieved. Our aim is to perform the state-of-art research providing effective means for achieving:
Abnormal Acoustic Event Localization and Recognition
Admin 2021-04-08 👁️ 498
Abnormal acoustic event localization and detection system
Contents
Introduction
Main algorithm and principle
Application demo
1. Introduction
- This is multiple abnormal acoustic event localization and detection system using 5 microphones. There is a time delay corresponding to the direction of sound between microphones. We estimate the time delay and find the direction of the abnormal sounds. After finding the direction of sound, we use acoustic beamforming technique to separate the input signal into each direction, one by one. Then we classify the each of separated acoustic signals into one of the pre-defined abnormal events. Even when multiple acoustic events occurs simultaneously, we can recognize and localize each of them
2. Main algorithm and principle
- Relevant algorithms
Steered Response Power – Phase Transform
After computing Generalized Cross Correlation of microphone pairs, use minimum filter to find peaks of steered response power
The peaks are candidates for direction of acoustic event
Detect the directions of acoustic events by applying threshold to the candidates
Beamforming technique for source separation
Compensate the arrival time difference between input signals of each microphone. The arrival time difference is computed using detected direction of acoustic event. This step makes input signals of each microphone corresponding to the detected direction have same phase in time-frequency domain
By applying phase error based masking, enhance the input signal from the detected direction
Abnormal event classification
Train Gaussian mixture models with MFCC (Mel-frequency Cepstrum Coefficient) features of abnormal acoustic events (man yelling, woman screaming, baby crying, glass breaking, siren, skidding) and normal acoustic event (babble)
Extract MFCC feature of input signal and evaluate it using likelihood classification
- System structure
3. Application demo