Costas Panagiotakis Homepage - SoundSegmentationClassification

Audio Signal Segmentation and Classification

Figure 1: Real time Segmentation and Classification (Speech,Music,Silence) .

Our goal was to rst develop a system for segmentation of the audio signal, and then classification into one of two main categories: speech or music .

Audio signal segmentation is based on mean signal amplitude distribution, whereas classification utilizes an additional characteristic related to the frequency.
The classification algorithm may be used either in conjunction with the segmentation algorithm, in which case it verifies or refutes a music-speech or speech-music change, or autonomously, with given audio segments.
The basic characteristics (RMS and Zero Crossings) are computed in 20 msec intervals, resulting in the segments' limits being specified within an accuracy of 20 msec.

- You can download the matlab code of the proposed in [1].
- You can download the ppt presentation of [2] .

[1].C. Panagiotakis and G. Tziritas, A speech/music discriminator based on RMS and zero-crossings, IEEE Transactions on Multimedia, Vol. 7, No. 1, Feb. 2005.

[2] C. Panagiotakis and G. Tziritas, A Speech/Music Discriminator using RMS and Zero-crossings, European Signal Processing Conference, 2002.