Introduction

Figure 1: Real-time segmentation and classification (speech, music, silence).

Our goal was first to develop a system for segmenting the audio signal, and then to classify each segment into one of two main categories: speech or music.

Methodology

  • Audio signal segmentation is based on mean signal amplitude distribution, whereas classification utilizes an additional characteristic related to the frequency.

  • The classification algorithm may be used either in conjunction with the segmentation algorithm, in which case it verifies or refutes a music-speech or speech-music change, or autonomously, with given audio segments.

  • The basic characteristics (RMS and zero-crossings) are computed over 20 msec intervals, so the segment boundaries are located to within 20 msec.
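
The two basic characteristics named above can be illustrated with a minimal Python sketch. It is not the MATLAB code of [1]. The function names `frame_features` and `frame_to_seconds`, the use of non-overlapping frames, and the sign-change convention for counting zero-crossings are all assumptions made for illustration only.

```python
import math


def frame_features(samples, sample_rate, frame_ms=20):
    """Return one (rms, zero_crossings) tuple per non-overlapping frame.

    Hypothetical helper: the 20 msec frame length comes from the text,
    but the framing scheme and zero-crossing convention are assumptions.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    if frame_len <= 0:
        raise ValueError("frame is shorter than one sample")

    features = []
    # Step through the signal one full frame at a time; a trailing
    # partial frame is ignored.
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # Root mean square amplitude of the frame.
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        # Count sign changes between consecutive samples
        # (zero is treated as non-negative).
        zc = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        features.append((rms, zc))
    return features


def frame_to_seconds(frame_index, frame_ms=20):
    """Start time of a frame, i.e. the resolution of a segment boundary."""
    return frame_index * frame_ms / 1000.0
```

Because each feature pair describes one 20 msec frame, a change detected at frame index k corresponds to a boundary at `frame_to_seconds(k)`, which is where the 20 msec accuracy comes from.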

Downloads

    • You can download the MATLAB code of the method proposed in [1].

    • You can download the PPT presentation of [2].

Related Publications

[1] C. Panagiotakis and G. Tziritas, A speech/music discriminator based on RMS and zero-crossings, IEEE Transactions on Multimedia, Vol. 7, No. 1, Feb. 2005.

[2] C. Panagiotakis and G. Tziritas, A Speech/Music Discriminator using RMS and Zero-crossings, European Signal Processing Conference, 2002.