Costas Panagiotakis Homepage

Audio Signal Segmentation and Classification

Introduction

Figure 1: Real-time segmentation and classification (speech, music, silence).

Our goal was first to develop a system that segments the audio signal, and then to classify each segment into one of two main categories: speech or music.

Methodology

  • Audio signal segmentation is based on mean signal amplitude distribution, whereas classification utilizes an additional characteristic related to the frequency.

  • The classification algorithm may be used either in conjunction with the segmentation algorithm, in which case it verifies or refutes a music-speech or speech-music change, or autonomously, with given audio segments.

  • The basic features (RMS and zero-crossings) are computed over 20 ms intervals, so segment boundaries are located to within 20 ms.
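
The per-interval feature computation described above can be sketched as follows. This is a minimal illustration in Python, not the authors' MATLAB code: the function name `frame_features`, the use of non-overlapping frames, the sign-change definition of a zero crossing, and the synthetic 440 Hz test tone are all assumptions made for the example.

```python
import math

def frame_features(signal, sample_rate, frame_ms=20):
    """Return one (rms, zero_crossings) pair per non-overlapping frame.

    Assumption: frames do not overlap, and a trailing partial frame is dropped.
    """
    frame_len = int(sample_rate * frame_ms / 1000)
    if frame_len <= 0:
        raise ValueError("frame is shorter than one sample at this sample rate")
    features = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        # Root mean square amplitude of the frame.
        rms = math.sqrt(sum(x * x for x in frame) / frame_len)
        # Zero crossings: sign changes between consecutive samples in the frame.
        zc = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        features.append((rms, zc))
    return features

# Hypothetical input: one second of a 440 Hz sine sampled at 8 kHz.
sample_rate = 8000
tone = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]
feats = frame_features(tone, sample_rate)
print(len(feats))  # 50 frames of 20 ms each
```

Because each frame yields exactly one feature pair, any change point derived from these features can only be placed on a frame boundary, which is why the boundaries are resolved to 20 ms.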

Downloads

    • You can download the MATLAB code of the method proposed in [1].

    • You can download the PowerPoint presentation of [2].

Related Publications

[1] C. Panagiotakis and G. Tziritas, A speech/music discriminator based on RMS and zero-crossings, IEEE Transactions on Multimedia, Vol. 7, No. 1, Feb. 2005.

[2] C. Panagiotakis and G. Tziritas, A Speech/Music Discriminator using RMS and Zero-crossings, European Signal Processing Conference, 2002.

© 2014-24 Costas Panagiotakis
