Research

Speech technology: speaker diarisation, speaker linking, speech emotion recognition, machine learning

Tools

The segment F-measure is a new evaluation technique for speaker diarisation. It scores a system by matching hypothesised segments against reference segments and summarising the matches with the F-measure, which gives the user deeper insight into how well the hypothesised segments match the reference segments.

https://github.com/rosannamilner/segment-f-measure
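As a rough illustration of the idea, the sketch below computes precision, recall and F-measure over segment matches. It is not the code from the repository above: the matching rule (both boundaries within a tolerance), the one-to-one greedy matching, the `tolerance` parameter and the function name `segment_f_measure` are all assumptions made for this example, and speaker labels are ignored.

```python
from typing import List, Tuple

Segment = Tuple[float, float]  # (start, end) in seconds


def segment_f_measure(
    reference: List[Segment],
    hypothesis: List[Segment],
    tolerance: float = 0.1,  # assumed boundary tolerance in seconds
) -> Tuple[float, float, float]:
    """Return (precision, recall, F-measure) over matched segments.

    Assumption: a hypothesised segment matches a reference segment when
    both its start and its end lie within `tolerance` of the reference
    boundaries. Each reference segment can be matched at most once.
    """
    matched_refs = set()
    matches = 0
    for h_start, h_end in hypothesis:
        for i, (r_start, r_end) in enumerate(reference):
            if i in matched_refs:
                continue
            if abs(h_start - r_start) <= tolerance and abs(h_end - r_end) <= tolerance:
                matched_refs.add(i)
                matches += 1
                break

    precision = matches / len(hypothesis) if hypothesis else 0.0
    recall = matches / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Made-up example: the second reference segment is split in two by the
# hypothesis, so neither half counts as a match.
reference = [(0.0, 1.0), (1.5, 3.0), (4.0, 5.0)]
hypothesis = [(0.05, 1.0), (1.5, 2.0), (2.0, 3.0), (4.0, 5.05)]
precision, recall, f_measure = segment_f_measure(reference, hypothesis)
print(f"P={precision:.3f} R={recall:.3f} F={f_measure:.3f}")
```

In this made-up case two of the four hypothesised segments match two of the three reference segments, so over-segmentation lowers precision even though the total amount of speech found is the same.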

Data

The speaker diarisation reference for the NIST RT07 meeting data has been improved by manual re-segmentation. It is now accurate to within 0.1 seconds and provides speech segments with speaker labels for the complete audio files. The reference RTTM files can be downloaded below.

https://mini.dcs.shef.ac.uk/resources/dia-improvedrt07reference/
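For readers unfamiliar with RTTM, the following is a minimal sketch of reading speaker segments from such a file. It assumes the usual NIST layout of space-separated fields (type, file, channel, start time, duration, orthography, speaker type, speaker name, confidence); the exact contents of the downloadable files have not been checked here, and the file ID and speaker names in the sample lines are invented for illustration.

```python
from typing import Iterable, List, NamedTuple


class SpeakerTurn(NamedTuple):
    file_id: str
    start: float     # seconds
    duration: float  # seconds
    speaker: str


def parse_rttm(lines: Iterable[str]) -> List[SpeakerTurn]:
    """Collect SPEAKER records from RTTM lines, skipping everything else.

    Assumed field order (standard NIST RTTM):
    type file chnl tbeg tdur ortho stype name conf
    """
    turns = []
    for line in lines:
        fields = line.split()
        if len(fields) < 8 or fields[0] != "SPEAKER":
            continue  # comments, blank lines and other record types
        turns.append(
            SpeakerTurn(
                file_id=fields[1],
                start=float(fields[3]),
                duration=float(fields[4]),
                speaker=fields[7],
            )
        )
    return turns


# Invented sample lines; real files use the RT07 meeting IDs and labels.
sample = [
    "SPKR-INFO meeting01 1 <NA> <NA> <NA> unknown spk1 <NA>",
    "SPEAKER meeting01 1 0.50 2.25 <NA> <NA> spk1 <NA>",
    "SPEAKER meeting01 1 3.00 1.50 <NA> <NA> spk2 <NA>",
]
turns = parse_rttm(sample)
for turn in turns:
    print(turn.speaker, turn.start, turn.start + turn.duration)
```

To read a downloaded file, pass an open file handle to `parse_rttm` in place of the sample list.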

Publications