Samples for decoder-error based similarity matrices for Spanish, English and Basque are below.
The scores in our similarity matrix represent error-percentages from our phone-decoder, normalized into a 1-1000 range. A score of -500 i.e.,
1/2 × ( 0 – max( {ScoreRange} ) was stipulated for cases where no confusion had taken place for a phoneme-pair.
For instance, a score of 85 for phoneme pair [θ, f] in Spanish is computed from the observation that 8.5% of the occurences of /θ/ were decoded as /f/.
The error counts were obtained with HTK's HResults, when aligning the phonetic recognition and the G2P transcription for sequences of ca. 25000 phonemes in Spanish and ca. 12700 phonemes in English.
As tor the decoders, depending on the study we've used a monophone or a triphone decoder. The differences in the scoring matrices were not large when using either decoder.
Monophone decoder
It was trained with HTK. The acoustic models had three left-to-right emitting states using 32 Gaussian mixture components. The parametrization of the signal consisted of 18 Mel-Frequency Cepstral Coefficients plus the enery and their delta and delta-delta coefficients, using 16-bit PCM audios sampled at 16 KHz.
Triphone decoder
Cross-word triphone models were trained with HTK. The parametrization of the signal consisted of 18 Mel-Frequency Cepstral Coefficients plus the energy and their delta and delta-delta coefficients, using 16-bit PCM audios sampled at 16 KHz.
English Sample
IPA
æ
b
d
f
i:
k
n
p
r
s
θ
aj
æ
836
-500
-500
-500
2
-500
2
-500
-500
3
-500
-500
b
-500
684
21
-500
2
2
6
22
2
-500
35
-500
d
-500
38
870
8
-500
-500
4
11
2
3
105
-500
f
-500
-500
4
911
-500
2
-500
22
-500
5
35
-500
i:
1
-500
-500
-500
948
-500
-500
-500
6
-500
35
-500
k
1
-500
4
-500
2
967
2
11
4
-500
-500
-500
n
1
13
8
-500
4
-500
864
-500
-500
-500
-500
-500
p
-500
139
4
-500
-500
-500
-500
854
-500
-500
93
-500
r
-500
-500
-500
-500
2
-500
4
-500
887
-500
-500
-500
s
-500
-500
4
-500
2
-500
2
-500
-500
930
12
-500
θ
-500
-500
-500
-500
-500
-500
-500
-500
-500
-500
395
-500
aj
19
-500
-500
-500
-500
2
2
-500
4
-500
-500
932
aj
Spanish Sample
Basque Sample
IPA
a
b
d
f
i
k
n
r
s
s̪
a
932
1
1
2
1
1
4
5
-500
1
b
-500
533
17
4
-500
1
2
1
-500
1
d
1
35
602
2
-500
1
4
3
-500
1
f
1
2
1
806
1
7
-500
1
2
25
i
1
1
1
1
852
4
4
2
1
1
k
1
1
3
13
1
790
2
2
-500
3
n
1
3
5
1
3
5
726
9
1
2
r
2
5
5
1
1
1
8
873
1
1
s
1
1
1
1
1
1
1
4
918
53
s̪
-500
1
1
31
-500
3
1
2
21
803