VoxForge Dataset¶
Dataset Description¶
VoxForge is a collection of voice recordings from various languages. The set that we
use in bob.bio.spear
is a part of the English VoxForge corpus. It contains:
Identities |
Sample count |
||
train |
10 |
3148 |
|
dev |
references |
10 |
1304 |
probes |
300 |
||
eval |
references |
10 |
1509 |
probes |
300 |
GMM¶
To run the baseline, use the following command:
bob bio pipeline simple -d voxforge -p gmm-default -g dev -g eval -l sge -o results/gmm_voxforge
Then, to generate the scores, use:
bob bio metrics -e ./results/gmm_voxforge/scores-{dev,eval}.csv
Development |
Evaluation |
|
---|---|---|
Failure to Acquire |
0.0% |
0.0% |
False Match Rate |
2.0% (54/2700) |
1.8% (48/2700) |
False Non Match Rate |
2.0% (6/300) |
1.3% (4/300) |
False Accept Rate |
2.0% |
1.8% |
False Reject Rate |
2.0% |
1.3% |
Half Total Error Rate |
2.0% |
1.6% |
On 128[1] CPU nodes on the SGE Grid: Ran in 13 minutes (5 minutes of training).
ISV¶
To run the baseline, use the following command:
bob bio pipeline simple -d voxforge -p isv-default -g dev -g eval -l sge -o results/isv_voxforge
Then, to generate the scores, use:
bob bio metrics -e ./results/isv_voxforge/scores-{dev,eval}.csv
Development |
Evaluation |
|
---|---|---|
Failure to Acquire |
0.0% |
0.0% |
False Match Rate |
1.3% (36/2700) |
0.7% (20/2700) |
False Non Match Rate |
1.3% (4/300) |
2.7% (8/300) |
False Accept Rate |
1.3% |
0.7% |
False Reject Rate |
1.3% |
2.7% |
Half Total Error Rate |
1.3% |
1.7% |
On 128[1] CPU nodes on the SGE Grid: Ran in 13 minutes (7 minutes of training).
I-Vector¶
To run the baseline, use the following command:
bob bio pipeline simple -d voxforge -p ivector-default -g dev -g eval -l sge -o results/ivector_voxforge
Then, to generate the scores, use:
bob bio metrics -e ./results/ivector_voxforge/scores-{dev,eval}.csv
Development |
Evaluation |
|
---|---|---|
Failure to Acquire |
0.0% |
0.0% |
False Match Rate |
4.3% (116/2700) |
6.9% (186/2700) |
False Non Match Rate |
4.3% (13/300) |
4.3% (13/300) |
False Accept Rate |
4.3% |
6.9% |
False Reject Rate |
4.3% |
4.3% |
Half Total Error Rate |
4.3% |
5.6% |
I-Vector PLDA¶
To run the baseline, use the following command:
bob bio pipeline simple -d voxforge -p ivector-plda -g dev -g eval -l sge -o results/ivector_plda_voxforge
Then, to generate the scores, use:
bob bio metrics -e ./results/ivector_plda_voxforge/scores-{dev,eval}.csv
Speechbrain ECAPA-TDNN¶
To run the baseline, use the following command:
bob bio pipeline simple -d voxforge -p speechbrain-ecapa-voxceleb -g dev -g eval -l sge -o results/speechbrain_voxforge
Then, to generate the scores, use:
bob bio metrics -e ./results/speechbrain_voxforge/scores-{dev,eval}.csv
Development |
Evaluation |
|
---|---|---|
Failure to Acquire |
0.0% |
0.0% |
False Match Rate |
0.0% (0/2700) |
0.8% (21/2700) |
False Non Match Rate |
0.0% (0/300) |
0.0% (0/300) |
False Accept Rate |
0.0% |
0.8% |
False Reject Rate |
0.0% |
0.0% |
Half Total Error Rate |
0.0% |
0.4% |
On 128[1] CPU nodes on the SGE Grid: Ran in 9 minutes (no training).
Footnotes