Changelog

Version 4, 30/Jan/2018:
- Port to beat.backend.python v1.5.0
Version 3, 30/Oct/2017:
- Port to beat.backend.python v1.4.2
Version 2, 20/Jan/2016:
- Port to Bob v2
Version 1, 01/Apr/2014:
- Initial release

Description

This database was designed to provide data that was recorded in a natural way, using various devices in different environments. Hence, algorithms that perform well on this database are expected to be suitable for other real-world applications that do not require a predefined audio/video recording setup.

Database participants were selected among employees of CPqD Foundation who volunteered to make recordings. A unique ID was assigned to each participant, composed of a prefix (M for male and F for female) followed by a 4-digit number (odd for males and even for females). Each participant recorded up to five sessions, with a time lapse of at least 10 days between sessions.
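The ID convention described above can be checked mechanically. The following is a minimal sketch, assuming the ID is written as the prefix letter immediately followed by the four digits (for example `M0001`); the function name `parse_participant_id` is hypothetical and not part of the database tooling.

```python
import re

# Assumed textual form: one prefix letter (M or F) followed by exactly four digits.
_ID_PATTERN = re.compile(r"([MF])(\d{4})")


def parse_participant_id(participant_id):
    """Return (gender, number) for a consistent ID, or raise ValueError."""
    match = _ID_PATTERN.fullmatch(participant_id)
    if match is None:
        raise ValueError(f"malformed participant ID: {participant_id!r}")
    prefix, number = match.group(1), int(match.group(2))
    # Per the description: odd numbers for males, even numbers for females.
    expected_prefix = "M" if number % 2 == 1 else "F"
    if prefix != expected_prefix:
        raise ValueError(f"prefix and number parity disagree: {participant_id!r}")
    return ("male" if prefix == "M" else "female"), number
```

Checking both the prefix and the parity catches transcription errors in either part of the ID.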

Sessions consisted of 27 recorded sentences, whose content was specified in a script. Each sentence was recorded on three different device types:

Laptops (audio and video content);
Smartphones (audio and video content);
Phone calls (only audio).

For each device type, a set of devices was used, as specified below:

Laptops:
- Compaq 510 with embedded mic and camera;
- Toshiba with USB Logitech QuickCam Pro 9000 webcam;
- DELL Latitude with embedded mic and camera.

Smartphones:
- Samsung Galaxy S II;
- Apple iPhone 4;
- Apple iPhone 4.

Phone calls:
- landline phone call;
- personal mobile phone call.

Recordings were made in three environments with different characteristics: garden, restaurant (public indoor) and office. The idea behind this strategy was to explore the influence of environmental noise on the audio recordings and the effect of illumination and background conditions on the video recordings. Since the database includes recordings captured on different devices of different types and in different environments, it allows a large number of experimental setups.

Content

The data collection followed a simple recording protocol that was replicated for all sessions. For each session there was a corresponding script describing the whole content to be recorded, as follows:

Text reading:

a pre-defined text (extracted from the database's consent form);
four phonetically rich sentences (randomly selected among 562 options);
passphrase: three repetitions of a single sentence (the same sentence for all participants in all sessions).

Spontaneous speech:

answers to generic questions (all participants answered all 15 questions selected from a fixed set, distributed across the 5 sessions in random order);
a fake name;
a fake address;
a fake birth date;
a fake ID number;
a fake phone number;
two command words (all participants spoke 10 words across the 5 sessions in random order).

Numbers, digits, time values and alphanumeric strings:

a monetary amount between 10 and 10 000, randomly generated;
a number between 10 and 1000, randomly generated;
a number between 1000 and 10 million, randomly generated;
three repetitions of a random digit sequence (the first one read at a slow pace and the others read naturally);
a fake credit card number;
an alphanumeric string composed of 6 characters, randomly generated;
a time value, selected among a predefined set with 181 samples, equally distributed among participants.
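As an illustration of the randomly generated items in the list above, here is a minimal sketch of how such values could be drawn for one session script. It is not the tool used to build the database: the function name `draw_session_values`, the inclusive bounds, the integer monetary amount, and the alphabet (uppercase letters and digits) are all assumptions.

```python
import random
import string

# Assumption: the alphanumeric string uses uppercase ASCII letters and digits.
ALPHABET = string.ascii_uppercase + string.digits


def draw_session_values(rng):
    """Draw the randomly generated script items for one session."""
    return {
        # Monetary amount between 10 and 10 000 (assumed integer, inclusive).
        "monetary_amount": rng.randint(10, 10_000),
        # Number between 10 and 1000.
        "small_number": rng.randint(10, 1000),
        # Number between 1000 and 10 million.
        "large_number": rng.randint(1000, 10_000_000),
        # Alphanumeric string composed of 6 characters.
        "alphanumeric": "".join(rng.choice(ALPHABET) for _ in range(6)),
    }


# Passing a seeded generator makes a script reproducible.
values = draw_session_values(random.Random(0))
```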

It is important to note that all content was recorded in Brazilian Portuguese.

The BioCPqD Phase I database provides unbiased biometric verification protocols, one for male and one for female participants, based on the MOBIO database protocols. These protocols partition the database into three different groups:

a Training set: used to train the parameters of the algorithm to be tested, e.g., to create the projection matrix, Universal Background Models, etc.;
a Development set: used to evaluate hyper-parameters of the tested algorithms;
a Test set: used to evaluate the generalization performance of the tested algorithms with previously unseen data.

Both development and test sets are further split into an enrollment subset (used to enroll participants' models) and a probe subset (whose files are tested against all participants' models).
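The enrollment/probe split implies an exhaustive comparison: every probe file is scored against every enrolled model. A minimal sketch of that loop follows; the names `score_probes` and `compare`, and the toy one-dimensional "models", are hypothetical and only illustrate the evaluation pattern, not any particular algorithm.

```python
def score_probes(models, probes, compare):
    """Score each probe against all enrolled models.

    models:  dict mapping client_id -> enrolled model
    probes:  list of (client_id, sample) pairs
    compare: function (model, sample) -> similarity score
    Returns a list of (probe_client, model_client, score, is_genuine).
    """
    results = []
    for probe_client, sample in probes:
        for model_client, model in models.items():
            score = compare(model, sample)
            # A trial is genuine when probe and model belong to the same client.
            results.append((probe_client, model_client, score,
                            probe_client == model_client))
    return results


# Toy usage: models and samples are plain numbers, similarity is negative distance.
toy_models = {1: 0.0, 2: 10.0}
toy_probes = [(1, 0.5), (2, 9.0)]
toy_results = score_probes(toy_models, toy_probes, lambda m, s: -abs(m - s))
```

With two models and two probes this yields four trials, two genuine and two impostor, which is the raw material for error rates on the development and test sets.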

Outputs of the train set:

Output name	Data format
image:	system/array_3d_uint8/1
eye_centers:	system/eye_positions/1 (The position of the eyes of a person in an image)
file_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
client_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)

Outputs of the dev_templates set:

Output name	Data format
image:	system/array_3d_uint8/1
eye_centers:	system/eye_positions/1 (The position of the eyes of a person in an image)
file_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
client_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
template_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)

Outputs of the dev_probes set:

Output name	Data format
image:	system/array_3d_uint8/1
probe_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
file_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
client_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
eye_centers:	system/eye_positions/1 (The position of the eyes of a person in an image)
template_ids:	system/array_1d_uint64/1 (Basic format containing a one-dimensional array of unsigned integer (64 bits) values)

Outputs of the test_templates set:

Output name	Data format
image:	system/array_3d_uint8/1
eye_centers:	system/eye_positions/1 (The position of the eyes of a person in an image)
file_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
client_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
template_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)

Outputs of the test_probes set:

Output name	Data format
image:	system/array_3d_uint8/1
probe_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
file_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
client_id:	system/uint64/1 (Basic format containing only a single unsigned integer (64 bits) value)
eye_centers:	system/eye_positions/1 (The position of the eyes of a person in an image)
template_ids:	system/array_1d_uint64/1 (Basic format containing a one-dimensional array of unsigned integer (64 bits) values)
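The output tables above differ only in a few fields: template sets add `template_id`, while probe sets add `probe_id` and `template_ids`. A minimal sketch of a consistency check over those output names follows; the grouping keys (`train`, `templates`, `probes`) and the helper names are hypothetical, and records are represented as plain dictionaries purely for illustration.

```python
# Output names shared by every table above.
COMMON_OUTPUTS = {"image", "eye_centers", "file_id", "client_id"}

# Hypothetical grouping of the tables by the kind of set they describe.
EXPECTED_OUTPUTS = {
    "train": COMMON_OUTPUTS,
    "templates": COMMON_OUTPUTS | {"template_id"},
    "probes": COMMON_OUTPUTS | {"probe_id", "template_ids"},
}

UINT64_MAX = 2**64 - 1


def missing_outputs(kind, record):
    """Return the sorted output names that `record` lacks for this kind of set."""
    return sorted(EXPECTED_OUTPUTS[kind] - set(record))


def is_uint64(value):
    """True if `value` is an integer representable as an unsigned 64-bit value."""
    return isinstance(value, int) and not isinstance(value, bool) \
        and 0 <= value <= UINT64_MAX
```

For example, a record carrying only the four common outputs is complete for a train set but lacks `template_id` when treated as a template.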

l2s_female (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
l2s_male (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
laptop_female (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
laptop_male (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
s2l_female (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
s2l_male (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
smartphone_female (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes
smartphone_male (advanced_face_recognition), 5 sets: train, dev_templates, dev_probes, test_templates, test_probes

This database is Public
