KBOC16: Participant block

This algorithm is a legacy one. The API has changed since its implementation. New versions and forks will need to be updated.
This algorithm is splittable

Algorithms have at least one input and one output. All algorithm endpoints are organized in groups. Groups are used by the platform to indicate which inputs and outputs are synchronized together. The first group is automatically synchronized with the channel defined by the block in which the algorithm is deployed.

Group: probes

Endpoint Name Data Format Nature
keystroke system/kboc16_keystroke/1 Input
file system/uint64/1 Input
client_id system/text/1 Input
score_file robertodaza/competition_kboc16/1 Output

Group: templates

Endpoint Name Data Format Nature
features system/kboc16_keystroke/1 Input
id system/text/1 Input

Algorithms may use functions and classes declared in libraries. Here you can see the libraries and import names used by this library. You don't need to import the library manually on your code, the platform will do it for you. Just use the object as it has been imported with the selected named. For example, if you choose to import a library using the name lib, then access function f within your code like lib.f().

Library Import as
robertodaza/kboc16_baseline_matchers/5 kboc16_baseline_matchers
xxxxxxxxxx
127
 
1
# Keystroke Biometric Ongoing Competition (KBOC) is an official competition of 
2
# The IEEE Eighth International Conference on Biometrics: Theory, Applications, and Systems (BTAS 2016) 
3
# organized by ATVS Biometric Research Group
4
5
# Participant Block: this code (in Python) comprises the evaluation block of the KBOC16 competition.  
6
# The genuine and impostor samples are unknown except for the training samples (first 4 samples).
7
# In order to avoid overtifing of the systems and any possible misconduct, the performance evaluation is made over 100 of the 300 users.
8
# This first 100 users are representative of the complete set of 300 users. As an example, the difference between the performance of the baseline algorithms is less than 1%
9
# The evaluation over the 300 users will be done during the final weeks of the competition
10
# Together with this block, you can access the library kboc16_baseline_matchers (robertodaza/kboc16_baseline_matchers/5) with 3 baseline systems (see the examples below).
11
12
# HOW TO PARTICIPATE: participants can modify the code of this algorithm to include their keystroke recognition systems. It is allow the use of libraries and toolboxes out of the included in this example. The participant code could be private while its results should be available for the competition organizers (in order to include it in the final competition report).  
13
14
import numpy 
15
import bob  #This is a useful toolbox including the most popular machine learning and apttern recognition functions (see http://idiap.github.io/bob/ )
16
17
18
class Algorithm:
19
    def __init__(self):#Module where we define the globale  variables .
20
        self._enrollment_set = [] #We  define 1 list for the enrollment samples of one specific user
21
        self._templates=None #This variable will contain the training matrix with the enrollment sample sof all users
22
        
23
24
    def process(self, inputs, outputs):#We work with objects
25
        
26
        ###############################################################
27
        ###                    TRAINING                             ###
28
        ###############################################################
29
        
30
        if self._templates is None:#The first time entry here.
31
            self._templates = {}#It defines a dictionary
32
            group = inputs.groupOf('features')#we have a block with inputs of two different bases and are not synchronized. We have to make groups, with this fuction we open a group.                      
33
            
34
            while group.hasMoreData():#While the group has data, it runs.
35
                
36
                group.next()# Request new data group
37
                
38
                pre_features = group['features'].data.timestamps# In this example, we use only the 'timestamps'. Information about the key events can be obtained and used from group['features'].data.key events' 
39
                
40
                #It is necessary to manage the case insensitive samples.
41
                #some samples include mistakes in form of extra keys pressed. Those keys (mainly "shift" key) do not change the text but the number of keys pressed.               
42
                if numpy.size(self._enrollment_set,axis=0)==0:
43
                    l1=len(pre_features)#It is the length of a list 'pre_features'
44
                    feature_vector=[]
45
                    for i in range(1,l1):
46
                        feature_vector.append(float(pre_features[i]))#initial time value (time reference always equal to 0) is removed
47
                    self._enrollment_set.append(feature_vector)
48
                    
49
                else:#Case insensitive problem: the solution adopted in this example is to equalize the lengths in a very simplist way (let's improve it)
50
                    l2=len(pre_features)
51
                    feature_vector=[]
52
                    if l1<=l2:
53
                        for i in range(1,l1):
54
                            feature_vector.append(float(pre_features[i]))#we take the length of the first vector
55
                        self._enrollment_set.append(feature_vector)
56
                    else:
57
                        for i in range(1,l1):
58
                            if i<l2:
59
                                feature_vector.append(float(pre_features[i]))
60
                            else:
61
                                feature_vector.append(float(0))#Add zeros to have the same length
62
                        self._enrollment_set.append(feature_vector) 
63
                        
64
                        
65
                if inputs["id"].isDataUnitDone():#'id' is the number of users. Once all the enrollment samples of the user id are received, we save the enrollment set into the variable self._templates which contains the length of features and the enrollment set.  You can use this enrollment set for train your models.
66
                    
67
                    template_id = group['id'].data.text  # template_id will contain the number of user
68
                    self._templates[template_id] = dict(
69
                            f= l1,
70
                            enrollment_set=numpy.array(self._enrollment_set) 
71
                        )
72
                    self._enrollment_set= [] #When it ends, the variable is initialized
73
          
74
        
75
        
76
        ###################################################################            
77
        #### TEST(according the remaining twenty samples of each user) ####
78
        ###################################################################
79
        # The probe samples will be processed after all the templates samples 
80
        
81
        comparison_ids = inputs['client_id'].data.text  #This is the user number of the probe sample.
82
        file_number= str(inputs['file'].data.value) #file number of the probe sample
83
        data = inputs['keystroke'].data  #timestamps of the probe sample
84
        pre_features = data.timestamps ##These are the times of the test samples
85
        
86
        #Once again, it is necessary to manage the case insensitive samples             
87
        l2=len(pre_features)
88
        l1=self._templates[comparison_ids]['f']
89
        feature_vector=[]
90
        if l1<=l2:##Case insensitive problem: the solution adopted in this example is to equalize the lengths in a very simplist way (let's improve it)
91
            for i in range(1,l1):
92
                feature_vector.append(float(pre_features[i]))#initial time value (time reference always equal to 0) is removed
93
        else:
94
            for i in range(1,l1):
95
                if i<l2:
96
                    feature_vector.append(float(pre_features[i]))
97
                else:
98
                    feature_vector.append(float(0))##Add zeros to have the same length
99
                    
100
        probe_features=numpy.array(feature_vector)
101
        score_info = []
102
         
103
        # As a baseline, we include 3 matchers in the library kboc16_baseline_matchers. Here you can add you own matchers
104
        d=kboc16_baseline_matchers.classifier_manhattand_scaled_modified(self._templates[comparison_ids]['enrollment_set'],probe_features);
105
        #d=kboc16_baseline_matchers.classifier_knn_norm(self._templates[comparison_ids]['enrollment_set'],probe_features);
106
        #d=kboc16_baseline_matchers.classifier_knn_mahal(self._templates[comparison_ids]['enrollment_set'],probe_features);
107
            
108
        score = d*-1# score=-d is better than score=1/(d+0.001).In some cases (with large dynamic margin between scores), the inverse of the distance can be problematic.
109
        
110
        
111
        score_info.append({#'score' and  ' file_id' are  included in 'score_info'.
112
           'score': score,     
113
           'file_id': file_number,
114
               
115
        })
116
        outputs['score_file'].write({#Write  on the output 'score_info' and 'client_identity' which contain the user number.
117
                
118
               'client_identity': comparison_ids,
119
               'scores': score_info
120
            
121
            },
122
         )                        
123
                
124
            
125
                        
126
        return True
127

The code for this algorithm in Python
The ruler at 80 columns indicate suggested POSIX line breaks (for readability).
The editor will automatically enlarge to accomodate the entirety of your input
Use keyboard shortcuts for search/replace and faster editing. For example, use Ctrl-F (PC) or Cmd-F (Mac) to search through this box

Keystroke Biometric Ongoing Competition (KBOC) is an official competition of the IEEE Eighth International Conference on Biometrics: Theory, Applications, and Systems (BTAS 2016) organized by ATVS Biometric Research Group.

Participant Block: this code (in Python) comprises the evaluation block of the KBOC16 competition.

The genuine and impostor samples are unknown except for the training samples (first 4 samples). In order to avoid overtifing of the systems and any possible misconduct, the performance evaluation is made over 100 of the 300 users. This first 100 users are representative of the complete set of 300 users. As an example, the difference between the performance of the baseline algorithms is less than 1%. The evaluation over the 300 users will be done during the final weeks of the competition. Together with this block, you can access the library kboc16_baseline_matchers (robertodaza/kboc16_baseline_matchers/5) with 3 baseline systems (see the examples below).

HOW TO PARTICIPATE: participants can modify the code of this algorithm to include their keystroke recognition systems. It is allow the use of libraries and toolboxes out of the included in this example. The participant code could be private while its results should be available for the competition organizers (in order to include it in the final competition report).

Modified in the version 2:score=1/(d+0.001) by score=-d. In some cases (with large dynamic margin between scores), the inverse of the distance can be problematic.

No experiments are using this algorithm.
Created with Raphaël 2.1.2[compare]robertodaza/KBOC16-COMPETITION-BASELINE-COMBINED-MANHATTAN-MAHALANOBIS-DISTANCE/1robertodaza/KBOC16-COMPETITION-BASELINE-MAHALANOBIS-NEAREST_NEIGHBOR/1robertodaza/KBOC16_PARTICIPANT_BLOCK/1aythamimm/KBOC16_UAM_Tutorial_Example/1Feb13robertodaza/KBOC16_PARTICIPANT_BLOCK/22anjithgeorge/KBOC_IITKGP_ANJITH_APRIL7_1/14Mar8Alberto/proof_1/1Apr7IDRND_DSPlabs_/ksd_system/12016May5loginofdeath/kboc_neoeyed/12017Apr292018Aug21

This table shows the number of times this algorithm has been successfully run using the given environment. Note this does not provide sufficient information to evaluate if the algorithm will run when submitted to different conditions.

Terms of Service | Contact Information | BEAT platform version 2.2.1b0 | © Idiap Research Institute - 2013-2025