Software

Idiap has a successful track record of dissemination of data and software that are widely adopted, and has a longstanding tradition of delivering research outcomes through open-source licenses and making frameworks accessible to others. This commitment extends beyond mere software contributions. Idiap actively participates in numerous projects, ranging from small-scale initiatives to high-profile endeavors. To facilitate the sharing of improvements made by Idiap's researchers with the wider community, the institute has entered into various corporate contributor license agreements (CLA) or equivalent arrangements with The Qt Company, The Python Project, The Cloud Native Computing Foundation, The Cloud Foundry Foundation, and Google. Thanks to these CLAs, Idiap employees are able to contribute to large-scale open-source projects.

Idiap creates and makes available a significant number of professional software. Over the past five years, Idiap has filed 206 software disclosures, which enabled the distribution of 122 open source software packages and granted 84 commercial licenses on patents and software.

Examples of software packages created by Idiap are Fast Transformers, PyDHN, Kaldi and Bob. The Github project Fast Transformers, which has over 1400 stars and over 160 forks, was used in Muzic, a Microsoft project for music understanding and generation. The open-source package PyDHN for the physics-based simulation of district heating networks, computes pressure, temperature, and mass flow within the pipework knowing only the boundary conditions at the central(s) and sub-stations; it is included in open-source GIS tools for a wider impact on society. The Kaldi open source tool for speech modeling, which is considered one of the main technologies in the community for research and innovation, received 6781 citations. Bob, a set of open-source tools to promote reproducible research, has 25 releases, over 100 satellite repositories and over 5000 commits for the core repository.

Idiap contributes to the research community open-source software as listed below. If you have questions about a specific software, please use our contact form.

Filter :

219 rows visible

Name	Description	Date
nvr_transformers	Nonparametric Variational Regularisation of Pretrained Transformers	2025-04-11
knn-tts	kNN Retrieval for Simple and Effective Zero-Shot Multi-speaker Text-to-Speech	2025-04-03
code.iclr2025_hyperface	HyperFace: Generating Synthetic Face Recognition Datasets by Exploring Face Embedding Hypersphere	2025-02-27
geomgaze	ChildPlay: A New Benchmark for Understanding Children's Gaze Behaviour	2025-02-13
gafro_ros2	This repository is part of the gafro project. It provides an interface to ROS2 and enables the visualization of geometric primitives and robots using the gafro library.	2025-01-31
linearize-distill-pretrained-transformers	This repository contains the reference code for the paper Joint Fine-tuning and Conversion of Pretrained Speech and Language Models towards Linear Complexity, accepted by ICLR2025.	2025-01-27
demo_opt	This repository contains a demo application for Optical Projection Tomography.	2025-01-14
bob.paper.wacv2025_chatgpt_face_pad	Code for the paper: Exploring ChatGPT for Face Presentation Attack Detection in Zero and Few-Shot in-Context Learning	2025-01-09
ssl-human-animal	Comparing Self-Supervised Learning Models Pre-Trained on Human Speech and Animal Vocalizations for Bioacoustics Processing	2025-01-07
cdf	Code for paper "Configuration Space Distance Fields for Manipulation Planning" (RSS 2024).	2024-12-20
arduino_pytwister	This Python library allows interacting with a stepper motor that rotates a stage. The stepper motor is connected to a Sparkfun driver which is itself connected to an Arduino microcontroller.	2024-12-18
LogicLfD	This is a python implementation of our paper "Logic-LfD: Logic Learning from Demonstrations for Multi-step Manipulation Tasks in Dynamic Environments", published in IEEE RA-L 2024.	2024-12-16
dialog2flow	Dialog2Flow: Convert Your Dialogs to Flows!	2024-11-08
robust_pl	Robust Manipulation Primitive Learning via Domain Contraction (CoRL 2024).	2024-11-06
policy-interpretations	This repository contains the code and links to the data and trained models for the paper Generating Interpretations of Policy Announcements presented at the NLP4DH workshop at EMNLP 2024.	2024-11-05
sharingan	A Transformer Architecture for Multi-Person Gaze Following.	2024-11-05
ccdbhg-head-gesture-recognition	CCDbHG Head Gesture Dataset.	2024-10-31
identifying-privacy-personas	Identifying Privacy Personas.	2024-09-27
RDF	Code for paper "Learning Robot Geometry as Distance Fields: Applications to Whole-body Manipulation".	2024-09-04
vilora	A Bayesian Interpretation of Adaptive Low-Rank Adaptation.	2024-09-03
sparse	SPARSE: Spiking Architectures towards Realistic Speech Encoding.	2024-08-22
itm	Image-guided topic modeling for interpretable privacy classification.	2024-08-21
bob.paper.ijcb2024_agnostic_features_mad	Morphing attack detection using attack-agnostic features.	2024-07-31
morphgen	Face morphing attack generation.	2024-07-31
bob.paper.ijcb2023_face_ti	Inversion of Deep Facial Templates using Synthetic Data.	2024-07-30
bob.paper.tbiom2024_face_ti	Template Inversion Attack Using Synthetic Face Images Against Real Face Recognition Systems.	2024-07-30
bob.paper.tifs2024_model_pairing	Model Pairing Using Embedding Translation for Backdoor Attack Detection on Open-Set Classification Tasks.	2024-07-30
bob.paper.fg2024_breaking_btp	Breaking Template Protection: Reconstruction of Face Images from Protected Facial Templates.	2024-07-29
speech-utility-bioacoustics	On the Utility of Speech and Audio Foundation Models for Marmoset Call Analysis.	2024-07-23
bob.paper.deft_ijcb2024	Demographic Fairness Transformer for Bias Mitigation in Face Recognition.	2024-06-22
sigma-gpt	σ-GPT: A New Approach to Autoregressive Models.	2024-06-21
Factual-Reporting-and-Political-Bias-Web-Interactions	Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions.	2024-06-20
News-Media-Reliability	Reliability Estimation of News Media Sources: "Birds of a Feather Flock Together"	2024-06-19
analogy_learning	Can language models learn analogical reasoning? Investigating training objectives and comparisons to human performance.	2024-06-18
ppsdf	Piecewise polynomial SDF(ppSDF) Supplementary code examples for Online learning of Continuous Signed Distance Fields Using Piecewise Polynomials. Contains code demos of continuous learning of a basis-function SDF representation from point cloud data, sampled from mesh files. Also contains a script for downloading the YCB dataset examples. Based on the paper: Online Learning of Continuous Signed Distance Fields Using Piecewise Polynomials, Ante Marić, Yiming Li and Sylvain Calinon, in: IEEE Robotics and Automation Letters (RA-L), 2024	2024-06-18
bob.paper.ijcb2024_moe_hfr	Modality Agnostic Heterogeneous Face Recognition with Switch Style Modulators.	2024-06-17
bob.paper.tifs2024_face_ti	Vulnerability of State-of-the-Art Face Recognition Models to Template Inversion Attack.	2024-05-17
bob.paper.vrbiom_pad_ijcb2024	Assessing the Reliability of Biometric Authentication on Virtual Reality Devices.	2024-05-01
inference-from-real-world-sparse-measurements	Inference from Real-World Sparse Measurements - MALAT.	2024-04-10
bob.paper.sensl2023_hires_codedaper	Toward High-Resolution Face Image Generation From Coded Aperture Camera.	2024-04-01
pygafro	Geometric Algebra For RObotics in Python.	2024-03-15
code.group_membership_verification	Group Membership Verification via Nonlinear Sparsifying Transform Learning.	2024-03-14
gafro_examples	This repository contains some examples on how to use the gafro library.	2024-03-13
gafro_benchmarks	This repository contains benchmarks for the gafro library compared to other robot kinematics and dynamics libraries.	2024-03-07
gafro_robot_descriptions	This repository contains classes of different robot descriptions for the usage with the gafro library.	2024-03-07
gafro_ros	ROS visualization and URDF conversion for the gafro library.	2024-03-07
bayesian-peft	Bayesian Parameter-Efficient Fine-Tuning for Overcoming Catastrophic Forgetting.	2024-02-23
TactileErgodicExploration	A Python package for ergodic control on point cloud using diffusion. It is supplementary material for the paper "Tactile Ergodic Control Using Diffusion and Geometric Algebra." The package uses Laplacian eigenbasis for computing the potential field resulting from the diffusion on the point cloud. Then, it uses the heat-equation-driven area coverage (HEDAC) method to guide the exploration agents for tactile ergodic control tasks. This research is conducted at the Robot Learning and Interaction group of the Idiap Research Institute.	2024-02-09
bob.paper.neurips2023_face_ti	Face Reconstruction from Facial Templates by Learning Latent Space of a Generator Network.	2024-02-06
ergodic_sketching_ros	This repository contains the source code to run the drozBot portraitist robot over ROS1. It contains 3 different packages: ergodic_sketching; ergodic_sketching_msgs; ergodic_sketching_ros; ilqr_planner.	2024-01-19
bob.paper.icassp2024_face_ti_partial	Face Reconstruction from Partially Leaked Facial Embeddings.	2024-01-16
code.face_rec_lensless	Face Recognition Using Lensless Camera (ICASSP 2024).	2024-01-16
Word-Confusion-Network-to-Text-Alignment	This repo contains all the needed files to replicate the WCN-to-Text experiments reported in our ICASSP 2024 paper.	2024-01-15
pydhn	PyDHN is a Python library for storing, visualizing and managing District Heating Network (DHN) data, running simulations and automate I/O workflow, built on top of Networkx.	2024-01-10
icassp2024.dvpf	Deep Variational Privacy Funnel: General Modeling with Applications in Face Recognition.	2024-01-04
Anonymization	Text anonymization is a Python library for anonymizing sensitive information in text data. Focused on Swiss French banking data.	2023-12-05
bob.paper.icassp2024_diu_hfr	Heterogeneous Face Recognition Using Domain Invariant Units.	2023-11-23
gafar	GaFaR: Geometry-aware Face Reconstruction.	2023-11-22
bob.paper.wacv2024_dvpba	Mitigating Demographic Bias in Face Recognition via Regularized Score Calibration.	2023-11-16
bob.paper.iccv2023_face_ti	Template Inversion Attack against Face Recognition Systems using 3D Face Reconstruction.	2023-11-14
idiap_spe	Set of exercises on automatic speech processing.	2023-11-06
benefits-of-max-pooling	This folder contains the supplementary material for the paper 'Benefits of Max Pooling in Neural Networks: Theoretical and Experimental Evidence' TMLR 2023.	2023-10-20
abroad-re	Relation Extraction in underexplored biomedical domains: A diversity-optimised sampling and synthetic data generation approach.	2023-10-18
gme-sampler	Greedy Maximum Entropy Sampler.	2023-10-18
language-label-bias	This repository contains the code for our paper "Understanding the effects of language-specific class-imbalance in multilingual fine-tuning".	2023-10-02
translation-aided-slu	This it the reference code for the paper The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation.	2023-09-15
sg_latent_modeling	StyleGAN3 Latent Space Modeling.	2023-07-20
bob.paper.icip2023_blackbox_face_reconstruction	Blackbox Face Reconstruction from Deep Facial Embeddings Using A Different Face Recognition Model (ICIP 2023).	2023-07-19
bob.paper.ijcb2023_vuln_analysis_hyg_mask_attack	This package is part of the signal-processing and machine learning toolbox Bob.	2023-07-12
bob.paper.ijcb2023_caim_hfr	Bridging the Gap: Heterogeneous Face Recognition with Conditional Adaptive Instance Modulation.	2023-07-11
HyperMixing	HyperMixing is a token-mixing techniques to be used as linear-time alternative to attention, for example in Transformer-like architecture like HyperMixer.	2023-06-12
ssl-caller-detection	Can Self-Supervised Neural Representations Pre-Trained on Human Speech distinguish Animal Callers?	2023-06-01
Node_weighted_GCN_for_depression_detection	Node-weighted Graph Convolutional Network for Depression Detection in Transcribed Clinical Interviews.	2023-05-31
contextual-biasing-on-gpus	The implementation of the contextual biasing for ASR decoding on GPUs without lattice generation.	2023-05-25
slu_representations	Effectiveness of text, acoustic, and lattice-based representations in spoken language understanding tasks.	2023-03-13
NVIB	Nonparametric Variational Information Bottleneck.	2023-02-06
nvib_transformers	A VAE for transformers using Nonparametric Information bottleneck.	2023-02-06
gafro	Geometric Algebra For RObotics.	2022-12-14
bob.paper.icip2022_face_reconstruction	Face Reconstruction from Deep Facial Embeddings using a Convolutional Neural Network.	2022-11-30
zff_vad	Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering.	2022-11-22
ESLAM	Efficient Dense SLAM System Based on Hybrid Representation of Signed Distance Fields.	2022-11-21
atco2-corpus	A Large-Scale Dataset for Research on Automatic Speech Recognition and Natural Language Understanding of Air Traffic Control Communications.	2022-11-10
bob.paper.tpami2023_face_TI	Comprehensive Vulnerability Evaluation of Face Recognition Systems to Template Inversion Attacks via 3D Face Reconstruction.	2022-11-02
hallucination-detection	This repository contains the code and data for the paper Unsupervised Token-level Hallucination Detection from Summary Generation By-products by Andreas Marfurt and James Henderson, presented at the GEM workshop at EMNLP 2022.	2022-10-28
multimodal_gaze_target_prediction	This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings" published at the GAZE workshop at CVPR 2022.	2022-10-18
bert-text-diarization-atc	BERTraffic: BERT-based Joint Speaker Role and Speaker Change Detection for Air Traffic Control Communications.	2022-10-12
cncsharedtask	IDIAPERS @ CASE22-TASK 3: Event Causality Identification.	2022-10-12
w2v2-air-traffic	How Does Pre-trained Wav2Vec 2.0 Perform on Domain Shifted ASR? An Extensive Benchmark on Air Traffic Control Communications.	2022-10-12
DHgeN	is a Python module for generating District Heating Networks layouts.	2022-09-22
SpArch	Spiking Architectures for Speech Technology.	2022-08-10
ExVo-2022	Extracting pre-trained self-supervised embeddings for ICML ExVO 2022 challenge.	2022-07-19
bayesian-recurrence	A Bayesian Interpretation of Recurrence in Neural Networks.	2022-07-18
ttgo	TTGO:Tensor Train for Global Optimization Problems in Robotics. A PyTorch implementation of TTGO algorithm and the applications presented in the paper "Tensor Train for Global Optimization Problems in Robotics "	2022-06-21
GeoNeRF	Generalizing NeRF with Geometry Priors.	2022-05-30
sleepless	Benchmarks for sleep phase detection from polysomnographs.	2022-05-05
wav2vec-lfmmi	wav2vec-lfmmi provides recipes from fine-tuning a pre-trained wav2vec 2.0 model using the espresso tool kit.	2022-03-17
bob.paper.icassp2022_morph_generate	Source code for generating the morphs described in the ICASSP 2022 paper 'Are GAN-based Morphs Threatening Face Recognition?'.	2022-01-31
bob.paper.tbiom2021_protect_vascular_dnn_biohash	Code to reproduce the results for the paper "Towards Protecting and Enhancing Vascular Biometric Recognition methods via Biohashing and Deep Neural Networks" in IEEE-TBIOM.	2022-01-31
unsupervised_gaze_calibration	This code allows the robust and unsupervised calibration of a gaze estimator used in a conversation or an object manipulation setting. It relies on task-related contextual attention prior to gather calibration samples and on robust estimation to compute the calibration parameters.	2021-12-13
CBI-MMTools	This repository contains plugins, device adapters and libraries for the operation of microscopy platforms using Micro-Manager, developed by the Computational BioImaging group at Idiap Research Institute.	2021-11-25
rethinking-saliency	Saliency Map Interpretability as Generative Modelling.	2021-10-14
sentence-planner	This is the code for the paper Sentence-level Planning for Especially Abstractive Summarization presented at the New Frontiers in Summarization workshop at EMNLP 2021.	2021-09-30
DepthInSpace	[ICCV 2021] DepthInSpace: Exploitation and Fusion of Multiple Frames of a Video for Structured-Light Depth Estimation.	2021-09-28
bob.paper.wifs2021_biohashing_sota_face	On the Recognition Performance of BioHashing on state-of-the-art Face Recognition models.	2021-09-27
potr	Pose Transformers: Human Motion Prediction with Non-Autoregressive Transformers.	2021-08-13
depth_human_synthesis	DepthHuman: A tool for depth image synthesis for human pose estimation.	2021-07-28
distance-based-cnn	Automatic Dysarthric Speech Detection Exploiting Pairwise Distance-based Convolutional Neural Networks.	2021-07-12
pddetection-reps-learning	Supervised Speech Representation Learning for Parkinson's Disease Classification.	2021-07-12
tnn	TNN - Trajectory Nearest Neighbors. This code was developed as a part of the Innosuisse MALAT: Machine Learning for Air Traffic project, which is a partnership between SkySoft ATM and the Idiap Research Institute.	2021-06-23
hourglass_push	This repository contains Python code for the work presented in the IROS 2021 paper "An Efficient Image-to-Image Translation HourGlass-based Architecture for Object Pushing Policy Learning" by M. Ewerton, A. Martínez-González and JM. Odobez.	2021-04-12
model-uncertainty-for-adaptation	Code for paper Uncertainty Reduction for Model Adaptation in Semantic Segmentation at CVPR 2021.	2021-04-07
als-classification	Classification of ALS and Stress in Cultures of Motor Neurons.	2021-04-06
CNN-based Models	CNN-based Models for ALS and stressed MNs cultures classification.	2021-04-06
deepdefresneling	Deep Learning Methods for Digital Holography in an Embedded System.	2021-04-06
flowestimation	Code for the PyTorch implementation of "Estimating Nonplanar Flow from 2D Motion-blurred Widefield Microscopy Images via Deep Learning", submitted to IEEE ISBI, 2021.	2021-04-06
TIDIGITSRecipe.jl	This repository contains a recipe for training an automatic speech recognition (ASR) system using the TIDIGITS database.	2021-02-26
icassp-oov-recognition	This has data and code related to the paper accepted at ICASSP21 "A comparison of methods for OOV-word recognition on a new Public Dataset".	2021-02-17
FiniteStateTransducers.jl	Play with Weighted Finite State Transducers (WFSTs) using the Julia language.	2021-02-09
cbi_toolbox	CBI Toolbox is a collection of algorithms used for computational bioimaging and microscopy.	2021-02-01
ihper	(Idiap human perception system). An audio-visual system for human perception, human-robot interaction. This ROS-compatible system detects tracks faces, re-identifies people, detect speaking people, and non-verbal cues (nod, visual focus of attention).	2020-10-12
bob.rppg.base	This package provides three baseline algorithms to perform remote photoplethysmography (rPPG), which consists in measuring the heart rate from a face video sequence. The software package implements three different algorithms to retrieve the pulse signal from skin color variations: an approach based on colorspace transformation, another approach solely based on signal processing, and a more recent approach, which analyzes the subspace spanned by skin-colored pixels in the RGB colorspace.	2020-10-02
tf_robot_learning	Tensorflow robot learning library.	2020-09-18
sae_lang_detect	sae_lang_detect: Supervised Autoencoder for Language Detection. The Supervised Autoencoder (SAE) with Bayesian Optimization (BO) for the language detection task found effectively for discriminating between very close languages or dialects. This library contains the PyTorch implementation of SAE with one sample code for using it for the language detection task. The library can be used for other NLP classification tasks (e.g. Fake News Detection, Operant Motive Detection) easily. It supports both CPU and GPU versions with just turn on/off the GPU flag ("is_gpu = True or False").	2020-08-28
pkwrap	This is a (yet another!) python wrapper for Kaldi. The main goal is to be able to train acoustic models in Pytorch so that we can use MMI cost function during training use NG-SGD for affine transformations, which enables multi-GPU training with SGE	2020-07-31
Residual_pose	Residual Pose: A Decoupled Approach for Depth-based 3D Human Pose Estimation.	2020-07-27
BEAT platform	The BEAT platform is a European computing e-infrastructure for Open Science proposing a solution for open access, scientific information sharing and re-use including data and source code while protecting privacy and confidentiality. It allows easy online access to experimentation and testing in computational science.	2020-07-13
fast-transformers	This library aims to facilitate research on efficient transformer models and provides PyTorch implementations for several efficient transformers.	2020-06-15
bob.paper.nir_patch_pooling	This package contains python code to reproduce experiments and results described in the IEEE ICIP paper: "CNN Patch Pooling for Detecting 3D Mask Presentation Attacks in NIR".	2020-05-18
bob.med.tb	Active Tuberculosis Detection On CXR Package for Bob.	2020-05-08
psfestimation	Code for the PyTorch implementation of "Spatially-Variant CNN-based Point Spread Function Estimation for Blind Deconvolution and Depth Estimation in Optical Microscopy", IEEE Transactions on Image Processing, 2020.	2020-05-05
beat.accumos.exporter	This module implements a tool that will generate Acumos compatible Docker images from simple BEAT algorithms (sequential, without setup nor prepare).	2020-05-04
t-softmax	t-softmax pytorch reproducibility code. The repository contains the code to reproduce the results of the paper: Niccolò Antonello, Philip N. Garner "A t-distribution based operator for enhancing out of distribution robustness of neural network classifiers," IEEE Signal Processing Letters, 2020, to appear.	2020-04-30
DeepFocus	Code for the PyTorch implementation of "DeepFocus: a Few-Shot Microscope Slide Auto-Focus using a Sample Invariant CNN-based Sharpness Function".	2020-03-30
aberration_correction	Sorting and scanning aberration correction of periodic image time-series.	2020-03-13
bioformats_io	Takes as input NPY files and saves them to OME or OME-TIFF, and conversely, takes as input microscopy-format files and saves them as NPY.	2020-03-13
torgo_asr	This is a Kaldi recipe to build automatic speech recognition systems on the Torgo corpus of dysarthric speech.	2020-03-06
fast_pose_machines	Efficient Pose Machines for Multi-Person Pose Estimation.	2019-12-20
DeepOBS	A Deep Learning Optimizer Benchmark Suite.	2019-10-29
bob.paper.icassp2020_facepad_generalization_infovae	Code to reproduce "Improving Cross-dataset Performance Of Face Presentation Attack Detection Systems Using Face Recognition Datasets" ICASSP 2020 paper.	2019-10-20
bob.paper.icassp2020_domain_guided_pruning	Code to reproduce "Domain Adaptation for Generalization of Face presentation Attack Detection in Mobile Settings with Minimal Information" ICASSP 2020 paper.	2019-10-19
fullgrad-saliency	This code is the reference implementation of the methods described in our NeurIPS 2019 publication "Full-Gradient Representation for Neural Network Visualization. This repository implements two methods: the reference FullGrad algorithm, and a variant called "simple FullGrad", which omits computation of bias parameters for bias-gradients.	2019-10-03
buslr	Build System for Speech and Language Research.	2019-09-23
bob.paper.makeup_aim	This package contains python code to reproduce experiments and results described in the IEEE T-BIOM paper: "Detection of Age-Induced Makeup Attacks on Face Recognition Systems Using Multi-Layer Deep Features".	2019-07-18
hesm_distrib hesm_distrib_data	Temporal Super-Resolution Microscopy Using a Hue-Encoded Shutter.	2019-07-16
RawSpeechClassification	Trains CNN (or any neural network based) classifiers from raw speech using Keras and tests them. The inputs are lists of wav files, where each file is labelled. It then creates fixed length signals and processes them. During testing, it computes scores at the utterance or speaker levels by averaging the corresponding frame-level scores from the fixed length signals.	2019-05-24
DRILL	Deep residual output layers for neural language generation.	2019-05-10
nnsslm	Neural Network based Sound Source Localization Models.	2019-05-10
attention-sampling	Python library to accelerate the training and inference of neural networks on large data. This code is the reference implementation of the methods described in our ICML 2019 publication "Processing Megapixel Images with Deep Attention-Sampling Models".	2019-05-08
bob.ip.binseg	Binary Segmentation Benchmark Package for Bob.	2019-04-15
unet.interspeech2019	U-NET based feature extractor for text-independent speaker verification.	2019-03-26
LR-CNN	Trains low-rank CNNs from raw speech using Keras/Tensorflow, with inputs from Kaldi directories.	2019-03-15
bob.paper.deep_pix_bis_pad.icb2019	This package is part of the signal-processing and machine learning toolbox Bob. This package contains source code to replicate the experimental results published in the following paper: Deep Pixel-wise Binary Supervision for Face Presentation Attack Detection	2019-03-12
bob.paper.xcsmad_facepad	Face PAD for Silicone mask-based attack detection.	2019-02-19
bob.paper.mcae.icb2019	Face PAD using multi-channel autoencoders.	2019-02-11
IdiapTTS	Idiap Text-to-Speech system developed at the Idiap Research Institute.	2019-02-07
bob.paper.mccnn.tifs2018	Face PAD using Multi-Channel CNN.	2018-11-05
joint-embedding-nmt	Pytorch implementation of the structure-aware output layer for neural machine translation which was presented at WMT 2018.	2018-10-30
HAN_NMT	Document-Level Neural Machine Translation with Hierarchical Attention Networks.	2018-09-21
KiSC	K.I.S.S. Cluster (KiSC) - with K.I.S.S. as in "Keep It Stupid Simple" - is a utility that aims to simplify the life of administrators managing resources accross a cluster of hosts.	2018-09-21
CNN_QbE_STD	Implementation of the work presented in "CNN based Query by Example Spoken Term Detection".	2018-09-03
human-detection	Background substraction and Human Detection.	2018-07-27
bob.paper.eusipco2018	Speaker Inconsistency Detection in Tampered Video. Source code for reproducing the speaker inconsistency detection experiments of the paper "Speaker Inconsistency Detection in Tampered Video" in EUSIPCO 2018 conference.	2018-06-22
semiblindpsfdeconv	Semi-blind Spatially-Variant Deconvolution. Code for "Semi-Blind Spatially-Variant Deconvolution in Optical Microscopy with Local Point Spread Function Estimation By Use Of Convolutional Neural Networks" ICIP 2018.	2018-05-29
Attentive_Residual_Connections_NMT	Implementation and output data of "Global-Context Neural Machine Translation through Target-Side Attentive Residual Connections".	2018-01-12
eigenposterior	eigenposterior (Senone Class Principal Components) based approach for purifying DNN posterior estimates.	2018-01-08
multicamera-calibration	This toolset provides the basics for calibrating a multi-camera scene. it contains six utilities for different purposes. In this README I will walk the user through the calibration of a multi camera scene using this toolset.	2017-12-06
trimed	The trimed algorithm for obtaining the medoid of a set.	2017-11-15
inv-tn	Inverse Text Normalization using NMT models.	2017-09-27
mhan	Multilingual hierarchical attention networks toolkit.	2017-09-27
CNN-voice-PAD	The purpose of this software is to train Convolutional Neural Networks on raw speech signals in order to detect voice presentation attacks.	2017-08-04
importance-sampling	This python package provides a library that accelerates the training of arbitrary neural networks created with Keras using importance sampling.	2017-07-13
simple-imager	Simple Imager (Linux Imaging and Deployment Made Easy) is a set of tools allowing an imaging server to retrieve a copy of Linux reference hosts (sources) and allowing those images to be deployed to other target hosts by the mean of RSync or BitTorrent files download.	2017-05-08
APT	The APT software is a reference-based metric to evaluate the accuracy of pronoun translation.	2017-02-22
bob.ip.qualitymeasure	This package is part of the signal-processing and machine learning toolbox Bob. It provides functions for extracting image-quality features proposed for PAD experiments by different research groups. Image quality measures proposed by Galbally et al. (IEEE TIP 2014) and by Wen et al. (IEEE TIFS 2015) are implemented in this package.	2017-02-07
Exact Acceleration of Linear Object Detectors	We describe a general and exact method to considerably speed up linear object detection systems operating in a sliding, multi-scale window fashion, such as the individual part detectors of part-based models.	2016-11-04
IBDiarization	Speaker Diarization Toolkit. The toolkit is intended to facilitate research in multistream speaker diarization providing a platform for research in novel audio, video or location features. It is based on the Information Bottleneck principle and is explicitely designed to use of several hetergenous feature streams.	2016-11-04
warca	WARCA is a simple and fast algorithm for metric learning.	2016-10-07
symfony-bundle-datacryptographer	The Data Cryptographer Bundle is a PHP/Symfony bundle which provides a cryptographer resource/service for common cryptographic operations.	2016-09-15
zentas	Software for doing k-medoids using an accelerated CLARANS algorithm.	2016-09-15
bob.bio.spear	Implements speaker recognition algorithms.	2016-08-04
bob.bio.vein	Vein biometrics recognition baselines.	2016-07-08
HOOSC	Histogram of Orientation Shape Context.	2016-04-11
mash-web	Front-end of the MASH computation farm.	2016-03-09
mash	Back-end of the MASH computation farm.	2016-03-02
kaldi-ivector	The code is an implementation of the standard i-vector extraction algorithm for the Kaldi toolkit.	2016-01-07
phonvoc	Phonetic and phonological vocoding platform. Phonvoc is a cascaded deep neural network composed of speech analyser and synthesizer that use shared phonological speech representation.	2015-12-11
eakmeans	Implementation of fast exact k-means algorithms.	2015-12-04
acoustic-simulator	Implementation of audio degradation processes.	2015-11-18
DocRec	KEYWORD EXTRACTION AND DOCUMENT RECOMMENDATION IN CONVERSATIONS (DocRec). The package contains several pieces of Matlab code. Taken together, they extract keywords from a conversation, then use them to build implicit queries, and then consolidate the sets of retrieved documents to recommend to the conversation participants.	2015-10-26
HPCA	hpca is a C++ toolkit providing an efficient implementation of the Hellinger PCA for computing word embeddings.	2015-09-16
symfony-bundle-datajukebox	The Data Jukebox Bundle is a PHP/Symfony bundle which aims to provide - for common CRUD (Create-Read-Update-Delete) operations - the same level of abstraction that Symfony does for forms.	2015-09-01
libssp	Library for speech signal processing.	2015-06-18
asrt	A python library that facilitate the extraction of text sentences from multilingual 'pdf' documents.	2015-05-12
GC.MI	The gc_MI.cpp file includes C++ code implementing the GC.MI algorithm.	2015-02-06
wmil-sgd	A weighted multiple-instance learning algorithm based on stochastic gradient descent.	2015-01-21
cbrec	Content-Based Recommendation Generator (CBRec v1.0). A Python library which generates content-based recommendations for a set of items described by textual metadata using four possible vector space methods, namely TF-IDF, LSI, RP and LDA.	2014-12-12
emorec	Emotion-Based Recommendation Generator (EMORec v1.0). A Python library which performs emotion-based analysis and recommendation using a multiple-instance regression algorithm for a set of multimedia items described by transcripts.	2014-12-12
g3e	HG3D - A module for 3D head pose and gaze tracking from RGB-D sensors. This software contains the implementation of algorithms related to 3D head pose and gaze tracking tasks based on RGB-D cameras (standard vision and depth).	2014-09-16
rgbd	RGBD: A Python based RGB-D data processing module. This python module implements the streaming, calibration and visualization of RGB-D data, that is, combined color and depth images.	2014-09-15
pbdlib-matlab	PbDlib is a set of tools combining statistical learning, dynamical systems and optimal control approaches for programming-by-demonstration applications.	2014-07-08
mash-simulator	mash-simulator is a 3D simulator for Linux and MacOS where a robot must complete a certain number of tasks in different randomized environments.	2014-05-09
Webvalidation	This software is a multi users, multi projects web annotation tool that help to organize the process of validating automatically generated transcriptions.	2014-04-16
facereclib	This library is designed to perform a fair comparison of face recognition algorithms. It contains scripts to execute various kinds of face recognition experiments on a variety of facial image databases.	2013-11-05
slog	Similarity Learning on Graph. SLOG contains implementation of similarity learning methods over relational data, where the relation between data points are given explicitly.	2013-11-05
ML3	ML3 is an open source implementation of the Multiclass Latent Locally Linear Support Vector Machine algorithm, a multi-class local classifier based on a latent SVM formulation.	2013-10-16
xbob.thesis.elshafey2014	This package contains scripts to reproduce the experiments of Laurent El Shafey's Ph.D. thesis at Ecole Polytechnique Fédérale de Lausanne (EPFL).	2013-09-17
DiscoConn-Classifier	Classifier models and feature extractors for discourse relations.	2013-08-28
probamod-v1	Probabilistic Models: temporal topic models and more. Topic models such as Latent Dirichlet Allocation (LDA) have been used successfully in many domains for data mining. Originally designed for text documents, these methods find some hidden “topics” considering that each document is a weighted mixture of topics. Each topic expresses itself in a document by generating some specific words with more probability than others.	2013-07-23
ISS	The Idiap Speech Scripts (ISS) is a collection of speech databases and dictionaries, and for training and testing of models for ASR. The scripts in turn are reliant on many other packages including HTK/HTS, Juicer and the ICSI speech tools.	2013-07-11
SSP	SSP stands for Speech Signal Processing. It is a fairly small package written in python. Its functionality is similar to tracter, with some overlap and some additional capabilities. In particular, SSP contains a parametric vocoder, a pitch extractor and feature extraction for ASR.	2013-06-10
act	ACT for Accuracy of Connective Translation is a reference-based metric to measure the accuracy of discourse connective translation, mainly for statistical machine translation systems.	2013-03-20
BOB	Bob is a free signal-processing and machine learning toolbox developed by the Biometrics group at Idiap Research Institute, Switzerland. The toolbox is written in a mix of Python and C++ and is designed to be both efficient and reduce development time.	2012-11-26
MSER	Linear time Maximally Stable Extremal Regions (MSER) implementation as described in D. Nistér and H. Stewénius, Linear Time Maximally Stable Extremal Regions".	2012-11-20
HEAT Image Retrieval System	HEAT is an image retrieval web-application that is intended for large unstructured collections of images without semantic annotations. The system implements a novel searching paradigm that does not require any explicit query. At each iteration, the system displays a small set of images and the user chooses the image that best matches what she is looking for. After a few iterations, the sets of displayed images are gradually concentrated on images that satisfy the user.	2012-09-21
The Multi-Tracked Paths	This is an implementation of the variant of KSP for tracking presented in (Berclaz et al. 2011). You can get more information and the reference implementation from the CVLab's web page about multi-camera tracking.	2012-09-07
Tasting Families of Features for Image Classification	Please find below the code necessary to reproduce the experiments of the paper Tasting Families of Features for Image Classification "under the GPL v2 license."	2011-11-29
HTS-VTLN	This software is a patch to HMM based statistical parametric speech synthesis toolkit (HTS 2.2).	2011-10-27
facecolormodel	This page contains the source code and data needed to train and use a model for skin, hair, clothing and background color modelling and segmentation.	2011-08-01
Torch	Statistical machine learning library containing most of the state-of-the-art algorithms. Written in Lua and C, the library is distributed under a BSD license.	2011-01-01

Obsoleted Software

Juicer	Juicer is a Weighted Finite State Transducer (WFST) based decoder for Automatic Speech Recognition (ASR).	1970-01-01
Tracter	Tracter is a data flow framework.	1970-01-01
Torch3vision	Common softwagre library for computer vision with machine learning algorithms. Written in simple C++, this library is based on Torch and distributed under a BSD license.	1970-01-01

Software

Obsoleted Software

About

Research

Innovation

Education

News

Events

Careers