We focus on face reconstruction attacks and propose a new method to reconstruct face images from different face recognition models using a foundation model. We use Arc2Face, which is a recently proposed face foundation model, and is capable of generating identity-consistent images given an embedding of a specific face recognition model FFM. The Arc2Face pipeline includes a modified CLIP model to accept face embeddings from the face recognition model (FFM) and a Stable Diffusion UNet decoder. This pipeline is end-to-end trained on an upscaled version of WebFace42M, resulting in the generation of high-quality, identity-consistent images. It is evident that this model can be readily used for inverting the embeddings of FFM, as it is specifically trained for those embeddings. The face recognition model FFM used to train Arc2Face has a ResNet100 backbone and is trained with WebFace42M. To attack any other face recognition system, one would need to redo the entire process of adapting a large model on a large-scale dataset, which is impractical. In this work, we propose a simple approach to adapt this model for reconstructing images from embeddings of any face recognition system (Fvictim). The main idea is to develop an adapter module that can transform the embeddings from the space of the leaked embedding into the embedding space of FFM. Figure below depicts our face reconstruction attack.
Essentially, we propose to learn a mapping M which would map the leaked embeddings to the embedding space of FFM. We implement this mapping function, M, as a simple linear layer. The parameters of this adapter layer can be learned using a set of pairs of embeddings extracted from Fvictim and FFM and Mean Square Error (MSE) as the loss function:
Below, you can see sample reconstructed face images from embeddings of different face recognition models using our adapter module:
The source code of our experiments will be available soon.
@article{face_adapter,
author = {Hatef Otroshi Shahreza and Anjith George and S{\'e}bastien Marcel},
title = {Face Reconstruction from Face Embeddings using Adapter to a Face Foundation Model},
journal = {arXiv preprint arXiv:2411.03960},
year = {2024}
}