Perceiving Systems, Computer Vision

Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation

2022

Conference Paper

ps


Virtual facial avatars will play an increasingly important role in immersive communication, games and the metaverse, and it is therefore critical that they be inclusive. This requires accurate recovery of the albedo, regardless of age, sex, or ethnicity. While significant progress has been made on estimating 3D facial geometry, appearance estimation has received less attention. The task is fundamentally ambiguous because the observed color is a function of albedo and lighting, both of which are unknown. We find that current methods are biased towards light skin tones due to (1) strongly biased priors that prefer lighter pigmentation and (2) algorithmic solutions that disregard the light/albedo ambiguity. To address this, we propose a new evaluation dataset (FAIR) and an algorithm (TRUST) to improve albedo estimation and, hence, fairness. Specifically, we create the first facial albedo evaluation benchmark where subjects are balanced in terms of skin color, and measure accuracy using the Individual Typology Angle (ITA) metric. We then address the light/albedo ambiguity by building on a key observation: the image of the full scene –as opposed to a cropped image of the face– contains important information about lighting that can be used for disambiguation. TRUST regresses facial albedo by conditioning on both the face region and a global illumination signal obtained from the scene image. Our experimental results show significant improvement compared to state- of-the-art methods on albedo estimation, both in terms of accuracy and fairness. The evaluation benchmark and code are available for research purposes at https://trust.is.tue.mpg.de.

Author(s): Haiwen Feng and Timo Bolkart and Joachim Tesch and Michael J. Black and Victoria Fernandez Abrevaya
Book Title: Computer Vision – ECCV 2022
Volume: 13
Pages: 72--90
Year: 2022
Month: October

Series: Lecture Notes in Computer Science, 13673
Editors: Avidan, Shai and Brostow, Gabriel and Cissé, Moustapha and Farinella, Giovanni Maria and Hassner, Tal
Publisher: Springer

Department(s): Perceiving Systems
Bibtex Type: Conference Paper (inproceedings)
Paper Type: Conference

DOI: 10.1007/978-3-031-19778-9_5
Event Name: 17th European Conference on Computer Vision (ECCV 2022)
Event Place: Tel Aviv, Israel

Address: Cham
ISBN: 978-3-031-19777-2
State: Published

Links: pdf
project
code

BibTex

@inproceedings{TRUST:ECCV2022,
  title = {Towards Racially Unbiased Skin Tone Estimation via Scene Disambiguation},
  author = {Feng, Haiwen and Bolkart, Timo and Tesch, Joachim and Black, Michael J. and Abrevaya, Victoria Fernandez},
  booktitle = {Computer Vision – ECCV 2022},
  volume = {13},
  pages = {72--90},
  series = {Lecture Notes in Computer Science, 13673},
  editors = {Avidan, Shai and Brostow, Gabriel and Cissé, Moustapha and Farinella, Giovanni Maria and Hassner, Tal},
  publisher = {Springer},
  address = {Cham},
  month = oct,
  year = {2022},
  doi = {10.1007/978-3-031-19778-9_5},
  month_numeric = {10}
}