Log RGB Images Provide Invariance to Intensity and Color Balance Variation for Convolutional Networks


Bruce A Maxwell* (Northeastern University), Sumegha Singhania (Northeastern University), Heather Fryling (Northeastern University), Haonan Sun (Northeastern University)
The 34th British Machine Vision Conference

Abstract

The interaction of light and matter follows physical rules that have been well-modeled in the vision community. These rules should be available to deep networks when learning vision tasks. However, typical signal processing pipelines, conversion to sRGB, and JPEG compression break the rules and make them unavailable for learning. This, in turn, makes color and intensity unreliable as features and more difficult to use. Using linear or log RGB images that preserve the rules of the physics of reflection should make certain visual tasks simpler to learn and increase robustness to certain types of visual variation. We demonstrate that using linear RGB or log RGB improves the performance of a deep network on an image classification task when the same network architecture is trained on the same images but in different formats. Furthermore, the linear and log RGB networks are more robust to intensity and color balance variation. In particular, the network trained on log RGB inputs shows invariance to intensity and color balance variation when that variation is not included in the training set, while the network trained on the same images in JPEG format shows severe reductions in performance. We further explore why this difference exists by visualizing low-level features in log RGB, linear RGB, and JPEG data and show that log space preserves certain types of features across intensity and color balance variation.
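
As a rough illustration of why log space can preserve features under such variation, the sketch below assumes that an intensity or color balance change acts as a per-channel multiplicative gain on linear RGB values. This simple model is an assumption made here and is not taken from the paper. Under it, the gain becomes an additive offset after taking the log, so differences between pixels (the kind of quantity an edge-like filter responds to) are unchanged in log RGB but not in linear RGB. The pixel values, gains, and helper names are hypothetical.

```python
import math

def log_rgb(pixel):
    """Element-wise natural log of a linear RGB triple (all values must be > 0)."""
    return [math.log(c) for c in pixel]

def apply_gains(pixel, gains):
    """Scale each linear channel by its gain (assumed model of an
    intensity / color balance change; not the paper's pipeline)."""
    return [c * g for c, g in zip(pixel, gains)]

def diff(a, b):
    """Channel-wise difference between two pixels."""
    return [x - y for x, y in zip(a, b)]

def all_close(a, b):
    """True if every channel of a matches the same channel of b."""
    return all(math.isclose(x, y, abs_tol=1e-12) for x, y in zip(a, b))

# Hypothetical linear RGB values for two neighboring pixels.
p = [0.20, 0.40, 0.10]
q = [0.05, 0.30, 0.08]

# Hypothetical per-channel gains: a warmer, red-boosted color balance.
gains = [1.8, 1.0, 0.6]

# Pixel differences before and after the gain change, in both spaces.
lin_before = diff(p, q)
lin_after = diff(apply_gains(p, gains), apply_gains(q, gains))
log_before = diff(log_rgb(p), log_rgb(q))
log_after = diff(log_rgb(apply_gains(p, gains)), log_rgb(apply_gains(q, gains)))

# log(g * p) - log(g * q) = log(p) - log(q), so the gain cancels in log space.
log_invariant = all_close(log_before, log_after)
lin_invariant = all_close(lin_before, lin_after)

print("log-space difference unchanged:", log_invariant)
print("linear-space difference unchanged:", lin_invariant)
```

A uniform intensity change is the special case in which all three gains are equal. The sketch says nothing about sRGB or JPEG inputs, where the nonlinear encoding and compression mean a gain applied to the scene no longer corresponds to a simple per-channel scaling of the stored values.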

Citation

@inproceedings{Maxwell_2023_BMVC,
author    = {Bruce A Maxwell and Sumegha Singhania and Heather Fryling and Haonan Sun},
title     = {Log RGB Images Provide Invariance to Intensity and Color Balance Variation for Convolutional Networks},
booktitle = {34th British Machine Vision Conference 2023, {BMVC} 2023, Aberdeen, UK, November 20-24, 2023},
publisher = {BMVA},
year      = {2023},
url       = {https://papers.bmvc2023.org/0635.pdf}
}


Copyright © 2023 The British Machine Vision Association and Society for Pattern Recognition
The British Machine Vision Conference is organised by The British Machine Vision Association and Society for Pattern Recognition. The Association is a Company limited by guarantee, No.2543446, and a non-profit-making body, registered in England and Wales as Charity No.1002307 (Registered Office: Dept. of Computer Science, Durham University, South Road, Durham, DH1 3LE, UK).
