Since the introduction of NeRFs, considerable attention has been focused on improving their training and inference times, leading to the development of Fast-NeRFs models. Despite demonstrating impressive rendering speed and quality, the rapid convergence of such models poses challenges for further improving reconstruction quality. Common strategies to improve rendering quality involves augmenting model parameters or increasing the number of sampled points. However, these computationally intensive approaches encounter limitations in achieving significant quality enhancements. This study introduces a model-agnostic framework inspired by Sparsely-Gated Mixture of Experts to enhance rendering quality without escalating computational complexity. Our approach enables specialization in rendering different scene components by employing a mixture of experts with varying resolutions. We present a novel gate formulation designed to maximize expert capabilities and propose a resolution-based routing technique to effectively induce sparsity and decompose scenes. Our work significantly improves reconstruction quality while maintaining competitive performance
Increase Resolution
Increase MLP size
Sample more points along the ray
Typical naive approaches to enhance the reconstruction quality of Fast-NeRFs models include:
However the (marginal) increase in reconstruction quality results in a significant increase in computational costs.
Our methods can overcome these limitations, while being model-agnostic by design. Although Top-1 already achieves state-of-the-art performance, Top-2 strikes an optimal balance between the efficiency of the Top-1 approach and the quality of the Ensemble method, where all experts are used for every input.
Our method allows for a frequency-based scene decomposition, where high-res models render complex parts of the scene; whereas low-res models render low-frequency parts. The output of the gate module for each model is visualized on the left; on the right, the rendered part of the scene for each model is shown. Outputs are arranged in increasing order of resolution.
Our method ensures superior reconstruction quality compared to the baselines, effectively reproducing sharper details while reducing noise on texture-less spots.
Our novel gate design is lightweight (comparable to a linear gate) but achieves way superior quality. Furthermore, it achieves good results with a very-low resolution grid.
inproceedings{boostnerf,
author = "Di Sario, Francesco and Renzulli, Riccardo and Tartaglione, Enzo and Grangetto, Marco",
editor = "Leonardis, Ale{\v{s}} and Ricci, Elisa and Roth, Stefan and Russakovsky, Olga and Sattler, Torsten and Varol, G{\"u}l",
title = "Boost Your NeRF: A Model-Agnostic Mixture of Experts Framework for High Quality and Efficient Rendering",
booktitle = "Computer Vision -- ECCV 2024",
year = "2025",
publisher = "Springer Nature Switzerland",
address = "Cham",
pages = "176--192",
isbn = "978-3-031-73010-8"
}