Draft:3D Gaussian Splatting for Real-Time Radiance Field Rendering

Introduction

The field of real-time radiance field rendering has been strongly influenced by 3D Gaussian Splatting since the methodology was introduced in a 2023 ACM Transactions on Graphics paper[1]. The method produces high-quality novel views of a scene in real time (1080p at 30 fps or more) from a collection of photos or videos, a combination of speed and quality that had previously been difficult to achieve.

As of September 2024, the paper had received more than 1,000 citations.

Contributions

The paper introduced several ideas to the field of novel-view synthesis, aimed at improving the speed and quality of generation, particularly relative to Neural Radiance Field (NeRF) based approaches.

  1. Anisotropic 3D Gaussians were used for the first time as a representation of radiance fields.
  2. An optimization method for the Gaussians' properties, interleaved with adaptive control of the Gaussian density, was introduced.
  3. A fast, visibility-aware rendering algorithm tailored to GPUs was proposed.

Overview

The approach can be summarized as follows (a sketch of the per-Gaussian parameters follows the list):

  • Input: A set of images of a static scene, together with camera poses and a sparse point cloud obtained from Structure-from-Motion (SfM) calibration.
  • 3D Gaussians: Each Gaussian is defined by a mean (position), a covariance matrix, and an opacity.
  • Color: Represented by spherical harmonics, which capture view-dependent appearance.
  • Algorithm: Optimize the parameters above, interleaved with adaptive control of the Gaussian density.
  • Rasterizer: A tile-based rasterizer with fast sorting and a fast backward pass.
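
A minimal sketch of the per-Gaussian parameters listed above is given below in Python/NumPy; the field names, array shapes, and spherical-harmonic degree are illustrative assumptions rather than the authors' implementation:

```python
# Illustrative sketch of the parameters stored for each 3D Gaussian (not the paper's code).
from dataclasses import dataclass
import numpy as np

@dataclass
class Gaussian3D:
    mean: np.ndarray       # (3,) position of the Gaussian in world space
    scale: np.ndarray      # (3,) per-axis extent of the underlying ellipsoid
    rotation: np.ndarray   # (4,) unit quaternion giving the ellipsoid's orientation
    opacity: float         # blending weight used during alpha compositing
    sh_coeffs: np.ndarray  # (16, 3) spherical-harmonic coefficients for view-dependent color
```

All of these fields are treated as trainable parameters during optimization.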

Method

[Image: diagram illustrating the workflow of the proposed algorithm.]

The approach has the following salient points:

  • Differentiable 3D Gaussian splatting: the representation is unstructured and explicit, which allows rapid rendering, and each Gaussian can be projected to a 2D splat. Intuitively, the covariance of a Gaussian can be thought of as describing an ellipsoid, which can be decomposed mathematically into a scaling matrix and a rotation matrix (see the sketch after this list). Gradients for all parameters are derived explicitly to avoid the overhead of automatic differentiation.
  • Optimization: the optimization creates a dense set of 3D Gaussians that represents the scene as accurately as possible. Each rendering step is followed by a comparison against the training views available in the dataset, and stochastic gradient descent, in a manner inspired by the Plenoxels[2] work, is used to minimize a loss combining an L1 term with a D-SSIM term, L = (1 − λ)·L1 + λ·L_D-SSIM with λ = 0.2. Interleaved with this optimization, the number and density of Gaussians are adaptively controlled: Gaussians in under-reconstructed regions are cloned or split, while nearly transparent ones are pruned (a loss sketch follows below).
  • Rasterization: the blending of the Gaussian components is performed by a tile-based rasterizer (also illustrated in the sketch below). The approach handles scenes of varying complexity without explicit tuning of hyperparameters.
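
The first and third points can be made concrete with a short sketch. The NumPy code below is an expository approximation rather than the published CUDA implementation: it builds a Gaussian's 3D covariance from its scaling and rotation (Σ = R S Sᵀ Rᵀ) and composites depth-sorted splats front to back at a single pixel. The paper additionally projects Σ to a 2D screen-space covariance (Σ′ = J W Σ Wᵀ Jᵀ), which is omitted here for brevity.

```python
# Expository NumPy sketch: 3D covariance construction and front-to-back alpha blending.
import numpy as np

def quat_to_rotmat(q):
    """Convert a unit quaternion (w, x, y, z) to a 3x3 rotation matrix."""
    w, x, y, z = q
    return np.array([
        [1 - 2*(y*y + z*z), 2*(x*y - w*z),     2*(x*z + w*y)],
        [2*(x*y + w*z),     1 - 2*(x*x + z*z), 2*(y*z - w*x)],
        [2*(x*z - w*y),     2*(y*z + w*x),     1 - 2*(x*x + y*y)],
    ])

def covariance_3d(scale, rotation):
    """Sigma = R S S^T R^T: an ellipsoid described by per-axis scaling and a rotation."""
    R = quat_to_rotmat(rotation)
    S = np.diag(scale)
    return R @ S @ S.T @ R.T

def blend_pixel(colors, alphas):
    """Front-to-back compositing of splats already sorted from near to far.
    colors: (N, 3) per-splat color at this pixel; alphas: (N,) per-splat alpha."""
    color = np.zeros(3)
    transmittance = 1.0
    for c, a in zip(colors, alphas):
        color += transmittance * a * c
        transmittance *= 1.0 - a
        if transmittance < 1e-4:  # stop early once the pixel is effectively opaque
            break
    return color

# Example: covariance of a Gaussian stretched along the x axis, with identity rotation.
sigma = covariance_3d(scale=np.array([2.0, 0.5, 0.5]),
                      rotation=np.array([1.0, 0.0, 0.0, 0.0]))
```

Early termination once a pixel saturates is one of the measures that keeps the tile-based rasterizer fast.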

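The loss from the second point can likewise be sketched. The snippet below uses a simplified, non-windowed SSIM purely for illustration (the paper uses the standard windowed D-SSIM), and the density-control thresholds are placeholders rather than values taken from the authors' code:

```python
# Expository NumPy sketch of the training loss and the adaptive density-control test.
import numpy as np

def simple_ssim(x, y, data_range=1.0):
    """Global (non-windowed) SSIM between two equally shaped images; a simplification."""
    c1, c2 = (0.01 * data_range) ** 2, (0.03 * data_range) ** 2
    mu_x, mu_y = x.mean(), y.mean()
    cov_xy = ((x - mu_x) * (y - mu_y)).mean()
    return ((2 * mu_x * mu_y + c1) * (2 * cov_xy + c2)) / \
           ((mu_x ** 2 + mu_y ** 2 + c1) * (x.var() + y.var() + c2))

def training_loss(rendered, target, lam=0.2):
    """L = (1 - lambda) * L1 + lambda * D-SSIM, with lambda = 0.2."""
    l1 = np.abs(rendered - target).mean()
    d_ssim = 1.0 - simple_ssim(rendered, target)
    return (1.0 - lam) * l1 + lam * d_ssim

def density_control_masks(view_grad_norms, opacities,
                          grad_threshold=2e-4, min_opacity=5e-3):
    """Which Gaussians to densify (clone or split) and which to keep after pruning."""
    densify = view_grad_norms > grad_threshold  # large positional gradients: under-reconstructed regions
    keep = opacities > min_opacity              # drop Gaussians that are nearly transparent
    return densify, keep

# Example usage with random stand-in data.
rendered, target = np.random.rand(64, 64, 3), np.random.rand(64, 64, 3)
print(training_loss(rendered, target))
```
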
Results and Evaluation[3]

  • The authors tested their algorithm on 13 real scenes from previously published datasets, as well as the synthetic Blender dataset[4].
  • They compared their method against state-of-the-art techniques like Mip-NeRF360[5], InstantNGP[6], and Plenoxels[2].
  • The quantitative evaluation metrics used were PSNR, LPIPS, and SSIM.
  • The authors report results for two configurations of their method: 7K and 30K iterations.
  • Their fully converged model (30K iterations) achieves quality on par with or slightly better than Mip-NeRF360[5], but with significantly reduced training time (35-45 minutes vs. 48 hours) and faster rendering (real-time vs. 10s/frame).
  • At 7K iterations (5-10 minutes of training), their method achieves comparable quality to InstantNGP[6] and Plenoxels[2].
  • For synthetic bounded scenes (Blender dataset[4]), they achieved state-of-the-art results even with random initialization, starting from 100K uniformly random Gaussians.

Limitations

  • Elongated artifacts or "splotchy" Gaussians in some areas.
  • Occasional popping artifacts due to large Gaussians created by the optimization, especially in regions with view-dependent appearance.
  • Higher memory consumption compared to NeRF-based solutions, though still more compact than previous point-based approaches.
  • May require hyperparameter tuning (e.g., reducing position learning rate) for very large scenes.
  • Peak GPU memory consumption during training can be high (over 20GB) in the current unoptimized prototype.

The authors note that some of these limitations, such as popping artifacts and memory consumption, could potentially be addressed through future improvements like better culling approaches, antialiasing, regularization, and compression techniques.

Applications of 3D Gaussian Splatting

Below are a few examples of how Gaussian Splatting has been adapted and extended across various computer vision and graphics applications, from dynamic scene rendering to autonomous driving simulations and 4D content creation:

  • 4D Gaussian Splatting for Real-Time Dynamic Scene Rendering[7]:
    • Extends 3D Gaussian Splatting to dynamic scenes (4D representation)
  • Text-to-3D using Gaussian Splatting[8]:
    • Applies 3D Gaussian Splatting to text-to-3D generation
  • End-to-end Autonomous Driving: Challenges and Frontiers[9]:
    • Mentions 3D Gaussian Splatting as a data-driven sensor simulation method for autonomous driving
    • Highlights its ability to generate realistic novel views of a scene
  • SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering[10]:
    • Proposes a method to extract precise and fast meshes from 3D Gaussian Splatting
  • SplaTAM: Splat, Track and Map 3D Gaussians for Dense RGB-D SLAM[11]:
    • Applies 3D Gaussian-based radiance fields to Simultaneous Localization and Mapping (SLAM)
    • Leverages fast rendering and rich optimization capabilities of 3D Gaussians
    • Achieves state-of-the-art results in camera pose estimation, map estimation, and novel-view synthesis
  • Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models[12]:
    • Uses dynamic 3D Gaussians for 4D content creation from text

Authors

The authors of the 3D Gaussian Splatting for Real-Time Radiance Field Rendering[13] paper are Bernhard Kerbl, Georgios Kopanas, Thomas Leimkühler, and George Drettakis.

References

  1. ^ "3D Gaussian Splatting for Real-Time Radiance Field Rendering". repo-sam.inria.fr. Retrieved 2024-09-25.
  2. ^ a b c Fridovich-Keil, Sara; Yu, Alex; Tancik, Matthew; Chen, Qinhong; Recht, Benjamin; Kanazawa, Angjoo (June 2022). "Plenoxels: Radiance Fields without Neural Networks". 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 5491–5500. arXiv:2112.05131. doi:10.1109/cvpr52688.2022.00542. ISBN 978-1-6654-6946-3.
  3. ^ "3D Gaussian Splatting for Real-Time Radiance Field Rendering". repo-sam.inria.fr. Retrieved 2024-09-25.
  4. ^ a b Mildenhall, Ben; Srinivasan, Pratul P.; Tancik, Matthew; Barron, Jonathan T.; Ramamoorthi, Ravi; Ng, Ren (2020), "NeRF: Representing Scenes as Neural Radiance Fields for View Synthesis", Lecture Notes in Computer Science, Cham: Springer International Publishing, pp. 405–421, doi:10.1007/978-3-030-58452-8_24, ISBN 978-3-030-58451-1, retrieved 2024-09-25
  5. ^ a b Barron, Jonathan T.; Mildenhall, Ben; Verbin, Dor; Srinivasan, Pratul P.; Hedman, Peter (June 2022). "Mip-NeRF 360: Unbounded Anti-Aliased Neural Radiance Fields". 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 5460–5469. arXiv:2111.12077. doi:10.1109/cvpr52688.2022.00539. ISBN 978-1-6654-6946-3.
  6. ^ a b Müller, Thomas; Evans, Alex; Schied, Christoph; Keller, Alexander (July 2022). "Instant neural graphics primitives with a multiresolution hash encoding". ACM Transactions on Graphics. 41 (4): 1–15. doi:10.1145/3528223.3530127. ISSN 0730-0301.
  7. ^ Wu, Guanjun; Yi, Taoran; Fang, Jiemin; Xie, Lingxi; Zhang, Xiaopeng; Wei, Wei; Liu, Wenyu; Tian, Qi; Wang, Xinggang (2024-06-16). "4D Gaussian Splatting for Real-Time Dynamic Scene Rendering". 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 20310–20320. arXiv:2310.08528. doi:10.1109/cvpr52733.2024.01920. ISBN 979-8-3503-5300-6.
  8. ^ Chen, Zilong; Wang, Feng; Wang, Yikai; Liu, Huaping (2024-06-16). "Text-to-3D using Gaussian Splatting". 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 21401–21412. arXiv:2309.16585. doi:10.1109/cvpr52733.2024.02022. ISBN 979-8-3503-5300-6.
  9. ^ Chen, Li; Wu, Penghao; Chitta, Kashyap; Jaeger, Bernhard; Geiger, Andreas; Li, Hongyang (2024). "End-to-end Autonomous Driving: Challenges and Frontiers". IEEE Transactions on Pattern Analysis and Machine Intelligence. PP: 1–20. doi:10.1109/tpami.2024.3435937. ISSN 0162-8828. PMID 39078757.
  10. ^ Guédon, Antoine; Lepetit, Vincent (2024-06-16). "SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering". 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 5354–5363. arXiv:2311.12775. doi:10.1109/cvpr52733.2024.00512. ISBN 979-8-3503-5300-6.
  11. ^ Keetha, Nikhil; Karhade, Jay; Jatavallabhula, Krishna Murthy; Yang, Gengshan; Scherer, Sebastian; Ramanan, Deva; Luiten, Jonathon (2024-06-16). "SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM". 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 21357–21366. doi:10.1109/cvpr52733.2024.02018. ISBN 979-8-3503-5300-6.
  12. ^ Ling, Huan; Kim, Seung Wook; Torralba, Antonio; Fidler, Sanja; Kreis, Karsten (2024-06-16). "Align Your Gaussians: Text-to-4D with Dynamic 3D Gaussians and Composed Diffusion Models". 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE. pp. 8576–8588. arXiv:2312.13763. doi:10.1109/cvpr52733.2024.00819. ISBN 979-8-3503-5300-6.
  13. ^ Kerbl, Bernhard; Kopanas, Georgios; Leimkuehler, Thomas; Drettakis, George (2023-07-26). "3D Gaussian Splatting for Real-Time Radiance Field Rendering". ACM Transactions on Graphics. 42 (4): 1–14. doi:10.1145/3592433. ISSN 0730-0301.