StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting

12 Mar 2024 | Kunhao Liu1, Fangneng Zhan2, Muyu Xu1, Christian Theobalt2, Ling Shao3, and Shijian Lu1
**StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting** **Authors:** Kunhao Liu, Fangneng Zhan, Muyu Xu, Christian Theobalt, Ling Shao, Shijian Lu **Institution:** Nanyang Technological University, Max Planck Institute for Informatics, UCAS-Terminus AI Lab, UCAS **Abstract:** StyleGaussian is a novel 3D style transfer technique that enables instant style transfer of any image's style to a 3D scene at 10 frames per second (fps). It leverages 3D Gaussian Splatting (3DGS) to achieve real-time rendering and multi-view consistency without compromising style transfer quality. The process consists of three steps: embedding, transfer, and decoding. Initially, 2D VGG scene features are embedded into reconstructed 3D Gaussians. Next, the embedded features are transformed according to a reference style image. Finally, the transformed features are decoded into the stylized RGB. StyleGaussian introduces an efficient feature rendering strategy that first renders low-dimensional features and then maps them to high-dimensional features, significantly reducing memory consumption. It also employs a K-nearest-neighbor-based 3D CNN as the decoder, preserving strict multi-view consistency. Extensive experiments demonstrate superior stylization quality, real-time rendering, and multi-view consistency. **Contributions:** 1. Introduces StyleGaussian, a novel 3D style transfer pipeline for instant style transfer with real-time rendering and strict multi-view consistency. 2. Develops an efficient feature rendering strategy to handle high-dimensional VGG features. 3. Designs a KNN-based 3D CNN for decoding stylized features, maintaining multi-view consistency. **Keywords:** 3D Gaussian Splatting, 3D Style Transfer, 3D Editing **Related Work:** - **Radiance Fields:** Advances in radiance fields for 3D reconstruction. - **3D Appearance Editing:** Challenges and methods for editing 3D representations. - **Neural Style Transfer:** Techniques for neural style transfer in 2D and 3D. **Preliminary: 3D Gaussian Splatting** - Representation of 3D scenes using 3D Gaussians. - Rapid reconstruction and rendering capabilities. **StyleGaussian: Overview** - Embedding VGG features into 3D Gaussians. - Transforming features based on a style image. - Decoding transformed features back to RGB. **Feature Embedding:** - Embedding VGG features into 3D Gaussians. - Efficient rendering strategy for high-dimensional features. **Style Transfer:** - Using AdaIN for parameter-free style transfer. **RGB Decoding:** - KNN-based 3D CNN for decoding transformed features to RGB. **Experiments:** - Evaluations on real-world datasets. - Comparison with state-of-the-art methods.**StyleGaussian: Instant 3D Style Transfer with Gaussian Splatting** **Authors:** Kunhao Liu, Fangneng Zhan, Muyu Xu, Christian Theobalt, Ling Shao, Shijian Lu **Institution:** Nanyang Technological University, Max Planck Institute for Informatics, UCAS-Terminus AI Lab, UCAS **Abstract:** StyleGaussian is a novel 3D style transfer technique that enables instant style transfer of any image's style to a 3D scene at 10 frames per second (fps). It leverages 3D Gaussian Splatting (3DGS) to achieve real-time rendering and multi-view consistency without compromising style transfer quality. The process consists of three steps: embedding, transfer, and decoding. Initially, 2D VGG scene features are embedded into reconstructed 3D Gaussians. Next, the embedded features are transformed according to a reference style image. Finally, the transformed features are decoded into the stylized RGB. StyleGaussian introduces an efficient feature rendering strategy that first renders low-dimensional features and then maps them to high-dimensional features, significantly reducing memory consumption. It also employs a K-nearest-neighbor-based 3D CNN as the decoder, preserving strict multi-view consistency. Extensive experiments demonstrate superior stylization quality, real-time rendering, and multi-view consistency. **Contributions:** 1. Introduces StyleGaussian, a novel 3D style transfer pipeline for instant style transfer with real-time rendering and strict multi-view consistency. 2. Develops an efficient feature rendering strategy to handle high-dimensional VGG features. 3. Designs a KNN-based 3D CNN for decoding stylized features, maintaining multi-view consistency. **Keywords:** 3D Gaussian Splatting, 3D Style Transfer, 3D Editing **Related Work:** - **Radiance Fields:** Advances in radiance fields for 3D reconstruction. - **3D Appearance Editing:** Challenges and methods for editing 3D representations. - **Neural Style Transfer:** Techniques for neural style transfer in 2D and 3D. **Preliminary: 3D Gaussian Splatting** - Representation of 3D scenes using 3D Gaussians. - Rapid reconstruction and rendering capabilities. **StyleGaussian: Overview** - Embedding VGG features into 3D Gaussians. - Transforming features based on a style image. - Decoding transformed features back to RGB. **Feature Embedding:** - Embedding VGG features into 3D Gaussians. - Efficient rendering strategy for high-dimensional features. **Style Transfer:** - Using AdaIN for parameter-free style transfer. **RGB Decoding:** - KNN-based 3D CNN for decoding transformed features to RGB. **Experiments:** - Evaluations on real-world datasets. - Comparison with state-of-the-art methods.
Reach us at info@study.space