Pixel2Mesh: Generating 3D Mesh Models from Single RGB Images

3 Aug 2018 | Nanyang Wang, Yinda Zhang, Zhuwen Li, Yanwei Fu, Wei Liu, Yu-Gang Jiang
Pixel2Mesh is a deep learning method that generates a 3D triangular mesh model from a single RGB image. It represents mesh geometry with a graph-based convolutional neural network (GCN) that progressively deforms an initial ellipsoid to match the input image, using a coarse-to-fine strategy and several mesh-specific losses to produce results that are both visually appealing and physically plausible. Key contributions include an end-to-end neural network architecture, a projection layer that integrates perceptual image features into the GCN, and a coarse-to-fine scheme for 3D geometry prediction. Evaluated on ShapeNet and real-world images, the method outperforms state-of-the-art approaches in detail preservation and 3D shape estimation accuracy. It addresses the challenge of representing mesh models in neural networks, effectively integrates perceptual features for accurate 3D reconstruction, and demonstrates the effectiveness of graph-based representations for 3D shape learning.
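The mesh-deformation step described above can be sketched as a graph convolution over mesh vertices, where each vertex updates its features from itself and its neighbors. The layer below is a minimal assumed form (GCN-style neighbor averaging with a ReLU, and hypothetical weight matrices `W0`/`W1`), not the paper's exact formulation:

```python
import numpy as np

def graph_conv(X, neighbors, W0, W1):
    """One graph-convolution layer over mesh vertices (illustrative sketch).

    X         : (N, Fin) per-vertex feature matrix
    neighbors : list of N lists, neighbors[i] = indices adjacent to vertex i
    W0, W1    : (Fin, Fout) learned weights for self and neighbor terms
    Returns   : (N, Fout) updated vertex features
    """
    # Aggregate each vertex's neighborhood by averaging neighbor features.
    agg = np.stack([X[nb].mean(axis=0) for nb in neighbors])
    # Combine self and neighbor contributions, then apply ReLU.
    return np.maximum(X @ W0 + agg @ W1, 0.0)
```

Stacking such layers lets local image-conditioned features propagate across the mesh surface as the ellipsoid is deformed.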
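The projection layer that integrates perceptual features can be sketched as follows: each 3D vertex is projected into the image plane with a pinhole camera model, and image features are bilinearly sampled at the projected location. The intrinsics and feature-map shapes below are illustrative placeholders, not the paper's values:

```python
import numpy as np

def project_vertices(verts, f=2.0, cx=4.0, cy=4.0):
    """Pinhole projection of 3D vertices (N, 3) to pixel coords (N, 2).

    f, cx, cy are assumed camera intrinsics for illustration only.
    """
    x, y, z = verts[:, 0], verts[:, 1], verts[:, 2]
    u = cx + f * x / z
    v = cy + f * y / z
    return np.stack([u, v], axis=1)

def bilinear_pool(feat, uv):
    """Sample a (H, W, C) feature map at (N, 2) pixel coords -> (N, C)."""
    H, W, _ = feat.shape
    u = np.clip(uv[:, 0], 0, W - 1)
    v = np.clip(uv[:, 1], 0, H - 1)
    u0, v0 = np.floor(u).astype(int), np.floor(v).astype(int)
    u1, v1 = np.minimum(u0 + 1, W - 1), np.minimum(v0 + 1, H - 1)
    du, dv = (u - u0)[:, None], (v - v0)[:, None]
    # Weighted sum of the four surrounding feature-map entries.
    return (feat[v0, u0] * (1 - du) * (1 - dv)
            + feat[v0, u1] * du * (1 - dv)
            + feat[v1, u0] * (1 - du) * dv
            + feat[v1, u1] * du * dv)
```

The pooled per-vertex image features are then concatenated with the vertex's geometric features before the next round of graph convolutions.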