
PIFU-RGBD: Single-view RGB-D Pixel-aligned Implicit Function for 3D Human Reconstruction

2024-05-14


Wang, Yingli; Zhang, Liping; Li, Weijun; Dong, Xiaoli; Li, Li; Qin, Hong

Source: 2023 International Conference on High Performance Big Data and Intelligent Systems, HDIS 2023, p 93-99, 2023

Abstract:

Recent advances in image-based parsing of human bodies have been driven by significant improvements in deep learning methods for 2D image processing. Although current methods demonstrate outstanding global reconstruction capability, they still fail to resolve the depth ambiguity inherent in 2D images. In this paper, we propose PIFU-RGBD, a new pixel-aligned implicit function representation method that reconstructs a complete and detailed 3D human from a single RGB-D image. The method has two stages. In the first stage, a single RGB-D image is transformed into a single-view human point cloud, a single-view mesh is modeled from the point cloud data, and a binocular view is rendered. In the second stage, the depth information and voxel-aligned features of the binocular view are obtained through a stereoscopic vision network and fed into the implicit function estimation network. The Marching Cubes algorithm is then used to obtain a complete 3D reconstruction of the human body model. Notably, RGB-D images captured by any camera can be converted in the first stage into input with unified camera parameters, which makes the depth information and voxel-aligned features extracted in the second stage camera-independent. The trained network performs depth-aware reconstruction under unified parameter settings. Compared with previous works, the proposed method effectively reduces the pose ambiguity of human model reconstruction from single-view input and significantly improves reconstruction accuracy. Compared with the current SOTA method that reconstructs the complete human body from single-view RGB-D input, the proposed scheme can reconstruct a human body model with accurate posture from data captured by cameras with different parameters, and has stronger generalization capability.
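
The first-stage idea of turning a depth image into a point cloud and re-expressing it under unified camera parameters can be illustrated with a minimal sketch. It assumes a simple pinhole camera model; the function names, the intrinsics `fx`, `fy`, `cx`, `cy`, and the direct point re-projection are illustrative assumptions, not the authors' implementation (the abstract describes modeling a mesh and rendering a binocular view, which is not reproduced here).

```python
from typing import List, Sequence, Tuple

Point = Tuple[float, float, float]


def depth_to_point_cloud(
    depth: Sequence[Sequence[float]],
    fx: float, fy: float, cx: float, cy: float,
) -> List[Point]:
    """Back-project a depth map into camera-space 3D points.

    Assumes a pinhole model; depth[v][u] is the distance along the
    optical axis, and values <= 0 mean "no reading" (hypothetical convention).
    """
    points: List[Point] = []
    for v, row in enumerate(depth):
        for u, z in enumerate(row):
            if z <= 0:
                continue  # skip pixels without a valid depth reading
            x = (u - cx) * z / fx
            y = (v - cy) * z / fy
            points.append((x, y, z))
    return points


def project_to_unified_camera(
    points: Sequence[Point],
    fx: float, fy: float, cx: float, cy: float,
) -> List[Point]:
    """Project 3D points onto a virtual camera with fixed ("unified") intrinsics.

    Returns (u, v, z) triples: pixel coordinates plus the retained depth.
    """
    return [(fx * x / z + cx, fy * y / z + cy, z) for x, y, z in points]


if __name__ == "__main__":
    # Tiny 2x2 depth map from a hypothetical source camera.
    depth = [[0.0, 2.0],
             [1.0, 0.0]]
    cloud = depth_to_point_cloud(depth, fx=2.0, fy=2.0, cx=0.5, cy=0.5)
    # Re-project under different, fixed intrinsics shared by all inputs.
    unified = project_to_unified_camera(cloud, fx=4.0, fy=4.0, cx=1.0, cy=1.0)
    print(cloud)
    print(unified)
```

Because every input is re-expressed under the same virtual intrinsics before it reaches the network, the features computed afterwards do not depend on the parameters of the camera that captured the image, which is the property the abstract calls camera-independent.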

©2023 IEEE. (20 refs.)



