Factored Neural Representation for Scene Understanding

Wong, Yu-Shiang; Mitra, Niloy J.

dc.contributor.author	Wong, Yu-Shiang	en_US
dc.contributor.author	Mitra, Niloy J.	en_US
dc.contributor.editor	Memari, Pooran	en_US
dc.contributor.editor	Solomon, Justin	en_US
dc.date.accessioned	2023-06-30T06:19:13Z
dc.date.available	2023-06-30T06:19:13Z
dc.date.issued	2023
dc.identifier.issn	1467-8659
dc.identifier.uri	https://doi.org/10.1111/cgf.14911
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14911
dc.description.abstract	A long-standing goal in scene understanding is to obtain interpretable and editable representations that can be directly constructed from a raw monocular RGB-D video, without requiring specialized hardware setup or priors. The problem is significantly more challenging in the presence of multiple moving and/or deforming objects. Traditional methods have approached the setup with a mix of simplifications, scene priors, pretrained templates, or known deformation models. The advent of neural representations, especially neural implicit representations and radiance fields, opens the possibility of end-to-end optimization to collectively capture geometry, appearance, and object motion. However, current approaches produce global scene encoding, assume multiview capture with limited or no motion in the scenes, and do not facilitate easy manipulation beyond novel view synthesis. In this work, we introduce a factored neural scene representation that can directly be learned from a monocular RGB-D video to produce object-level neural presentations with an explicit encoding of object movement (e.g., rigid trajectory) and/or deformations (e.g., nonrigid movement). We evaluate ours against a set of neural approaches on both synthetic and real data to demonstrate that the representation is efficient, interpretable, and editable (e.g., change object trajectory). Code and data are available at: http://geometry.cs.ucl.ac.uk/projects/2023/factorednerf/.	en_US
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Computing methodologies -> Reconstruction; Volumetric models; Tracking
dc.subject	Computing methodologies
dc.subject	Reconstruction
dc.subject	Volumetric models
dc.subject	Tracking
dc.title	Factored Neural Representation for Scene Understanding	en_US
dc.description.seriesinformation	Computer Graphics Forum
dc.description.sectionheaders	Point Clouds and Scenes
dc.description.volume	42
dc.description.number	5
dc.identifier.doi	10.1111/cgf.14911
dc.identifier.pages	14 pages

Files in this item

Name:: v42i5_15_14911.pdf
Size:: 42.23Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

42-Issue 5
Geometry Processing 2023 - Symposium Proceedings

Show simple item record