Cinematographic Camera Diffusion Model

Jiang, Hongda; Wang, Xi; Christie, Marc; Liu, Libin; Chen, Baoquan

dc.contributor.author	Jiang, Hongda	en_US
dc.contributor.author	Wang, Xi	en_US
dc.contributor.author	Christie, Marc	en_US
dc.contributor.author	Liu, Libin	en_US
dc.contributor.author	Chen, Baoquan	en_US
dc.contributor.editor	Bermano, Amit H.	en_US
dc.contributor.editor	Kalogerakis, Evangelos	en_US
dc.date.accessioned	2024-04-16T14:43:16Z
dc.date.available	2024-04-16T14:43:16Z
dc.date.issued	2024
dc.identifier.issn	1467-8659
dc.identifier.uri	https://doi.org/10.1111/cgf.15055
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf15055
dc.description.abstract	Designing effective camera trajectories in virtual 3D environments is a challenging task even for experienced animators. Despite an elaborate film grammar, forged through years of experience, that enables the specification of camera motions through cinematographic properties (framing, shots sizes, angles, motions), there are endless possibilities in deciding how to place and move cameras with characters. Dealing with these possibilities is part of the complexity of the problem. While numerous techniques have been proposed in the literature (optimization-based solving, encoding of empirical rules, learning from real examples,...), the results either lack variety or ease of control. In this paper, we propose a cinematographic camera diffusion model using a transformer-based architecture to handle temporality and exploit the stochasticity of diffusion models to generate diverse and qualitative trajectories conditioned by high-level textual descriptions. We extend the work by integrating keyframing constraints and the ability to blend naturally between motions using latent interpolation, in a way to augment the degree of control of the designers. We demonstrate the strengths of this text-to-camera motion approach through qualitative and quantitative experiments and gather feedback from professional artists. The code and data are available at https://github.com/jianghd1996/Camera-control.	en_US
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Computing methodologies -> Procedural animation; Artificial intelligence
dc.subject	Computing methodologies
dc.subject	Procedural animation
dc.subject	Artificial intelligence
dc.title	Cinematographic Camera Diffusion Model	en_US
dc.description.seriesinformation	Computer Graphics Forum
dc.description.sectionheaders	Camera Paths and Motion Tracking
dc.description.volume	43
dc.description.number	2
dc.identifier.doi	10.1111/cgf.15055
dc.identifier.pages	14 pages

Files in this item

Name:: v43i2_51_15055.pdf
Size:: 43.10Mb
Format:: PDF

View/Open

Name:: paper1140.mp4
Size:: 63.42Mb
Format:: Unknown

View/Open

Name:: paper1140.pdf
Size:: 76.97Kb
Format:: PDF

View/Open

This item appears in the following Collection(s)

43-Issue 2
EG 2024 - Conference Issue
EG 2024 - Full Papers - CGF 43-Issue 2

Show simple item record