Statistics-based Motion Synthesis for Social Conversations
Abstract
Plausible conversations among characters are required to generate the ambiance of social settings such as a restaurant, hotel lobby, or cocktail party. In this paper, we propose a motion synthesis technique that can rapidly generate animated motion for characters engaged in two-party conversations. Our system synthesizes gestures and other body motions for dyadic conversations that synchronize with novel input audio clips. Human conversations feature many different forms of coordination and synchronization. For example, speakers use hand gestures to emphasize important points, and listeners often nod in agreement or acknowledgment. To achieve the desired degree of realism, our method first constructs a motion graph that preserves the statistics of a database of recorded conversations performed by a pair of actors. This graph is then used to search for a motion sequence that respects three forms of audio-motion coordination in human conversations: coordination to phonemic clause, listener response, and partner's hesitation pause. We assess the quality of the generated animations through a user study that compares them to the originally recorded motion and evaluate the effects of each type of audio-motion coordination via ablation studies.
BibTeX
@article {10.1111:cgf.14114,
journal = {Computer Graphics Forum},
title = {{Statistics-based Motion Synthesis for Social Conversations}},
author = {Yang, Yanzhe and Yang, Jimei and Hodgins, Jessica},
year = {2020},
publisher = {The Eurographics Association and John Wiley \& Sons Ltd.},
ISSN = {1467-8659},
DOI = {10.1111/cgf.14114}
}