dc.contributor.author | Müller-Huschke, Julian | en_US |
dc.contributor.author | Ritter, Marcel | en_US |
dc.contributor.author | Harders, Matthias | en_US |
dc.contributor.editor | Pelechano, Nuria | en_US |
dc.contributor.editor | Vanderhaeghe, David | en_US |
dc.date.accessioned | 2022-04-22T08:16:11Z | |
dc.date.available | 2022-04-22T08:16:11Z | |
dc.date.issued | 2022 | |
dc.identifier.isbn | 978-3-03868-169-4 | |
dc.identifier.issn | 1017-4656 | |
dc.identifier.uri | https://doi.org/10.2312/egs.20221026 | |
dc.identifier.uri | https://diglib.eg.org:443/handle/10.2312/egs20221026 | |
dc.description.abstract | Most current research on automatically captioning and describing scenes with spatial content focuses on images. We outline that generating descriptive text for a synthesized 3D scene can be achieved via a suitable intermediate representation employed in the synthesis algorithm. As an example, we synthesize scenes of medieval village settings, and generate their descriptions. Our system employs graph grammars, Markov Chain Monte Carlo optimization, and a natural language generation pipeline. Randomly placed objects are evaluated and optimized by a cost function capturing neighborhood relations, path layouts, and collisions. Further, in a pilot study we assess the performance of our framework by comparing the generated descriptions to others provided by human subjects. While the latter were often short and low-effort, the highest-rated ones clearly outperform our generated ones. Nevertheless, the average of all collected human descriptions was indeed rated by the study participants as being less accurate than the automated ones. | en_US |
dc.publisher | The Eurographics Association | en_US |
dc.rights | Attribution 4.0 International License | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | CCS Concepts: Computing methodologies --> Computer graphics; Natural language generation | |
dc.subject | Computing methodologies | |
dc.subject | Computer graphics | |
dc.subject | Natural language generation | |
dc.title | Scene Synthesis with Automated Generation of Textual Descriptions | en_US |
dc.description.seriesinformation | Eurographics 2022 - Short Papers | |
dc.description.sectionheaders | Procedural Modelling | |
dc.identifier.doi | 10.2312/egs.20221026 | |
dc.identifier.pages | 33-36 | |
dc.identifier.pages | 4 pages | |