WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval

Xiao, Shishi; Hou, Yihan; Jin, Cheng; Zeng, Wei

View/Open

v42i3pp311-322_cgf14832.pdf (2.804Mb)

1068-file-i7.pdf (1.789Mb)

Date

2023

Author

Xiao, Shishi

Hou, Yihan

Jin, Cheng

Zeng, Wei

Pay-Per-View via TIB Hannover:

Try if this item/paper is available.

Metadata

Show full item record

Abstract

Retrieving charts from a large corpus is a fundamental task that can benefit numerous applications such as visualization recommendations. The retrieved results are expected to conform to both explicit visual attributes (e.g., chart type, colormap) and implicit user intents (e.g., design style, context information) that vary upon application scenarios. However, existing examplebased chart retrieval methods are built upon non-decoupled and low-level visual features that are hard to interpret, while definition-based ones are constrained to pre-defined attributes that are hard to extend. In this work, we propose a new framework, namely WYTIWYR (What-You-Think-Is-What-You-Retrieve), that integrates user intents into the chart retrieval process. The framework consists of two stages: first, the Annotation stage disentangles the visual attributes within the query chart; and second, the Retrieval stage embeds the user's intent with customized text prompt as well as bitmap query chart, to recall targeted retrieval result. We develop a prototype WYTIWYR system leveraging a contrastive language-image pre-training (CLIP) model to achieve zero-shot classification as well as multi-modal input encoding, and test the prototype on a large corpus with charts crawled from the Internet. Quantitative experiments, case studies, and qualitative interviews are conducted. The results demonstrate the usability and effectiveness of our proposed framework.

BibTeX

@article {10.1111:cgf.14832,
journal = {Computer Graphics Forum},
title = {{WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval}},
author = {Xiao, Shishi and Hou, Yihan and Jin, Cheng and Zeng, Wei},
year = {2023},
publisher = {The Eurographics Association and John Wiley & Sons Ltd.},
ISSN = {1467-8659},
DOI = {10.1111/cgf.14832}
}

URI

https://doi.org/10.1111/cgf.14832
https://diglib.eg.org:443/handle/10.1111/cgf14832

Collections

42-Issue 3