LINGO: Visually Debiasing Natural Language Instructions to Support Task Diversity

Arunkumar, Anjana; Sharma, Shubham; Agrawal, Rakhi; Chandrasekaran, Sriram; Bryan, Chris

dc.contributor.author	Arunkumar, Anjana	en_US
dc.contributor.author	Sharma, Shubham	en_US
dc.contributor.author	Agrawal, Rakhi	en_US
dc.contributor.author	Chandrasekaran, Sriram	en_US
dc.contributor.author	Bryan, Chris	en_US
dc.contributor.editor	Bujack, Roxana	en_US
dc.contributor.editor	Archambault, Daniel	en_US
dc.contributor.editor	Schreck, Tobias	en_US
dc.date.accessioned	2023-06-10T06:17:34Z
dc.date.available	2023-06-10T06:17:34Z
dc.date.issued	2023
dc.identifier.issn	1467-8659
dc.identifier.uri	https://doi.org/10.1111/cgf.14840
dc.identifier.uri	https://diglib.eg.org:443/handle/10.1111/cgf14840
dc.description.abstract	Cross-task generalization is a significant outcome that defines mastery in natural language understanding. Humans show a remarkable aptitude for this, and can solve many different types of tasks, given definitions in the form of textual instructions and a small set of examples. Recent work with pre-trained language models mimics this learning style: users can define and exemplify a task for the model to attempt as a series of natural language prompts or instructions. While prompting approaches have led to higher cross-task generalization compared to traditional supervised learning, analyzing 'bias' in the task instructions given to the model is a difficult problem, and has thus been relatively unexplored. For instance, are we truly modeling a task, or are we modeling a user's instructions? To help investigate this, we develop LINGO, a novel visual analytics interface that supports an effective, task-driven workflow to (1) help identify bias in natural language task instructions, (2) alter (or create) task instructions to reduce bias, and (3) evaluate pre-trained model performance on debiased task instructions. To robustly evaluate LINGO, we conduct a user study with both novice and expert instruction creators, over a dataset of 1,616 linguistic tasks and their natural language instructions, spanning 55 different languages. For both user groups, LINGO promotes the creation of more difficult tasks for pre-trained models, that contain higher linguistic diversity and lower instruction bias. We additionally discuss how the insights learned in developing and evaluating LINGO can aid in the design of future dashboards that aim to minimize the effort involved in prompt creation across multiple domains.	en_US
dc.publisher	The Eurographics Association and John Wiley & Sons Ltd.	en_US
dc.subject	CCS Concepts: Human-centered computing -> Visual analytics; Text input; Computing methodologies -> Natural language processing
dc.subject	Human-centered computing
dc.subject	Visual analytics
dc.subject	Text input
dc.subject	Computing methodologies
dc.subject	Natural language processing
dc.title	LINGO: Visually Debiasing Natural Language Instructions to Support Task Diversity	en_US
dc.description.seriesinformation	Computer Graphics Forum
dc.description.sectionheaders	Visualization and Machine Learning
dc.description.volume	42
dc.description.number	3
dc.identifier.doi	10.1111/cgf.14840
dc.identifier.pages	409-421
dc.identifier.pages	13 pages

Files in this item

Name: v42i3pp409-421_cgf14840.pdf
Size: 4.671Mb
Format: PDF

Name: 1207-file-i7.mp4
Size: 84.42Mb
Format: Unknown

Name: 1207-file-i8.pdf
Size: 27.27Kb
Format: PDF

This item appears in the following Collection(s)

42-Issue 3
EuroVis 2023 - Conference Proceedings
