Show simple item record

dc.contributor.authorMetz, Yannicken_US
dc.contributor.authorBykovets, Eugeneen_US
dc.contributor.authorJoos, Lucasen_US
dc.contributor.authorKeim, Danielen_US
dc.contributor.authorEl-Assady, Mennatallahen_US
dc.contributor.editorBujack, Roxanaen_US
dc.contributor.editorArchambault, Danielen_US
dc.contributor.editorSchreck, Tobiasen_US
dc.date.accessioned2023-06-10T06:17:30Z
dc.date.available2023-06-10T06:17:30Z
dc.date.issued2023
dc.identifier.issn1467-8659
dc.identifier.urihttps://doi.org/10.1111/cgf.14839
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf14839
dc.description.abstractUnderstanding the behavior of deep reinforcement learning agents is a crucial requirement throughout their development. Existing work has addressed the identification of observable behavioral patterns in state sequences or analysis of isolated internal representations; however, the overall decision-making of deep-learning RL agents remains opaque. To tackle this, we present VISITOR, a visual analytics system enabling the analysis of entire state sequences, the diagnosis of singular predictions, and the comparison between agents. A sequence embedding view enables the multiscale analysis of state sequences, utilizing custom embedding techniques for a stable spatialization of the observations and internal states. We provide multiple layers: (1) a state space embedding, highlighting different groups of states inside the state-action sequences, (2) a trajectory view, emphasizing decision points, (3) a network activation mapping, visualizing the relationship between observations and network activations, (4) a transition embedding, enabling the analysis of state-to-state transitions. The embedding view is accompanied by an interactive reward view that captures the temporal development of metrics, which can be linked directly to states in the embedding. Lastly, a model list allows for the quick comparison of models across multiple metrics. Annotations can be exported to communicate results to different audiences. Our two-stage evaluation with eight experts confirms the effectiveness in identifying states of interest, comparing the quality of policies, and reasoning about the internal decision-making processes.en_US
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by-nc/4.0/
dc.subjectCCS Concepts: Human-centered computing -> Visual analytics; Computing methodologies -> Reinforcement learning
dc.subjectHuman centered computing
dc.subjectVisual analytics
dc.subjectComputing methodologies
dc.subjectReinforcement learning
dc.titleVISITOR: Visual Interactive State Sequence Exploration for Reinforcement Learningen_US
dc.description.seriesinformationComputer Graphics Forum
dc.description.sectionheadersVisualization and Machine Learning
dc.description.volume42
dc.description.number3
dc.identifier.doi10.1111/cgf.14839
dc.identifier.pages397-408
dc.identifier.pages12 pages


Files in this item

Thumbnail
Thumbnail
Thumbnail

This item appears in the following Collection(s)

  • 42-Issue 3
    EuroVis 2023 - Conference Proceedings

Show simple item record

Attribution 4.0 International License
Except where otherwise noted, this item's license is described as Attribution 4.0 International License