Show simple item record

dc.contributor.authorJaunet, Theoen_US
dc.contributor.authorVuillemot, Romainen_US
dc.contributor.authorWolf, Christianen_US
dc.contributor.editorViola, Ivan and Gleicher, Michael and Landesberger von Antburg, Tatianaen_US
dc.date.accessioned2020-05-24T12:59:59Z
dc.date.available2020-05-24T12:59:59Z
dc.date.issued2020
dc.identifier.issn1467-8659
dc.identifier.urihttps://doi.org/10.1111/cgf.13962
dc.identifier.urihttps://diglib.eg.org:443/handle/10.1111/cgf13962
dc.description.abstractWe present DRLViz, a visual analytics interface to interpret the internal memory of an agent (e.g. a robot) trained using deep reinforcement learning. This memory is composed of large temporal vectors updated when the agent moves in an environment and is not trivial to understand due to the number of dimensions, dependencies to past vectors, spatial/temporal correlations, and co-correlation between dimensions. It is often referred to as a black box as only inputs (images) and outputs (actions) are intelligible for humans. Using DRLViz, experts are assisted to interpret decisions using memory reduction interactions, and to investigate the role of parts of the memory when errors have been made (e.g. wrong direction). We report on DRLViz applied in the context of video games simulators (ViZDoom) for a navigation scenario with item gathering tasks. We also report on experts evaluation using DRLViz, and applicability of DRLViz to other scenarios and navigation problems beyond simulation games, as well as its contribution to black box models interpretability and explain-ability in the field of visual analytics.en_US
dc.publisherThe Eurographics Association and John Wiley & Sons Ltd.en_US
dc.rightsAttribution 4.0 International License
dc.rights.urihttps://creativecommons.org/licenses/by/4.0/]
dc.subjectHuman centered computing
dc.subjectVisual analytics
dc.subjectTheory of computation
dc.subjectReinforcement learning
dc.titleDRLViz: Understanding Decisions and Memory in Deep Reinforcement Learningen_US
dc.description.seriesinformationComputer Graphics Forum
dc.description.sectionheadersVisualization Applications and Machine Learning
dc.description.volume39
dc.description.number3
dc.identifier.doi10.1111/cgf.13962
dc.identifier.pages49-61


Files in this item

Thumbnail
Thumbnail

This item appears in the following Collection(s)

  • 39-Issue 3
    EuroVis 2020 - Conference Proceedings

Show simple item record

Attribution 4.0 International License
Except where otherwise noted, this item's license is described as Attribution 4.0 International License