dc.contributor.author | Almushyti, Muna | en_US |
dc.contributor.author | Li, Frederick W. B. | en_US |
dc.contributor.editor | Vidal, Franck P. and Tam, Gary K. L. and Roberts, Jonathan C. | en_US |
dc.date.accessioned | 2019-09-11T05:09:11Z | |
dc.date.available | 2019-09-11T05:09:11Z | |
dc.date.issued | 2019 | |
dc.identifier.isbn | 978-3-03868-096-3 | |
dc.identifier.uri | https://doi.org/10.2312/cgvc.20191269 | |
dc.identifier.uri | https://diglib.eg.org:443/handle/10.2312/cgvc20191269 | |
dc.description.abstract | Recognising Human-object interactions (HOIs) in videos is a challenge task especially when a human can interact with multiple objects. This paper attempts to solve the problem of HOIs by proposing a hierarchical framework that analyzes human-object interactions from a video sequence. The framework consists of LSTMs that firstly capture both human motion and temporal object information independently, followed by fusing these information through a bilinear layer to aggregate human-object features, which are then fed to a global deep LSTM to learn high-level information of HOIs. The proposed approach applies an attention mechanism to LSTMs in order to focus on important parts of human and object temporal information. | en_US |
dc.publisher | The Eurographics Association | en_US |
dc.subject | Computing methodologies | |
dc.subject | Human | |
dc.subject | object interactions (HOIs) | |
dc.subject | LSTM | |
dc.subject | CNN | |
dc.subject | Hierarchical design | |
dc.subject | Temporal information | |
dc.subject | Attention | |
dc.title | Recognising Human-Object Interactions Using Attention-based LSTMs | en_US |
dc.description.seriesinformation | Computer Graphics and Visual Computing (CGVC) | |
dc.description.sectionheaders | Short Papers | |
dc.identifier.doi | 10.2312/cgvc.20191269 | |
dc.identifier.pages | 135-139 | |