Attention And Positional Encoding Are (Almost) All You Need For Shape Matching
Date
2023Metadata
Show full item recordAbstract
The fast development of novel approaches derived from the Transformers architecture has led to outstanding performance in different scenarios, from Natural Language Processing to Computer Vision. Recently, they achieved impressive results even in the challenging task of non-rigid shape matching. However, little is known about the capability of the Transformer-encoder architecture for the shape matching task, and its performances still remained largely unexplored. In this paper, we step back and investigate the contribution made by the Transformer-encoder architecture compared to its more recent alternatives, focusing on why and how it works on this specific task. Thanks to the versatility of our implementation, we can harness the bi-directional structure of the correspondence problem, making it more interpretable. Furthermore, we prove that positional encodings are essential for processing unordered point clouds. Through a comprehensive set of experiments, we find that attention and positional encoding are (almost) all you need for shape matching. The simple Transformer-encoder architecture, coupled with relative position encoding in the attention mechanism, is able to obtain strong improvements, reaching the current state-of-the-art.
BibTeX
@article {10.1111:cgf.14912,
journal = {Computer Graphics Forum},
title = {{Attention And Positional Encoding Are (Almost) All You Need For Shape Matching}},
author = {Raganato, Alessandro and Pasi, Gabriella and Melzi, Simone},
year = {2023},
publisher = {The Eurographics Association and John Wiley & Sons Ltd.},
ISSN = {1467-8659},
DOI = {10.1111/cgf.14912}
}
journal = {Computer Graphics Forum},
title = {{Attention And Positional Encoding Are (Almost) All You Need For Shape Matching}},
author = {Raganato, Alessandro and Pasi, Gabriella and Melzi, Simone},
year = {2023},
publisher = {The Eurographics Association and John Wiley & Sons Ltd.},
ISSN = {1467-8659},
DOI = {10.1111/cgf.14912}
}
Collections
Related items
Showing items related by title, author, creator and subject.
-
Rational Bézier Guarding
Khanteimouri, Payam; Mandad, Manish; Campen, Marcel (The Eurographics Association and John Wiley & Sons Ltd., 2022)We present a reliable method to generate planar meshes of nonlinear rational triangular elements. The elements are guaranteed to be valid, i.e. defined by injective rational functions. The mesh is guaranteed to conform ... -
VA + Embeddings STAR: A State-of-the-Art Report on the Use of Embeddings in Visual Analytics
Huang, Zeyang; Witschard, Daniel; Kucher, Kostiantyn; Kerren, Andreas (The Eurographics Association and John Wiley & Sons Ltd., 2023)Over the past years, an increasing number of publications in information visualization, especially within the field of visual analytics, have mentioned the term ''embedding'' when describing the computational approach. ... -
Teaching Game Programming in an Upper-level Computing Course Through the Development of a C++ Framework and Middleware
Hooper, Steffan; Wünsche, Burkhard C.; Denny, Paul; Luxton-Reilly, Andrew (The Eurographics Association, 2024)The game development industry has a programming skills shortage, with industry surveys often ranking game programming as the top skill-in-demand across small, mid-sized, and large triple-A (AAA) game studios. C++ programming ...