SimBaTex: Similarity-based Text Exploration
Abstract
Natural language processing in combination with visualization can provide efficient ways to discover latent patterns of similarity which can be useful for exploring large sets of text documents. In this poster abstract, we describe the ongoing work on a visual analytics application, called SimBaTex, which is based on embedding technology, dynamic specification of similarity criteria, and a novel approach for similarity-based clustering. The goal of SimBaTex is to provide search-and-explore functionality to enable the user to identify items of interest in a large set of text documents by interactive assessment of both high-level similarity patterns and pairwise similarity of chosen texts.
BibTeX
@inproceedings {10.2312:evp.20211067,
booktitle = {EuroVis 2021 - Posters},
editor = {Byška, Jan and Jänicke, Stefan and Schmidt, Johanna},
title = {{SimBaTex: Similarity-based Text Exploration}},
author = {Witschard, Daniel and Jusufi, Ilir and Kerren, Andreas},
year = {2021},
publisher = {The Eurographics Association},
ISBN = {978-3-03868-144-1},
DOI = {10.2312/evp.20211067}
}
booktitle = {EuroVis 2021 - Posters},
editor = {Byška, Jan and Jänicke, Stefan and Schmidt, Johanna},
title = {{SimBaTex: Similarity-based Text Exploration}},
author = {Witschard, Daniel and Jusufi, Ilir and Kerren, Andreas},
year = {2021},
publisher = {The Eurographics Association},
ISBN = {978-3-03868-144-1},
DOI = {10.2312/evp.20211067}
}