Pregled bibliografske jedinice broj: 537377
Visualization of temporal text collections based on Correspondence Analysis
Visualization of temporal text collections based on Correspondence Analysis // Expert systems with applications, 39 (2012), 15; 12143-12157 doi:10.1016/j.eswa.2012.04.040 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 537377 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Visualization of temporal text collections based on Correspondence Analysis
Autori
Šilić, Artur ; Morin, Annie ; Chauchat, Jean-Hugues ; Dalbelo Bašić, Bojana
Izvornik
Expert systems with applications (0957-4174) 39
(2012), 15;
12143-12157
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
information visualization; singular value decomposition; clustering; text analytics
Sažetak
In this paper, we present CatViz—Temporally-Sliced Correspondence Analysis Visualization. This novel method visualizes relationships through time and is suitable for large-scale temporal multivariate data. We couple CatViz with clustering methods, whereupon we introduce the concept of final centroid transfer, which enables the correspondence of clusters in time. Although CatViz can be used on any type of temporal data, we show how it can be applied to the task of exploratory visual analysis of text collections. We present a successful concept of employing feature-type filtering to present different aspects of textual data. We performed case studies on large collections of French and English news articles. In addition, we conducted a user study that confirms the usefulness of our method. We present typical tasks of exploratory text analysis and discuss application procedures that an analyst might perform. We believe that CatViz is general and highly applicable to large data sets because of its intuitiveness, effectiveness, and robustness. We expect that it will enable a better understanding of texts in huge historical archives.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
036-1300646-1986 - Otkrivanje znanja u tekstnim podacima (Dalbelo-Bašić, Bojana, MZO ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Citiraj ovu publikaciju:
Časopis indeksira:
- Current Contents Connect (CCC)
- Web of Science Core Collection (WoSCC)
- Science Citation Index Expanded (SCI-EXP)
- SCI-EXP, SSCI i/ili A&HCI
- Scopus