Content Visualization of Scientific Corpora Using an Extensible Relational Database Implementation

A method for supervised classification and visualization of collections of scientific publications is presented. By integrating a text classification module, which leads to class probability estimation, along with a dimensionality reduction technique, which represents each class in the 2-D space, an...

Full description

Saved in:
Bibliographic Details
Published inTheory and Practice of Digital Libraries -- TPDL 2013 Selected Workshops pp. 101 - 112
Main Authors Giannakopoulos, Theodoros, Stamatogiannakis, Eleftherios, Foufoulas, Ioannis, Dimitropoulos, Harry, Manola, Natalia, Ioannidis, Yannis
Format Book Chapter
LanguageEnglish
Published Cham Springer International Publishing
SeriesCommunications in Computer and Information Science
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:A method for supervised classification and visualization of collections of scientific publications is presented. By integrating a text classification module, which leads to class probability estimation, along with a dimensionality reduction technique, which represents each class in the 2-D space, any collection of unlabelled documents can be visualized. The classification and visualization modules have been trained on three different datasets and respective categorizations. We provide an example of our system’s functionality by visualizing the content of collections of publications which share a common funding scheme. In order to implement this, we have developed a funding mining submodule which identifies documents of particular funding schemes. All the individual modules have been implemented using the madIS system, which provides data analysis functionalities via an extended relational database.
ISBN:9783319084244
3319084240
ISSN:1865-0929
1865-0937
DOI:10.1007/978-3-319-08425-1_10