Data Quality Management in Institutional Research Output Data Center

Bibliographic Details
Published in Database Systems for Advanced Applications Vol. 11448; pp. 142-157
Main Authors Shi, Xiaohua, Xing, Zhuoyuan, Lu, Hongtao
Format Book Chapter
Language English
Published Switzerland Springer International Publishing AG 2019
Springer International Publishing
Series Lecture Notes in Computer Science
Summary: An institutional research output data center stores normative, credible data on scholars' research output. It supports dynamic presentation of that output, reveals institutional academic publications along multiple dimensions, advances open access, and provides data support for subject evaluation and discipline development. In this paper, we propose a data quality management framework for building an institutional research output data center and put forward technical solutions to several data governance problems: department name similarity estimation in data matching, author name disambiguation in data merging, and security in data exchange. We also introduce learning algorithms such as text distance and community detection with matrix factorization. Compared with alternative approaches, our methods achieve good performance in data quality management.
ISBN: 3030185893, 9783030185893
ISSN: 0302-9743, 1611-3349
DOI: 10.1007/978-3-030-18590-9_10