Data Quality Management in Institutional Research Output Data Center
Institutional research output data center will store normative and convinced scholar’s research output data, and it will effectively support dynamic presentation of research output, reveal institutional academic publication in multiple dimensions, advance open access, and provide data support for su...
Saved in:
Published in | Database Systems for Advanced Applications Vol. 11448; pp. 142 - 157 |
---|---|
Main Authors | , , |
Format | Book Chapter |
Language | English |
Published |
Switzerland
Springer International Publishing AG
2019
Springer International Publishing |
Series | Lecture Notes in Computer Science |
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | Institutional research output data center will store normative and convinced scholar’s research output data, and it will effectively support dynamic presentation of research output, reveal institutional academic publication in multiple dimensions, advance open access, and provide data support for subject evaluation and discipline development.
In this paper, we propose a data quality management framework to build institutional research output data center, and put forward relevant technical solution for different data governance problems, such as department name similarity estimation in data matching, author name disambiguous problem in data merging and security issue in data exchange. We also introduce some learning algorithms such as text distance and community detection with matrix factorization. Comparing with different ways, our methods achieve good performance in quality manage processing. |
---|---|
ISBN: | 3030185893 9783030185893 |
ISSN: | 0302-9743 1611-3349 |
DOI: | 10.1007/978-3-030-18590-9_10 |