Classification framework and semantic labeling for Big Earth Data

Big Earth Data refers to the multidimensional integration and association of scientific data, including geography, resources, environment, ecology, and biology. An effective data classification system and label management strategy are important foundations for long-term management of data resources....

Full description

Saved in:
Bibliographic Details
Published inBig earth data Vol. ahead-of-print; no. ahead-of-print; pp. 1 - 18
Main Authors Wang, Juanle, Bu, Kun, Yan, Dongmei, Wang, Jingyue, Duan, Bowen, Zhang, Min, He, Guojin
Format Journal Article
LanguageEnglish
Published Taylor & Francis 03.07.2023
Taylor & Francis Group
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Big Earth Data refers to the multidimensional integration and association of scientific data, including geography, resources, environment, ecology, and biology. An effective data classification system and label management strategy are important foundations for long-term management of data resources. The objective of this study was to construct a classification system and realize multidimensional semantic data label management for the Big Earth Data Science Engineering Program (CASEarth). This study constructed two sets of classification and coding systems that realize classification by mapping each other; namely, the geosphere-level and Sustainable Development Goals (SDGs) indicator classifications. This technique was based on natural language processing technology and solved problems with subject-word segmentation, weight calculation, and dynamic matching. A prototype system for classification and label management was constructed based on existing CASEarth datasets of more than 1,100. Furthermore, we expect our study to provide the methodology and technical support for user-oriented classification and label management services for Big Earth Data.
ISSN:2096-4471
2574-5417
DOI:10.1080/20964471.2022.2123946