Covering assisted intuitionistic fuzzy bi-selection technique for data reduction and its applications

The dimension and size of data is growing rapidly with the extensive applications of computer science and lab based engineering in daily life. Due to availability of vagueness, later uncertainty, redundancy, irrelevancy, and noise, which imposes concerns in building effective learning models. Fuzzy...

Full description

Saved in:

Bibliographic Details
Published in	Scientific reports Vol. 14; no. 1; pp. 13568 - 23
Main Authors	Saini, Rajat, Tiwari, Anoop Kumar, Nath, Abhigyan, Singh, Phool, Maurya, S. P., Shah, Mohd Asif
Format	Journal Article
Language	English
Published	London Nature Publishing Group UK 12.06.2024 Nature Publishing Group Nature Portfolio
Subjects	631/114 639/705 692/700 Computer science Data collection Data reduction Datasets Dimensionality reduction Feature selection Fuzzy sets Genetic algorithms Granular structure Humanities and Social Sciences IF set Instance selection Machine learning Mathematical models Medical research Methods multidisciplinary Optimization Peptides R&D Research & development Rough set Science Science (multidisciplinary) Set theory Dimensionality reduction Instance selection IF set Granular structure Rough set
Online Access	Get full text

Cover

Loading…

More Information
Summary:	The dimension and size of data is growing rapidly with the extensive applications of computer science and lab based engineering in daily life. Due to availability of vagueness, later uncertainty, redundancy, irrelevancy, and noise, which imposes concerns in building effective learning models. Fuzzy rough set and its extensions have been applied to deal with these issues by various data reduction approaches. However, construction of a model that can cope with all these issues simultaneously is always a challenging task. None of the studies till date has addressed all these issues simultaneously. This paper investigates a method based on the notions of intuitionistic fuzzy (IF) and rough sets to avoid these obstacles simultaneously by putting forward an interesting data reduction technique. To accomplish this task, firstly, a novel IF similarity relation is addressed. Secondly, we establish an IF rough set model on the basis of this similarity relation. Thirdly, an IF granular structure is presented by using the established similarity relation and the lower approximation. Next, the mathematical theorems are used to validate the proposed notions. Then, the importance-degree of the IF granules is employed for redundant size elimination. Further, significance-degree-preserved dimensionality reduction is discussed. Hence, simultaneous instance and feature selection for large volume of high-dimensional datasets can be performed to eliminate redundancy and irrelevancy in both dimension and size, where vagueness and later uncertainty are handled with rough and IF sets respectively, whilst noise is tackled with IF granular structure. Thereafter, a comprehensive experiment is carried out over the benchmark datasets to demonstrate the effectiveness of simultaneous feature and data point selection methods. Finally, our proposed methodology aided framework is discussed to enhance the regression performance for IC50 of Antiviral Peptides.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	2045-2322 2045-2322
DOI:	10.1038/s41598-024-62099-8