EndToEndML: An Open-Source End-to-End Pipeline for Machine Learning Applications

Bibliographic Details
Published in: 2024 7th International Conference on Information and Computer Technologies (ICICT), pp. 350-358
Main Authors: Pillai, Nisha; Das, Athish Ram; Ayoola, Moses; Gireesan, Ganga; Nanduri, Bindu; Ramkumar, Mahalingam
Format: Conference Proceeding
Language: English
Published: IEEE, 15.03.2024
Summary: Artificial intelligence (AI) techniques are widely applied in the life sciences. However, applying innovative AI techniques to understand and deconvolute biological complexity is hindered by the learning curve life scientists face in understanding and using programming languages. An open-source, user-friendly interface for AI models that does not require programming skills to analyze complex biological data would be extremely valuable to the bioinformatics community. With easy access to different sequencing technologies and increased interest in different 'omics' studies, the number of biological datasets being generated has increased, and analyzing these high-throughput datasets is computationally demanding. Most AI libraries today require advanced programming skills as well as machine learning, data preprocessing, and visualization skills. In this research, we propose a web-based end-to-end pipeline capable of preprocessing, training, evaluating, and visualizing machine learning (ML) models without manual intervention or coding expertise. By integrating traditional machine learning and deep neural network models with visualizations, our library assists in recognizing, classifying, clustering, and predicting a wide range of multi-modal, multi-sensor datasets, including images, languages, and one-dimensional numerical data, for drug discovery, pathogen classification, and medical diagnostics.
ISSN: 2769-4542
DOI: 10.1109/ICICT62343.2024.00063