Comparative Analysis of Machine Learning Algorithms in Fish Survival Prediction

This research paper conducts an exploration of prevalent machine learning classification algorithms, emphasizing the refinement of Random Forest models to optimize performance. The initial phase involves a comparative analysis of Random Forest, k-Nearest Neighbors (kNN), Support Vector Machine (SVM)...

Full description

Saved in:
Bibliographic Details
Published in2024 IEEE Bangalore Humanitarian Technology Conference (B-HTC) pp. 28 - 33
Main Authors Shirwaikar, Rudresh D., D'Souza, Lyzandra, Bhangle, Anirudh Santosh, Joshi, Sarang Deepak, Jathar, Nupur Ajit, Alvares, Jonathan Elroy
Format Conference Proceeding
LanguageEnglish
Published IEEE 22.03.2024
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:This research paper conducts an exploration of prevalent machine learning classification algorithms, emphasizing the refinement of Random Forest models to optimize performance. The initial phase involves a comparative analysis of Random Forest, k-Nearest Neighbors (kNN), Support Vector Machine (SVM), Logistic Regression, and Naive Bayes Classification algorithms across the dataset, resulting in Random forest to be best performed among all with average accuracy of 61%. Subsequently, then we focus on enhancement of the Random Forest algorithm with random train-test split dataset, resulting in an enhanced average accuracy of 65% which is about 4% improvement to the previous accuracy and choosing the best traintest split dataset, along with fine tuning the model, improved the average accuracy to 73% which is about 12% improvement to previous accuracy which is significantly higher showcasing the significance of precise algorithmic tuning. The completion of our investigation lies in the creation of an integrated model which incorporates multiple Random Forest classifier models, each specifically trained for binary class classification, along with a comprehensive model designed for multiclass classification. This combined model excels with a peak accuracy of 82.75% for the best split, demonstrating its strong performance in classification tasks. This research not only contributes to the theoretical understanding of algorithm performance and optimization but also offers practical insights for the application of machine learning in challenging classification scenarios.
DOI:10.1109/B-HTC60740.2024.10564052