Machine Learning Methods for Quantifying Uncertainty in Prospectivity Mapping of Magmatic-Hydrothermal Gold Deposits: A Case Study from Juruena Mineral Province, Northern Mato Grosso, Brazil

Mineral prospectivity mapping (MPM), like other geoscience fields, is subject to a variety of uncertainties. When data about unfavorable sites to find deposits (i.e., drill intersections to barren rocks) is lacking in MPM using machine learning (ML) methods, the synthetic generation of negative data...

Full description

Saved in:
Bibliographic Details
Published inMinerals (Basel) Vol. 12; no. 8; p. 941
Main Authors Silva dos Santos, Victor, Gloaguen, Erwan, Hector Abud Louro, Vinicius, Blouin, Martin
Format Journal Article
LanguageEnglish
Published Basel MDPI AG 01.08.2022
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Mineral prospectivity mapping (MPM), like other geoscience fields, is subject to a variety of uncertainties. When data about unfavorable sites to find deposits (i.e., drill intersections to barren rocks) is lacking in MPM using machine learning (ML) methods, the synthetic generation of negative datasets is required. As a result, techniques for selecting point locations to represent negative examples must be employed. Several approaches have been proposed in the past; however, one can never be certain that the points chosen are true negatives or, at the very least, optimal for training. As a consequence, methodologies that account for the uncertainty of the generation of negative datasets in MPM are needed. In this paper, we compare two criteria for selecting negative examples and quantify the uncertainty associated with this process by generating 400 potential maps for each of the three ML methods utilized (200 maps for each criterion), which include random forest (RF), support vector machine (SVM), and k-nearest neighbors (KNC). The results showed that applying a geological constraint to the creation of negative datasets reduced prediction uncertainty and improved overall model performance but produced larger areas of very high probability (i.e., >0.9) when compared to using only the spatial distribution of known deposits and occurrences as a constraint. SHAP values were used to find approximations for the importance of features in nonlinear methods, and kernel density estimations were used to examine how they varied depending on the negative dataset used to train the ML models. Prospectivity models for magmatic-hydrothermal gold deposits were generated using data from the shuttle radar terrain mission, gamma-ray, magnetic lineaments, and proximity to dykes. The Juruena Mineral Province, situated in Northern Mato Grosso, Brazil, represented the case study for this work.
ISSN:2075-163X
2075-163X
DOI:10.3390/min12080941