Optimization of underground mining production layouts considering geological uncertainty using deep reinforcement learning

Mineral extraction plays a key role in the global raw materials supply chain, however the exhaustion of shallow deposits and typical scarcity of sampled data during exploration activities creates challenges in mine planning and design, where decision-making is highly sensitive to uncertainty in geol...

Full description

Saved in:
Bibliographic Details
Published inEngineering applications of artificial intelligence Vol. 139; p. 109493
Main Authors Noriega, Roberto, Boisvert, Jeff
Format Journal Article
LanguageEnglish
Published Elsevier Ltd 01.01.2025
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:Mineral extraction plays a key role in the global raw materials supply chain, however the exhaustion of shallow deposits and typical scarcity of sampled data during exploration activities creates challenges in mine planning and design, where decision-making is highly sensitive to uncertainty in geology and mineral grade prediction. Geostatistical techniques are commonly used to generate a set of equiprobable simulated numerical models to capture these uncertainties, however incorporating these simulated models within a mine planning and design framework remains a major challenge. The purpose of this paper is to propose a novel approach to decision-making in underground mine design that can use information from an ensemble of numerical realizations of a mineral resource to improve the financial performance of the asset. A deep reinforcement learning (DRL) framework, using the proximal policy optimization (PPO) algorithm, is developed for the design of underground mining production level layouts. A case study is presented using a gold mineral resource characterized by an ensemble of 100 numerical realizations to verify the advantages of the proposed method, considering a baseline consisting of an industry standard automated design method. The DRL approach achieved an 8.3% higher expected profit, a 3.4% more gold mined than the baseline, and has the added functionality of considering uncertainty in mineral grades. [Display omitted] •Reinforcement Learning incorporates geological and mineral grade uncertainty in underground mining production level design.•PPO algorithm uses information across different scales to train a Neural Network that outputs a stope extraction layout.•Benchmark achieved an 8.3% higher expected profit and 3.4% higher gold reserves over an ensemble of resource realizations.•Sensitivity analysis stablished sensible ranges for hyperparameters.
ISSN:0952-1976
DOI:10.1016/j.engappai.2024.109493