Optimised graded metamaterials for mechanical energy confinement and amplification via reinforcement learning
A reinforcement learning approach to design optimised graded metamaterials for mechanical energy confinement and amplification is described. Through the proximal policy optimisation algorithm, the reinforcement agent is trained to optimally set the lengths and the spacing of an array of resonators....
Saved in:
Published in | European journal of mechanics, A, Solids Vol. 99; p. 104947 |
---|---|
Main Authors | , , , , |
Format | Journal Article |
Language | English |
Published |
Elsevier Masson SAS
01.05.2023
|
Subjects | |
Online Access | Get full text |
Cover
Loading…
Summary: | A reinforcement learning approach to design optimised graded metamaterials for mechanical energy confinement and amplification is described. Through the proximal policy optimisation algorithm, the reinforcement agent is trained to optimally set the lengths and the spacing of an array of resonators. The design optimisation problem is formalised in a Markov decision problem by splitting the optimisation procedure into a discrete number of decisions. Being the physics of graded metamaterials governed by the spatial distribution of local resonances, the space of possible configurations is constrained by using a continuous function for the resonators arrangement. A preliminary analytical investigation has been performed to characterise the dispersive properties of the analysed system by treating it as a locally resonant system. The outcomes of the optimisation procedure confirms the results of previous investigations, highlighting both the validity of the proposed approach and the robustness of the systems of graded resonators when employed for mechanical energy confinement and amplification. The role of the resonator spacing is shown to be secondary with respect to the resonator lengths or, in other words, with respect to the oscillation frequencies of the resonators. However, it is also demonstrated that reducing the number of resonators can be advantageous. The outcomes related to the joint optimisation of the resonator lengths and spacing, thanks also to the adaptive control of the analysis duration, overcome significantly the performance of previously known systems by working almost uniquely on enlarging the time in which the harvester oscillations take place without amplifying these oscillations. The proposed procedure is suitable to be applied to a wide range of design optimisation problems in which the effect of the design choices can be assessed through numerical simulations.
•Resonator lengths and spacing of a rainbow based metamaterial as design parameters.•Constraining of the design space based on physical considerations.•Formalisation of the design optimisation problem as a Markov decision process.•Actor–critic reinforcement learning for solving the Markov decision process.•Best grading rule increases the extension of the target oscillation time. |
---|---|
ISSN: | 0997-7538 1873-7285 |
DOI: | 10.1016/j.euromechsol.2023.104947 |