Reinforcement learning control of a biomechanical model of the upper extremity

Among the infinite number of possible movements that can be produced, humans are commonly assumed to choose those that optimize criteria such as minimizing movement time, subject to certain movement constraints like signal-dependent and constant motor noise. While so far these assumptions have only...

Full description

Saved in:

Bibliographic Details
Published in	Scientific reports Vol. 11; no. 1; pp. 14445 - 15
Main Authors	Fischer, Florian, Bachinski, Miroslav, Klar, Markus, Fleig, Arthur, Müller, Jörg
Format	Journal Article
Language	English
Published	London Nature Publishing Group UK 14.07.2021 Nature Publishing Group Nature Portfolio
Subjects	631/114/116/2392 631/114/116/2393 631/114/1305 631/114/2397 631/378/2632 639/705/1042 639/705/117 Biomechanical Phenomena Biomechanics Humanities and Social Sciences Humans Learning Mechanical properties Models, Neurological Motor task performance Movement multidisciplinary Muscles Reinforcement Science Science (multidisciplinary) Upper Extremity
Online Access	Get full text
ISSN	2045-2322 2045-2322
DOI	10.1038/s41598-021-93760-1

Cover

Loading…

More Information
Summary:	Among the infinite number of possible movements that can be produced, humans are commonly assumed to choose those that optimize criteria such as minimizing movement time, subject to certain movement constraints like signal-dependent and constant motor noise. While so far these assumptions have only been evaluated for simplified point-mass or planar models, we address the question of whether they can predict reaching movements in a full skeletal model of the human upper extremity. We learn a control policy using a motor babbling approach as implemented in reinforcement learning, using aimed movements of the tip of the right index finger towards randomly placed 3D targets of varying size. We use a state-of-the-art biomechanical model, which includes seven actuated degrees of freedom. To deal with the curse of dimensionality, we use a simplified second-order muscle model, acting at each degree of freedom instead of individual muscles. The results confirm that the assumptions of signal-dependent and constant motor noise, together with the objective of movement time minimization, are sufficient for a state-of-the-art skeletal model of the human upper extremity to reproduce complex phenomena of human movement, in particular Fitts’ Law and the 2 3 Power Law. This result supports the notion that control of the complex human biomechanical system can plausibly be determined by a set of simple assumptions and can easily be learned.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 14 content type line 23
ISSN:	2045-2322 2045-2322
DOI:	10.1038/s41598-021-93760-1