Evolving scientific discovery by unifying data and background knowledge with AI Hilbert

The discovery of scientific formulae that parsimoniously explain natural phenomena and align with existing background theory is a key goal in science. Historically, scientists have derived natural laws by manipulating equations based on existing knowledge, forming new equations, and verifying them e...

Full description

Saved in:
Bibliographic Details
Published inNature communications Vol. 15; no. 1; pp. 5922 - 14
Main Authors Cory-Wright, Ryan, Cornelio, Cristina, Dash, Sanjeeb, El Khadir, Bachir, Horesh, Lior
Format Journal Article
LanguageEnglish
Published London Nature Publishing Group UK 14.07.2024
Nature Publishing Group
Nature Portfolio
Subjects
Online AccessGet full text

Cover

Loading…
More Information
Summary:The discovery of scientific formulae that parsimoniously explain natural phenomena and align with existing background theory is a key goal in science. Historically, scientists have derived natural laws by manipulating equations based on existing knowledge, forming new equations, and verifying them experimentally. However, this does not include experimental data within the discovery process, which may be inefficient. We propose a solution to this problem when all axioms and scientific laws are expressible as polynomials and argue our approach is widely applicable. We model notions of minimal complexity using binary variables and logical constraints, solve polynomial optimization problems via mixed-integer linear or semidefinite optimization, and prove the validity of our scientific discoveries in a principled manner using Positivstellensatz certificates. We demonstrate that some famous scientific laws, including Kepler’s Law of Planetary Motion and the Radiated Gravitational Wave Power equation, can be derived in a principled manner from axioms and experimental data. Scientific discovery is a highly relevant task in natural sciences, however generating scientifically meaningful laws and determining their consistency remains challenging. The authors introduce an approach that exploits both experimental data and underlying theory in symbolic form to generate formulas that hold scientific significance by solving polynomial optimization problems.
Bibliography:ObjectType-Article-1
SourceType-Scholarly Journals-1
ObjectType-Feature-2
content type line 14
content type line 23
ISSN:2041-1723
2041-1723
DOI:10.1038/s41467-024-50074-w