Analysis of lipid production from Yarrowia lipolytica for renewable fuel production by machine learning

Bibliographic Details
Published in Fuel (Guildford) Vol. 315; p. 122817
Main Authors Coşgun, Ahmet; Günay, M. Erdem; Yıldırım, Ramazan
Format Journal Article
Language English
Published Kidlington Elsevier Ltd 01.05.2022
Elsevier BV
Summary:
• Machine learning provided the optimum conditions of biomass and lipid production.
• Decision trees discovered 5 combinations of variables for high lipid production.
• C/N ratio and fermentation time affect biomass productivity significantly.
• Use of glucose as carbon source and medium pH strongly affect lipid percentage.
• Operating conditions, pH and C and N sources highly influence lipid production.

In this work, biomass and lipid productivities of Yarrowia lipolytica were analyzed using machine learning techniques. A dataset containing 356 instances was constructed from the experimental results reported in 22 publications. The dataset was analyzed using decision trees to identify the features (descriptors) that lead to high biomass production, lipid content and lipid production. C/N ratio and fermentation time were found to be the most influential features for biomass production, while the use of glucose and medium pH seemed to be more important for high lipid content. For the lipid production case, five generalizable paths leading to high values of this output were identified. One of those paths required a pH below 6.3, high glucose and (NH4)2SO4 concentrations, a lower yeast extract concentration, and a yeast strain other than H-222. Another required a pH greater than 6.3, a C/N ratio smaller than 75, a time greater than 14 h, and a strain other than W29. The same dataset was also explored in more depth using association rule mining to determine the effects of individual features on the output variables. It was then concluded that machine learning methods are very useful in determining the optimal conditions of biomass growth and lipid yield for Yarrowia lipolytica to produce renewable biofuels.
ISSN:0016-2361
1873-7153
DOI:10.1016/j.fuel.2021.122817
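
A decision-tree path such as the second one quoted in the summary (pH greater than 6.3, C/N ratio smaller than 75, time greater than 14 h, strain other than W29) can be read as a conjunction of threshold tests. The following is a minimal sketch of that reading only, not the authors' code: the `Run` class, its field names, and the three example runs are hypothetical and are not taken from the paper's dataset. The first path is not encoded because the summary gives no numeric thresholds for "high" or "lower" concentrations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Run:
    """One fermentation experiment. Field names are hypothetical."""
    ph: float        # medium pH
    cn_ratio: float  # carbon-to-nitrogen ratio
    time_h: float    # fermentation time in hours
    strain: str      # yeast strain label


def on_high_lipid_path(run: Run) -> bool:
    """True if the run satisfies every condition of the quoted path."""
    return (
        run.ph > 6.3
        and run.cn_ratio < 75
        and run.time_h > 14
        and run.strain != "W29"
    )


# Invented example runs, used only to exercise the rule.
runs = [
    Run(ph=6.5, cn_ratio=50, time_h=48, strain="H-222"),  # meets all four tests
    Run(ph=6.0, cn_ratio=50, time_h=48, strain="H-222"),  # pH too low
    Run(ph=6.8, cn_ratio=50, time_h=48, strain="W29"),    # excluded strain
]
flags = [on_high_lipid_path(r) for r in runs]
```

Matching a path means only that a run falls in the leaf the summary associates with high lipid production; it does not predict a specific lipid value.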