Machine learning to further improve the decision which boar ejaculates to process into artificial insemination doses

Current artificial insemination (AI) laboratory practices assess semen quality of each boar ejaculate to decide which ones to process into AI doses. This decision is aided with two, world-wide used, motility parameters that come available through computer assisted semen analysis (CASA). This decisio...

Full description

Saved in:

Bibliographic Details
Published in	Theriogenology Vol. 144; pp. 112 - 121
Main Authors	Kamphuis, Claudia, Duenk, Pascal, Veerkamp, Roel Franciscus, Visser, Bram, Singh, Gurnoor, Nigsch, Annette, De Mol, Rudi Maria, Broekhuijse, Marleen Leonarda Wilhelmina Johanna
Format	Journal Article
Language	English
Published	United States Elsevier Inc 01.03.2020
Subjects	Boar semen Fertility phenotypes Machine learning Prediction model Boar semen Fertility phenotypes Machine learning Prediction model
Online Access	Get full text

Cover

Loading…

More Information
Summary:	Current artificial insemination (AI) laboratory practices assess semen quality of each boar ejaculate to decide which ones to process into AI doses. This decision is aided with two, world-wide used, motility parameters that come available through computer assisted semen analysis (CASA). This decision process, however, still results in AI doses with variable and sometimes suboptimal fertility outcomes (e.g., small litter size). The hypothesis was that the decision which ejaculates to process into AI doses can be improved by adding more data from CASA systems, and data from other sources, in combination with a data-driven model. Available data consisted of ejaculates that passed the initial decision, and thus, were processed into AI doses and used to inseminate sows. Data were divided into a training set (6793 records) and a validation set (1191 records) for model development, and an independent test set (1434 records) for performance assessment. Gradient Boosting Machine (GBM) models were developed to predict four fertility phenotypes of interest (gestation length, total number born, number born alive, and number of stillborn piglets). Each fertility phenotype was considered as a numeric and as a binary outcome parameter, totaling to eight different fertility phenotypes. Data used to further improve the decision process originated from four sources: 1) CASA information, 2) boar ejaculate information, 3) breeding value estimations, and 4) weather information. These data were used to create seven prediction sets, where each new set added parameters to the ones included in the previous set. The GBM models predicted fertility phenotypes with low correlations (for numeric phenotypes) and area under the curve values (for binary phenotypes) on the test data. Hence, results demonstrated that a combination of more data and GBM did not enable further improvement of the AI dose quality checks, resulting in the rejection of our hypothesis. However, our study revealed parameters affecting boar ejaculate fertility which were not used in today’s decision process. These parameters (listed in the top 10 in at least four GBM models) included one parameter associated with boar ejaculate information, two with breeding value estimations, five with CASA information, and one with weather information. These parameters, therefore, should be further investigated for their potential value when assessing the quality of boar ejaculates in daily routine AI doses processing. •CASA is a valuable tool for AI laboratories to decide which boar ejaculate to process into AI doses.•This decision could not be improved by a data-driven model to analyze the data.•This decision could not be improved by adding more CASA data or other data.•Nine parameters not used today were identified as potentially interesting.
Bibliography:	ObjectType-Article-1 SourceType-Scholarly Journals-1 ObjectType-Feature-2 content type line 23
ISSN:	0093-691X 1879-3231
DOI:	10.1016/j.theriogenology.2019.12.017