TY - JOUR
T1 - Integrating diverse data sources to predict disease risk in dairy cattle— a machine learning approach
AU - Lasser, Jana
AU - Matzhold, Caspar
AU - Egger-Danner, Christa
AU - Fuerst-Waltl, Birgit
AU - Steininger, Franz
AU - Wittek, Thomas
AU - Klimek, Peter
PY - 2021/11/1
Y1 - 2021/11/1
N2 - Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1 = 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
AB - Livestock farming is currently undergoing a digital revolution and becoming increasingly data-driven. Yet, such data often reside in disconnected silos making them impossible to leverage their full potential to improve animal well-being. Here, we introduce a precision livestock farming approach, bringing together information streams from a variety of life domains of dairy cattle to study whether including more and diverse data sources improves the quality of predictions for eight diseases and whether using more complex prediction algorithms can, to some extent, compensate for less diverse data. Using three machine learning approaches of varying complexity (from logistic regression to gradient boosted trees) trained on data from 5,828 animals in 165 herds in Austria, we show that the prediction of lameness, acute and chronic mastitis, anestrus, ovarian cysts, metritis, ketosis (hyperketonemia), and periparturient hypocalcemia (milk fever) from routinely available data gives encouraging results. For example, we can predict lameness with high sensitivity and specificity (F1 = 0.74). An analysis of the importance of individual variables to prediction performance shows that disease in dairy cattle is a product of the complex interplay between a multitude of life domains, such as housing, nutrition, or climate, that including more and diverse data sources increases prediction performance, and that the reuse of existing data can create actionable information for preventive interventions. Our findings pave the way toward data-driven point-of-care interventions and demonstrate the added value of integrating all available data in the dairy industry to improve animal well-being and reduce disease risk.
KW - machine learning
KW - dairy farming
KW - precision livestock farming
KW - predictive medicine
KW - disease prediction
KW - data integration
UR - http://www.scopus.com/inward/record.url?scp=85122047214&partnerID=8YFLogxK
U2 - 10.1093/jas/skab294
DO - 10.1093/jas/skab294
M3 - Article
SN - 0021-8812
VL - 99
JO - Journal of Animal Science
JF - Journal of Animal Science
IS - 11
M1 - skab294
ER -