این سایت در حال حاضر پشتیبانی نمی شود و امکان دارد داده های نشریات بروز نباشند
صفحه اصلی
درباره پایگاه
فهرست سامانه ها
الزامات سامانه ها
فهرست سازمانی
تماس با ما
JCR 2016
جستجوی مقالات
یکشنبه 18 آبان 1404
Journal of Livestock Science and Technology
، جلد ۱۲، شماره ۱، صفحات ۳۱-۳۷
عنوان فارسی
چکیده فارسی مقاله
کلیدواژههای فارسی مقاله
عنوان انگلیسی
Comparing the performance of xgboost, Gradient Boosting and GBLUP models under different genomic prediction scenarios
چکیده انگلیسی مقاله
Abstract
The aim of this study was to study the performance of
xgboost
algorithm in genomic evaluation of complex traits as an alternative for Gradient Boosting algorithm (GBM). To this end, genotypic matrices containing genotypic information for, respectively, 5,000 (S1), 10,000 (S2) and 50,000 (S3) single nucleotide polymorphisms (SNP) for 1000 individuals was simulated. Beside
xgboost
and GBM, the GBLUP which is known as an efficient algorithm in terms of accuracy, computing time and memory requirement was also used to predict genomic breeding values.
xgboost
, GBM and GBLUP were run in R software using xgboost, gbm and synbreed
packages. All the analyses were done using a machine equipped with a Core i7-6800K CPU which had 6 physical cores. In addition, 32 gigabyte of memory was installed on the machine. The Person's correlation between predicted and true breeding values (r
p,t
) and the mean squared error (MSE) of prediction were computed to compare predictive performance of different methods. While GBLUP was the most efficient user of memory, GBM required a considerably high amount of memory to run. By increasing size of data from S1 to S3, GBM went out from the competition mainly due to its high demand for memory. Parallel computing with
xgboost
reduced running time by %99 compared to GBM. The
speedup ratio
s (the ratio of the GBM runtime to the time taken by the parallel computing by
xgboost
) were 444 and 554 for the S1 and S2 scenarios, respectively. In addition, parallelization efficiency (
speed up ratio
/number of cores) were, respectively, 74 and 92 for the S1 and S2 scenarios, indicating that by increasing the size of data, the efficiency of parallel computing increased. The
xgboost
was considerably faster than GBLUP in all the scenarios studied. Accuracy of genomic breeding values predicted by
xgboost
was similar to those predicted by GBM. While the accuracy of prediction in terms of r
p,t
was higher for GBLUP, the
MSE
of prediction was lower for
xgboost
, specially for larger datasets. Our results showed that
xgboost
could be an efficient alternative for GBM as it had the same accuracy of prediction, was extremely fast and needed significantly lower memory requirement to predict the genomic breeding values.
کلیدواژههای انگلیسی مقاله
genomic evaluation, parallel computing, computing time, SNP
نویسندگان مقاله
Farhad Ghafouri-Kesbi |
Department of Animal Science, Faculty of Agriculture, Bu-Ali Sina University, Hamedan, Iran
نشانی اینترنتی
https://lst.uk.ac.ir/article_4195_afe67d8342febc3d454ebcfd2dc98629.pdf
فایل مقاله
فایلی برای مقاله ذخیره نشده است
کد مقاله (doi)
زبان مقاله منتشر شده
en
موضوعات مقاله منتشر شده
نوع مقاله منتشر شده
برگشت به:
صفحه اول پایگاه
|
نسخه مرتبط
|
نشریه مرتبط
|
فهرست نشریات