A General View of Variance Component Estimation, and Some Takeaways

2022-04-30


I recently read a 2008 paper by Misztal on variance component estimation, and one passage in it was especially thought-provoking.

A general view of variance components

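The original passage is quoted below. Roughly paraphrased: problems in variance component estimation can often be traced to a model that is too complex, with too many parameters to estimate. As Box and Draper (1987) put it, essentially all models are wrong, but some are useful. More complex models may do a better job of revealing the biology of a trait, but a simple model may be enough for genetic evaluation, because (as many published reports show) complex and simple models often have about the same predictive ability.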

Problems with variance component estimation often can be traced to unnecessarily high model complexity with too many parameters to estimate. As Box and Draper (1987) stated, ‘Essentially, all models are wrong, but some are useful’, thus the search for a perfect model is futile. While more complex models may be needed to reveal the biology of traits, simpler models may suffice for genetic evaluation. For example, Lopez-Romero & Carabano (2003) compared random regression models using Legendre polynomials of orders 2–6. While more complex models fit the data better, the predictive ability of all the models was almost identical, indicating almost identical rankings of sires. Good arguments for following productivity in model comparisons were made by Blasco (2006). Reports from literature that simple and complicated models provide similar estimated breeding values are abound, e.g. Piles et al. (2006).

Takeaways

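In practical applications, the criterion for choosing among models is whether a model is fit for purpose, not whether it is correct. Fitness for purpose should be judged on several levels, such as numerical stability, computing time and efficiency, accuracy of the results, and how easy the results are to use.
Entities should not be multiplied without necessity. Work from easy to hard and add complexity one step at a time; do not assume from the start that more complex means better. At each step, compare the result against the earlier, simpler method. If the complex approach offers little gain over the simple one, it is probably better to fall back to the simple one.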
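The "add complexity one step at a time and compare" habit can be sketched in a few lines of Python. This is only an illustration on synthetic data, not the analysis from Misztal (2008) or from Lopez-Romero & Carabano (2003): the function names `compare_orders` and `simplest_adequate`, the 5% tolerance, and the simulated quadratic curve are all assumptions made for the example. It fits Legendre polynomials of increasing order, records how well each one fits the training data and how well it predicts held-out data, and then picks the lowest order whose predictive error is close to the best.

```python
import numpy as np
from numpy.polynomial import legendre


def compare_orders(x_train, y_train, x_test, y_test, orders):
    """Fit a Legendre polynomial of each order; return {order: (train_mse, test_mse)}."""
    results = {}
    for order in orders:
        coefs = legendre.legfit(x_train, y_train, order)  # least-squares fit
        train_mse = np.mean((legendre.legval(x_train, coefs) - y_train) ** 2)
        test_mse = np.mean((legendre.legval(x_test, coefs) - y_test) ** 2)
        results[order] = (float(train_mse), float(test_mse))
    return results


def simplest_adequate(results, tolerance=0.05):
    """Lowest order whose held-out MSE is within `tolerance` (relative) of the best one."""
    best_test = min(test for _, test in results.values())
    for order in sorted(results):
        if results[order][1] <= best_test * (1.0 + tolerance):
            return order


# Synthetic example (hypothetical data): a quadratic curve plus noise on [-1, 1].
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=400)
y = 1.0 + 0.5 * x - 0.8 * x**2 + rng.normal(scale=0.3, size=x.size)
x_train, y_train, x_test, y_test = x[:300], y[:300], x[300:], y[300:]

results = compare_orders(x_train, y_train, x_test, y_test, orders=range(1, 7))
for order, (train_mse, test_mse) in results.items():
    print(f"order {order}: train MSE {train_mse:.4f}, held-out MSE {test_mse:.4f}")
print("simplest adequate order:", simplest_adequate(results))
```

The training error can only go down as the order increases, because each model contains the previous one, so goodness of fit alone always favors the most complex model. The held-out error is what tells you whether the extra parameters are actually buying anything.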

References

Misztal I. Reliable computing in estimation of variance components. Journal of Animal Breeding and Genetics, 2008, 125(6): 363-370.

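Copyright notice: unless otherwise stated, the copyright of all articles on this blog belongs to the author. Please credit the source when reposting!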
