四因素方差分析模型构建及癌症风险评估
Construction of Four Factor Analysis of Variance (ANOVA) Model and Cancer Risk Assessment
DOI: 10.12677/AAM.2021.106225, PDF,  被引量    科研立项经费支持
作者: 邵梦瑶, 贺兴时, 李玲玲:西安工程大学理学院,陕西 西安
关键词: 四因素方差分析离子辐射吸烟癌症风险Four Factor Analysis of Variance Ion Radiation Smoking Cancer Risk
摘要: 在实际问题中,往往需要分析多重因素对实验结果的影响,方差分析是解决此类问题的一个重要工具。然而,对于方差分析的理论目前集中在单因素方差分析和两因素方差分析。为此,基于两因素方差分析模型给出带有交互效应的四因素方差分析模型的理论推导,并应用其来分析地域、性别、吸烟、离子辐射对癌症患病风险的影响。本文在两因素方差分析模型基础上给出了具有交互作用的四因素方差分析模型的理论推导,并将其应用到具体实例中。在应用多因素方差分析模型时,可以通过对数据做变换来达到正态性、方差齐性的要求。对癌症发病率数据的方差分析结果表明,吸烟时长和离子辐射剂量对癌症的患病风险具有显著性影响,地域和性别并没有显示对癌症风险具有显著性影响。
Abstract: In practical problems, it is often necessary to analyze the influence of multiple factors on the experimental results. Analysis of variance is an important tool to solve the problems. However, the theory of variance analysis (ANOVA) focuses on one-way analysis of variance and two-way analysis of variance (ANOVA). Therefore, based on the two factor analysis of variance (ANOVA) model, this paper gives the theoretical derivation of the four factor analysis of variance (ANOVA) model with interaction effect, and applies it to analyze the influence of region, gender, smoking and ion radiation on cancer risk. Based on the two factor analysis of variance model, this paper gives the theoretical derivation of the interactive four factor analysis of variance model and applies it to specific examples. When applying a multi-factor analysis of variance model, the data can be transformed to meet the requirements of normality and homogeneity of variance. The results of variance analysis of cancer incidence data show that smoking duration and ionizing radiation dose have a significant impact on the risk of cancer, region and gender do not show a significant impact on cancer risk.
文章引用:邵梦瑶, 贺兴时, 李玲玲. 四因素方差分析模型构建及癌症风险评估[J]. 应用数学进展, 2021, 10(6): 2155-2165. https://doi.org/10.12677/AAM.2021.106225

参考文献

[1] 吴坚. 应用概率统计[M]. 第2版. 北京: 高等教育出版社, 2007: 262.
[2] 魏宗舒. 概率论与数理统计教程[M]. 北京: 高等教育出版社, 2001: 372-391.
[3] 张忠群. 概率论与数理统计[M]. 贵阳: 贵州大学出版社, 2008: 203.
[4] Hardle, W.K. and Simar, L. (2014) Applied Multivariate Statistical Analysis. 4th Edition, Springer-Verlag, Berlin.
[5] 何晓群. 多元统计分析[M]. 第四版. 北京: 中国人民大学出版社, 2015.
[6] 王学民. 应用多元统计分析[M]. 第五版. 上海: 上海财经大学出版社, 2017.
[7] 王苗苗. 双因素方差分析模型的构建及应用[J]. 统计与决策, 2015(18): 72-75.
[8] 戴金辉, 韩存. 双因素方差分析方法的比较[J]. 统计与决策, 2018, 34(4): 30-33.
[9] 刘晓华. 多元方差分析模型的构建与应用[J]. 统计与决策, 2019, 35(1): 75-78.
[10] 黄伯强, 李启才. 带交互作用的双因素方差分析的线性回归建模[J]. 统计与决策, 2021, 37(1): 10-15.
[11] 戴金辉, 代金辉. 方差分析在跳水运动成绩管理中的应用[J]. 统计与决策, 2016(22): 80-82.
[12] Khatun, N. (2021) Applications of Normality Test in Statistical Analysis. Open Journal of Statistics, 11, 113-122. [Google Scholar] [CrossRef
[13] Michael, J.R. (1983) The Stabilized Probability Plot. Biometrika, 70, 11-17. [Google Scholar] [CrossRef
[14] Pallmann, P., Hothorn, L.A. and Djira, G.D. (2014) A Levene-Type Test of Homogeneity of Variances against Ordered Alternatives. Computational Statistics, 29, 1593-1608. [Google Scholar] [CrossRef
[15] Grant, E.J., Brenner, A., Sugiyama, H., et al. (2017) Solid Cancer Incidence among the Life Span Study of Atomic Bomb Survivors: 1958-2009. Radiation Research, 187, 513-537. [Google Scholar] [CrossRef
[16] Sinz, F., Gerwinn, S. and Bethge, M. (2008) Characterization of the p-Generalized Normal Distribution. Journal of Multivariate Analysis, 100, 817-820. [Google Scholar] [CrossRef
[17] Kolkiewicz, A., Rice, G. and Xie, Y. (2021) Projection Pursuit Based Tests of Normality with Functional Data. Journal of Statistical Planning and Inference, 211, 326-339. [Google Scholar] [CrossRef
[18] Lilliefors, H.W. (2012) On the Kolmogorov-Smirnov Test for Normality with Mean and Variance Unknown. Journal of the American Statistical Association, 62, 399-402. [Google Scholar] [CrossRef
[19] Conover, W.J., Johnson, M.E. and Johnson, M.M. (2012) A Comparative Study of Tests for Homogeneity of Variances, with Applications to the Outer Continental Shelf Bidding Data. Technometrics, 23, 351-361. [Google Scholar] [CrossRef
[20] Esmailzadeh, N. (2019) A Comparison of Five Bootstrap and Non-Bootstrap Levene-Type Tests of Homogeneity of Variances. Iranian Journal of Science and Technology, Transactions A: Science, 43, 979-989. [Google Scholar] [CrossRef
[21] Sharma, D. and Golam Kibria, B.M. (2013) On Some Test Statistics for Testing Homogeneity of Variances: A Comparative Study. Journal of Statistical Computation and Simulation, 83, 1944-1963.
[22] 薛毅. 统计建模与R软件[M]. 北京: 清华大学出版社, 2007.
[23] Ran, Y. and Yuan, X. (2020) Analysis of the Influencing Factors of the Multi-Linear Regression Model Based on R Language on the Total Cost of Domestic Tourism. Frontiers in Economics and Management, 1, 60-65.
[24] Kumar, M., Sonker, P.Kr., Saroj, A., Jain, A., Bhattacharjee, A. and Saroj, R.Kr. (2020) Parametric Survival Analysis Using R: Illustration with Lung Cancer Data. Cancer Reports, 3, e1210. [Google Scholar] [CrossRef] [PubMed]