Wikipedia network analysis of cancer interactions and world influence
Autoři:
Guillaume Rollin aff001; José Lages aff001; Dima L. Shepelyansky aff002
Působiště autorů:
Institut UTINAM, CNRS, UMR 6213, OSU THETA, Université de Bourgogne Franche-Comté, Besançon, France
aff001; Laboratoire de Physique Théorique, IRSAMC, Université de Toulouse, CNRS, UPS, Toulouse, France
aff002
Vyšlo v časopise:
PLoS ONE 14(9)
Kategorie:
Research Article
doi:
https://doi.org/10.1371/journal.pone.0222508
Souhrn
We apply the Google matrix algorithms for analysis of interactions and influence of 37 cancer types, 203 cancer drugs and 195 world countries using the network of 5 416 537 English Wikipedia articles with all their directed hyperlinks. The PageRank algorithm provides a ranking of cancers which has 60% and 70% overlaps with the top 10 deadliest cancers extracted from World Health Organization GLOBOCAN 2018 and Global Burden of Diseases Study 2017, respectively. The recently developed reduced Google matrix algorithm gives networks of interactions between cancers, drugs and countries taking into account all direct and indirect links between these selected 435 entities. These reduced networks allow to obtain sensitivity of countries to specific cancers and drugs. The strongest links between cancers and drugs are in good agreement with the approved medical prescriptions of specific drugs to specific cancers. We argue that this analysis of knowledge accumulated in Wikipedia provides useful complementary global information about interdependencies between cancers, drugs and world countries.
Klíčová slova:
Social sciences – Sociology – Communications – Mass media – Encyclopedias – Online encyclopedias – Physical sciences – Mathematics – Applied mathematics – Algorithms – Research and analysis methods – Simulation and modeling – Medicine and health sciences – Oncology – Cancers and neoplasms – Breast tumors – Breast cancer – Lung and intrathoracic tumors – Hematologic cancers and related disorders – Leukemias – Lymphomas – Non-Hodgkin lymphoma – Gynecological tumors – Ovarian cancer – Genitourinary tract tumors – Prostate cancer – Hematology – Urology – Prostate diseases
Zdroje
1. Wold Health Organization. World Cancer Day 2018; 2018. Available from: https://www.who.int/cancer/world-cancer-day/2018/en/.
2. Union for International Cancer Control. New Global Cancer Data: GLOBOCAN 2018; 2018. Available from: https://www.uicc.org/new-global-cancer-data-globocan-2018.
3. P G Altbach LER L Reisberg. IARC Biennial Report 2016-2017. International Agency for Research on Cancer; 2017. Available from: http://publications.iarc.fr/Book-And-Report-Series/Iarc-Biennial-Reports/IARC-Biennial-Report-2016-2017.
4. Bray F, Ferlay J, Soerjomataram I, Siegel RL, Torre LA, Jemal A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA: A Cancer Journal for Clinicians. 2018;68(6):394–424.
5. GBD. Global Burden of Disease; 2010. The Lancet. Available from: https://www.thelancet.com/gbd.
6. Brin S, Page L . The anatomy of a large-scale hypertextual Web search engine. Computer Networks and ISDN Systems. 1998;30(1):107—117. https://doi.org/10.1016/S0169-7552(98)00110-X.
7. Langville AN, Meyer CD. Google’s PageRank and Beyond: The Science of Search Engine Rankings. Princeton University Press; 2012.
8. Ermann L, Frahm KM, Shepelyansky DL. Google matrix analysis of directed networks. Rev Mod Phys. 2015;87:1261–1310. doi: 10.1103/RevModPhys.87.1261
9. Encyclopaedia Britannica; 2018. Available from: http://www.britannica.com.
10. Giles J. Internet encyclopaedias go head to head. Nature. 2005;438:900–901. doi: 10.1038/438900a 16355180
11. Butler D. Publish in Wikipedia or perish. Nature. 2008;
12. Callaway E. No rest for the bio-wikis. Nature. 2010;468(7322):359–360. doi: 10.1038/468359a 21085149
13. Reagle JM Jr. Good Faith Collaboration: The Culture of Wikipedia (History and Foundations of Information Science). The MIT Press; 2012.
14. Nielsen FÅ. Wikipedia Research and Tools: Review and Comments. SSRN Electronic Journal. 2012;
15. Lewoniewski W, Wecel K, Abramowicz W. Relative Quality and Popularity Evaluation of Multilingual Wikipedia Articles. Informatics. 2017;4(4):43. doi: 10.3390/informatics4040043
16. Frahm KM, Shepelyansky DL. Reduced Google matrix. arXiv. 2016;arXiv:1602.02394.
17. Frahm KM, Jaffrès-Runser K, Shepelyansky DL. Wikipedia mining of hidden links between political leaders. The European Physical Journal B. 2016;89(12):269. doi: 10.1140/epjb/e2016-70526-3
18. Zant SE, Jaffrès-Runser K, Frahm KM, Shepelyansky DL. Interactions and Influence of World Painters From the Reduced Google Matrix of Wikipedia Networks. IEEE Access. 2018;6:47735–47750. doi: 10.1109/ACCESS.2018.2867327
19. Coquidé C, Lages J, Shepelyansky DL. World influence and interactions of universities from Wikipedia networks. The European Physical Journal B. 2019;92(1):3. doi: 10.1140/epjb/e2018-90532-7
20. Lages J, Shepelyansky DL, Zinovyev A. Inferring hidden causal relations between pathway members using reduced Google matrix of directed biological networks. PLOS ONE. 2018;13(1):1–28. doi: 10.1371/journal.pone.0190812
21. Coquidé C, Ermann L, Lages J, Shepelyansky DL. Influence of petroleum and gas trade on EU economies from the reduced Google matrix analysis of UN COMTRADE data. The European Physical Journal B. 2019;92(8):171. doi: 10.1140/epjb/e2019-100132-6
22. Coquidé C, Lages J, Shepelyansky DL. Interdependence of sectors of economic activities for world countries from the reduced Google matrix analysis of WTO data. arXiv e-prints. 2019; p. arXiv:1905.06489.
23. Coquidé C, Lages J, Shepelyansky DL. Contagion in Bitcoin networks. arXiv e-prints. 2019; p. arXiv:1906.01293.
24. Cancer Treatment Centers of America. Types of Cancer; 2018. Available from: https://www.cancercenter.com/cancer/.
25. National Cancer Institute. List of Cancer Drugs; 2018. Available from: https://www.cancer.gov/about-cancer/treatment/drugs/.
26. El Zant S, Jaffrès-Runser K, Shepelyansky DL. Capturing the influence of geopolitical ties from Wikipedia with reduced Google matrix. PLOS ONE. 2018;13(8):1–31. doi: 10.1371/journal.pone.0201397
27. Rollin G, Lages J, Shepelyansky DL. World Influence of Infectious Diseases From Wikipedia Network Analysis. IEEE Access. 2019;7:26073–26087. doi: 10.1109/ACCESS.2019.2899339
28. Rollin G, Lages J, Shepelyansky D. Wiki4Cancers: Wikipedia network of cancers; 2018. Available from: http://perso.utinam.cnrs.fr/~lages/datasets/Wiki4Cancers/.
29. DrugBank. DrugBank database; 2018. Available from: https://www.drugbank.ca.
30. Frahm KM, Shepelyansky DL. Wikipedia networks of 24 editions of 2017; 2017. Available from: http://www.quantware.ups-tlse.fr/QWLIB/24wiki2017.
31. Chepelianskii AD. Towards physical laws for software architecture. arXiv. 2010;arXiv:1003.5455.
32. Zhirov AO, Zhirov OV, Shepelyansky DL. Two-dimensional ranking of Wikipedia articles. The European Physical Journal B. 2010;77(4):523–531. doi: 10.1140/epjb/e2010-10500-7
33. Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: A Software Environment for Integrated Models of Biomolecular Interaction Networks. Genome Research. 2003;13(11):2498–2504. doi: 10.1101/gr.1239303 14597658
34. Kaatsch P. Epidemiology of childhood cancer. Cancer Treatment Reviews. 2010;36(4):277–285. doi: 10.1016/j.ctrv.2010.02.003 20231056
35. Wikipedia. Talc; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Talc.
36. Wikipedia. Methotrexate; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Methotrexate.
37. Wikipedia. Thalidomide; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Thalidomide.
38. Wikipedia. Paclitaxel; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Paclitaxel.
39. Wikipedia. Bicalutamide; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Bicalutamide.
40. Global Burden of Disease Study. Global, regional, and national age-sex-specific mortality for 282 causes of death in 195 countries and territories, 1980–2017: a systematic analysis for the Global Burden of Disease Study 2017. The Lancet. 2018;392(10159):1736—1788. doi: 10.1016/S0140-6736(18)32203-7
41. Global Burden of Disease Study. Global, regional, and national disability-adjusted life-years (DALYs) for 359 diseases and injuries and healthy life expectancy (HALE) for 195 countries and territories, 1990–2017: a systematic analysis for the Global Burden of Disease Study 2017. The Lancet. 2018;392(10159):1736—1788. doi: 10.1016/S0140-6736(18)32203-7
42. Wikipedia. Breast cancer; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Breast_cancer.
43. Wikipedia. Prostate cancer; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Prostate_cancer.
44. Wong MCS, Jiang JY, Goggins WB, Liang M, Fang Y, Fung FDH, et al. International incidence and mortality trends of liver cancer: a global profile. Scientific Reports. 2017;7:45846. doi: 10.1038/srep45846 28361988
45. Brenner H, Rothenbacher D, Arndt V. In: Verma M, editor. Epidemiology of Stomach Cancer. Totowa, NJ: Humana Press; 2009. p. 467–477. Available from: https://doi.org/10.1007/978-1-60327-492-0_23.
46. Karimi P, Islami F, Anandasabapathy S, Freedman ND, Kamangar F. Gastric Cancer: Descriptive Epidemiology, Risk Factors, Screening, and Prevention. Cancer Epidemiology and Prevention Biomarkers. 2014;23(5):700–713. doi: 10.1158/1055-9965.EPI-13-1057
47. Haggar FA, Boushey RP. Colorectal Cancer Epidemiology: Incidence, Mortality, Survival, and Risk Factors. Clinics in Colon and Rectal Surgery. 2009;22(04):191–197. doi: 10.1055/s-0029-1242458 21037809
48. Miranda-Filho A, Piñeros M, Ferlay J, Soerjomataram I, Monnereau A, Bray F. Epidemiological patterns of leukaemia in 184 countries: a population-based study. The Lancet Haematology. 2018;5(1):e14–e24. doi: 10.1016/S2352-3026(17)30232-6 29304322
49. Wikipedia. Burkitt’s lymphoma; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Burkitt’s_lymphoma.
50. Wikipedia. Melanoma; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Melanoma.
51. Kimura T. East meets West: ethnic differences in prostate cancer epidemiology between East Asians and Caucasians. Chin J Cancer. 2012;31(9):421–429. doi: 10.5732/cjc.011.10324 22085526
52. Wakai K. Descriptive epidemiology of prostate cancer in Japan and Western countries. Nippon Rinsho. 2005;63(2):207–212. 15714967
53. Badmus TA, Adesunkanmi ARK, Yusuf BM, Oseni GO, Eziyi AK, Bakare TIB, et al. Burden of Prostate Cancer in Southwestern Nigeria. Urology. 2010;76(2):412—416. https://doi.org/10.1016/j.urology.2010.03.020. 20451979
54. Wikipedia. Lung cancer; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Lung_cancer.
55. Witschi H. A Short History of Lung Cancer. Toxicological Sciences. 2001;64(1):4–6. doi: 10.1093/toxsci/64.1.4 11606795
56. Wikipedia. San Marino; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/San_Marino.
57. Wikipedia. BRAF; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/BRAF_(gene).
58. Wikipedia. Rituximab; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Rituximab.
59. Gunturu KS, Woo Y, Beaubier N, Remotti HE, Saif MW. Gastric cancer and trastuzumab: first biologic therapy in gastric cancer. Therapeutic Advances in Medical Oncology. 2013;5(2):143–151. doi: 10.1177/1758834012469429 23450234
60. Wikipedia. Jenks natural breaks optimization; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Jenks_natural_breaks_optimization.
61. Wikipedia. Brain Tumor; 2018. Wikipedia. Available from: https://en.wikipedia.org/wiki/Brain_tumor.
Článek vyšel v časopise
PLOS One
2019 Číslo 9
- Proč jsou nemocnice nepřítelem spánku? A jak to změnit?
- Dlouhodobá ketodieta může poškozovat naše orgány
- „Jednohubky“ z klinického výzkumu – 2024/42
- Metamizol jako analgetikum první volby: kdy, pro koho, jak a proč?
- MUDr. Jana Horáková: Remise již dosahujeme u více než 80 % pacientů s myastenií