Maintenance Notice

Due to necessary scheduled maintenance, the JMIR Publications website will be unavailable from Monday, March 11, 2019 at 4:00 PM to 4:30 PM EST. We apologize in advance for any inconvenience this may cause you.

Who will be affected?

Advertisement

Citing this Article

Right click to copy or hit: ctrl+c (cmd+c on mac)

Published on 26.09.19 in Vol 5, No 2 (2019): Jul-Dec

Preprints (earlier versions) of this paper are available at http://preprints.jmir.org/preprint/12163, first published Sep 10, 2018.

This paper is in the following e-collection/theme issue:

    Original Paper

    Developing Machine Learning Algorithms for the Prediction of Early Death in Elderly Cancer Patients: Usability Study

    1Department of Geriatric Oncology, Instituto de Medicina Integral Prof Fernando Figueira, Recife, Brazil

    2Instituto Federal de Pernambuco - IFPE, Department os Computational Science, Recife, Brazil

    3Research Center, Instituto Nacional do Cancer - INCA, Rio de Janeiro, Brazil

    *all authors contributed equally

    Corresponding Author:

    Gabrielle Ribeiro Sena, MSc

    Department of Geriatric Oncology

    Instituto de Medicina Integral Prof Fernando Figueira

    Rua dos Coelhos 300

    Recife, 50070-902

    Brazil

    Phone: 55 81 21224100

    Email: gabriellesena8@gmail.com


    ABSTRACT

    Background: The importance of classifying cancer patients into high- or low-risk groups has led many research teams, from the biomedical and bioinformatics fields, to study the application of machine learning (ML) algorithms. The International Society of Geriatric Oncology recommends the use of the comprehensive geriatric assessment (CGA), a multidisciplinary tool to evaluate health domains, for the follow-up of elderly cancer patients. However, no applications of ML have been proposed using CGA to classify elderly cancer patients.

    Objective: The aim of this study was to propose and develop predictive models, using ML and CGA, to estimate the risk of early death in elderly cancer patients.

    Methods: The ability of ML algorithms to predict early mortality in a cohort involving 608 elderly cancer patients was evaluated. The CGA was conducted during admission by a multidisciplinary team and included the following questionnaires: mini-mental state examination (MMSE), geriatric depression scale-short form, international physical activity questionnaire-short form, timed up and go, Katz index of independence in activities of daily living, Charlson comorbidity index, Karnofsky performance scale (KPS), polypharmacy, and mini nutritional assessment-short form (MNA-SF). The 10-fold cross-validation algorithm was used to evaluate all possible combinations of these questionnaires to estimate the risk of early death, considered when occurring within 6 months of diagnosis, in a variety of ML classifiers, including Naive Bayes (NB), decision tree algorithm J48 (J48), and multilayer perceptron (MLP). On each fold of evaluation, tiebreaking is handled by choosing the smallest set of questionnaires.

    Results: It was possible to select CGA questionnaire subsets with high predictive capacity for early death, which were either statistically similar (NB) or higher (J48 and MLP) when compared with the use of all questionnaires investigated. These results show that CGA questionnaire selection can improve accuracy rates and decrease the time spent to evaluate elderly cancer patients.

    Conclusions: A simplified predictive model aiming to estimate the risk of early death in elderly cancer patients is proposed herein, minimally composed by the MNA-SF and KPS. We strongly recommend that these questionnaires be incorporated into regular geriatric assessment of older patients with cancer.

    JMIR Cancer 2019;5(2):e12163

    doi:10.2196/12163

    KEYWORDS



    Introduction

    Background

    Aging is a complex and personal, cumulative, and irreversible phenomenon that goes well beyond chronological age [1]. It involves several biological events associated with a great variety of molecular and cellular damage, leading to the gradual loss of physiological and immunological reserves and a greater risk for neoplasia-related death [1,2]. Assuming that the elderly population is heterogeneous, this population must be considered not only concerning their chronological age. Thus, an objective analysis of their living conditions as well as aspects related to oncological disease and its therapy is also required [3].

    The International Society of Geriatric Oncology has recommended the use of the Comprehensive Geriatric Assessment (CGA) for the evaluation and follow-up of elderly cancer patients [4]. The CGA is a multidisciplinary tool that uses validated instruments to evaluate several elderly health condition domains, such as functional, cognitive, psychological, social, clinical, and nutritional aspects, as well as comorbidities and the use of medication, among others [5,6]. It is also strongly recommended by the geriatrics and gerontology fields in general because it is, in a complex and heterogeneous context, an objective, measurable, and reproducible form of evaluation, adding possibilities to standard clinical laboratory evaluations [7,8]. However, there is no consensus about what and how many instruments should be used. Employing CGA in practice, however, has become a huge challenge, and owing to its complexity and time spent in its application, it is often underutilized by oncologists and not judged as a completely satisfactory solution in practice, which has served as a stimulus for the construction of simpler tools that have the power to predict outcomes and guide clinical decisions [5,9].

    The accurate prediction of a disease outcome is one of the most interesting and challenging tasks for physicians. As a result, a growing trend was noted in the studies published during the past years that applied machine learning (ML) algorithms for modeling cancer survival. This type of algorithms can discover and identify patterns and relationships between them, from complex databases, while they are able to effectively predict future outcomes of a cancer type [10]. On the basis of the study by Kourou et al [11], the accuracy of cancer prediction outcome has significantly improved by 15% to 20% in the previous years, with the application of ML techniques.

    A study combining data from 4 cohorts involving the elderly, 1 including elderly people with neoplasms, proposed to explore the performance of various ML classifiers (Naive Bayes [NB], k-nearest neighbors, artificial neural networks, random forest, and logistic regression) regarding death prediction in 6 months [12]. Another study used ML to predict mortality of patients in 3 to 12 months and to identify patients who could benefit from palliative care [13]. However, no ML application has been proposed using CGA to classify elderly cancer patients.

    Objectives

    Thus, the primary aim of this study was to propose and develop predictive models, using ML and CGA, to estimate the risk of early death in elderly cancer patients. The secondary aims were to optimize the CGA through the selection of the most appropriate instruments.


    Methods

    Comprehensive Geriatric Assessment

    The ability of ML techniques to predict early mortality in a heterogeneous cohort was tested in 608 elderly cancer patients (aged over 60 years), admitted to the oncogeriatrics sector of the Instituto de Medicina Integral Prof. Fernando Figueira - IMIP, from January 2015 to July 2016. The IMIP is a teaching hospital and cancer center located in Recife, Pernambuco, Brazil. On admission to the cohort database, the patients were evaluated by CGA questionnaires presented in Table 1. The questionnaires were collected by a multiprofessional team, comprising a clinical oncologist, a geriatrician, a physiotherapist, a physical educator, a speech therapist, an occupational therapist, and a nutritionist. The project was approved by the IMIP Ethics Committee on Human Research on June 30, 2016, under number 58298316.5.0000.5201.

    Table 1. Questionnaires/features to evaluate elderly health condition domains in Comprehensive Geriatric Assessment.
    View this table

    Preprocess of Database

    The first step was to remove patients presenting redundancies and/or incomplete questionnaires/features. A total of 543 patients remained after that. Data normalization technique for equalizing the range of features, usually employed in the database before feature selection and learning phase, is of important concern in pattern recognition and computer-aided diagnosis [23]. The most common normalization method used during data transformation is the min-max (where the features are mapped into a predefined range, varying from 0 or −1 to 1). The main advantage of min-max normalization is that it preserves the relationships among the original data values [24]. In this work, all features were normalized in a (0,1) interval, calculated as in equation, where v′ is the value normalized, v is the original value, vmin is the minimum value of corresponding feature, and vmax is the maximum value of corresponding feature: v’=(vvmin)/(vmaxvmin).

    Predictive Models

    Predictive modeling is the general concept of building a model that can make predictions. Typically, such a model includes an ML algorithm that learns certain properties from a database to make those predictions. We have presented below a brief summary of the commonly used supervised learning algorithms:

    • Decision tree J48 (J48) [25]: They are tree-like graphs, where the nodes in the graph test certain conditions on a set of features and the branches split the decision toward the leaf nodes. The leaves represent the lowest level in the graph and determine the class labels.
    • Multilayer perceptron (MLP) [26]: They are graph-like classifiers that mimic the structure of a human or animal brain where the interconnected nodes represent the neurons.
    • Naïve Bayes (NB) [27]: They are based on a statistical model (ie, Bayes theorem, calculating posterior probabilities based on the prior probability and the so-called likelihood).

    The purpose of this work was not to introduce the highest accuracy prediction model. The goal was to designate the most relevant questionnaires to evaluate elderly health condition domains in CGA. Therefore, in the experiments, we always used the same configuration with the default parameter values in Weka (Waikato Environment of Knowledge Analysis) from The University of Waikato, version 3.8.3. The advantage of using default parameters is that it does not introduce optimistic bias by tuning the parameter to maximize performance on the test data. Figure 1 shows more details about the values used in each predictive model.

    Figure 1. Parameters used in Decision Tree (J48), Multilayer perceptron, and Naive Bayes algorithms.
    View this figure

    K-Fold Cross-Validation

    Cross-validation (CV) [28] is one of the most widely used methods to assess the generalizability of predictive models [29] and is subject to ongoing active research [30]. K-fold CV comprises dividing the database into K parts (folds) of equal sizes. For this study, a 10-fold CV is used, and each part is held out in turn and the predictive model (J48, MLP, or NB) is trained on the remaining nine-tenths; then, its error rate is calculated on the holdout set. Thus, the learning procedure is executed a total of K times on different training sets (each of which have much in common). Finally, the K error estimates are averaged to yield an overall error estimate. In this work, the folds are made by preserving the percentage of samples for each class.

    Imbalanced Learn

    The learning procedure and the subsequent prediction of predictive models can be affected by the problem of imbalanced database [31]. The balancing issue corresponds to the difference in the number of samples in the different classes. The resulting database presented 92 deaths within 6 months of admission to the service and 451 patients alive at the end of that period. All deaths were attributed to cancer (treatment complications or disease progression). With a greater imbalanced ratio, the decision function favors the class with the largest number of samples, usually referred as the majority class. The way to fight this issue was to generate new training sets on 10-fold CV by random sampling so that the proportion between classes remained at one-to-one.

    Metrics

    The area under receiver operating characteristics curve, or simply area under curve (AUC), has recently been proposed as an alternative single-number measure for evaluating the generalization of learning algorithms [32]. This measure is far better than classification accuracies when the 2 classes are unbalanced and the cost of misclassification is unspecified [33]. An area of 1.0 represents a model that made all predictions perfectly, and an area of 0.5 represents a model as good as random. AUC can be broken down into sensitivity and specificity:

    • Sensitivity is the true positive rate, and for this study, it is the percentage of patients with early death that are predicted correctly.
    • Specificity is also called the true negative rate, for example, the percentage of patients without early death that are predicted correctly.

    Results

    Evaluating All Possible Combinations of Comprehensive Geriatric Assessment Questionnaires

    Feature selection is an important and frequently used technique for dimension reduction by removing irrelevant and/or redundant information from the database to obtain an optimal feature subset. A 10-fold CV was used to evaluate all possible combinations of CGA questionnaires, presented in Table 2, to estimate the risk of early death in elderly cancer patients. Thus, in each fold, the combination of questionnaires with highest AUC is selected. The same folds are applied to all 511 combinations. Tiebreaking is handled by choosing the smallest set of questionnaires. The occurrence of questionnaires selected on the 10-fold CV, using predictive models, is presented in Table 2. In Figure 2, the flowchart of our methodology is shown.

    Table 2. Occurrence of the Comprehensive Geriatric Assessment questionnaires in the 10-folds using decision tree (J48), multilayer perceptron, and Naive Bayes.
    View this table
    Figure 2. Flowchart of methodology.
    View this figure

    Evaluating combinations of occurrences

    Tables 3-5 show the sensibility, specificity, and AUC values expressed as mean (SD) on the 10-fold CV for the NB, J48, and MLP. The subsets of CGA questionnaires, presented in these tables, consider the occurrences of Table 2. The subset of questionnaires with occurrence ≥0, for example, uses all set of CGA questionnaires, as it considers all occurrences greater than or equal to 0. The other subsets use the same logic and are detailed in the footnotes under the tables. In each metric, according to the paired t test, the P value is calculated considering the subset of questionnaires with occurrence ≥0. The experimental results demonstrate that the feature selection can discard questionnaires and finally find out subsets that reduce the dimensionality of data to make the predictive models more efficient and the results more accurate. Thus, a simplified predictive model aiming to estimate the risk of early death in elderly cancer patients is proposed herein, minimally composed by the Mini Nutritional Assessment-Short Form (MNA-SF), accompanied or not by the Karnofsky performance scale (KPS) and/or the Mini-Mental State Examination.

    Table 3. Metrics considering Comprehensive Geriatric Assessment questionnaire subsets on Naive Bayes classifier.
    View this table
    Table 4. Metrics considering comprehensive geriatric assessment questionnaire subsets on decision tree (J48) classifier.
    View this table
    Table 5. Metrics considering comprehensive geriatric assessment questionnaires subsets on multilayer perceptron classifier.
    View this table

    Discussion

    Principal Findings

    Results indicate that the MNA-SF has greater predictive power to estimate the risk of early death in elderly cancer patients as it was selected on the 10-folds. MNA-SF is a rapid test validated for screening for nutritional risk and malnutrition in the elderly population. The predictive value of MNA-SF for early death may be related to the fact that the 6 MNA-SF questions cover areas other than just nutrition, which are frequently included in the CGA, such as mobility, neuropsychological disorders, and self-reported health, in addition to nutrition aspects, including weight loss, reduced food intake, and body mass index. In fact, low MNA-SF may reveal the effects of advanced disease in the overall health of patients, which also affects cancer-related mortality. A Brazilian study showed that abnormal nutritional status was an independent factor associated with hospital death among older patients with various chronic diseases, including cancer [34]. A similar association was also demonstrated in elderly Asian cancer patients who would receive first-line chemotherapy [35]. Finally, a French multicenter study with 348 elderly cancer patients aged 70 years and above also found that low MNA scores were associated with increased risk of premature death [36].

    The results also indicated that KPS questionnaire has proven itself a valuable tool to estimate the risk of early death in elderly cancer patients. In the past decades, various studies have demonstrated the prognostic value of the KPS not only primarily for various cancers [37-40] but also for other disease entities [41]. It can also be considered as a significant indicator of hospitalization and survival time, in addition to identifying risk groups to assist in the orientation of patients to geriatric outpatients [42].

    Limitations

    The efforts of this paper are a starting point. They provide solid evidences and some clinical recommendations. We proposed and developed simple ML models for the prediction of early death in elderly cancer patients. These models are accurate and precise and could be possibly used by clinicians to make proper treatment plans. However, additional research is needed to continue to strengthen the evidence base.

    Conclusions

    The results showed that the MNA-SF and KPS have the highest predictive power to identify elderly patients at risk for early death. We strongly recommend that these questionnaires be incorporated into regular geriatric assessment of older patients with cancer.

    The MNA-SF and the KPS requires only a few minutes to be completed. In addition, both can be easily managed by any member of the multidisciplinary team to help in the early identification of patients at risk, providing information that assists in the planning of interventions and improving the adherence to CGA in daily clinical oncology practice.

    This study also has limitations that should be considered. This is a nonrandomized, single-center, exploratory study of a heterogeneous patient population similar to a real-life population of older patients with cancer. Conversely, some of its weaknesses could be considered the main strengths of the study: this is one of the few studies in Brazil that, in the clinical practice context of a Unified Health System oncology unit, investigated the use of ML algorithms in the prediction of early death in elderly cancer patients.

    Conflicts of Interest

    None declared.

    References

    1. Falandry C, Bonnefoy M, Freyer G, Gilson E. Biology of cancer and aging: a complex association with cellular senescence. J Clin Oncol 2014 Aug 20;32(24):2604-2610. [CrossRef] [Medline]
    2. Hurria A, Togawa K, Mohile SG, Owusu C, Klepin HD, Gross CP, et al. Predicting chemotherapy toxicity in older adults with cancer: a prospective multicenter study. J Clin Oncol 2011 Sep 1;29(25):3457-3465 [FREE Full text] [CrossRef] [Medline]
    3. Wildiers H, Heeren P, Puts M, Topinkova E, Janssen-Heijnen ML, Extermann M, et al. International Society of Geriatric Oncology consensus on geriatric assessment in older patients with cancer. J Clin Oncol 2014 Aug 20;32(24):2595-2603 [FREE Full text] [CrossRef] [Medline]
    4. Extermann M, Aapro M, Bernabei R, Cohen HJ, Droz JP, Lichtman S, Task Force on CGA of the International Society of Geriatric Oncology. Use of comprehensive geriatric assessment in older cancer patients: recommendations from the task force on CGA of the International Society of Geriatric Oncology (SIOG). Crit Rev Oncol Hematol 2005 Sep;55(3):241-252. [CrossRef] [Medline]
    5. Decoster L, van Puyvelde K, Mohile S, Wedding U, Basso U, Colloca G, et al. Screening tools for multidimensional health problems warranting a geriatric assessment in older cancer patients: an update on SIOG recommendations. Ann Oncol 2015 Feb;26(2):288-300. [CrossRef] [Medline]
    6. Song M, Giovannucci E. Preventable incidence and mortality of carcinoma associated with lifestyle factors among white adults in the United States. JAMA Oncol 2016 Sep 1;2(9):1154-1161 [FREE Full text] [CrossRef] [Medline]
    7. Ellis G, Whitehead MA, Robinson D, O'Neill D, Langhorne P. Comprehensive geriatric assessment for older adults admitted to hospital: meta-analysis of randomised controlled trials. Br Med J 2011 Oct 27;343:d6553 [FREE Full text] [CrossRef] [Medline]
    8. Viganò A, Morais JA. The elderly patient with cancer: a holistic view. Nutrition 2015 Apr;31(4):587-589. [CrossRef] [Medline]
    9. Luciani A, Biganzoli L, Colloca G, Falci C, Castagneto B, Floriani I, et al. Estimating the risk of chemotherapy toxicity in older patients with cancer: the role of the vulnerable elders survey-13 (VES-13). J Geriatr Oncol 2015 Jul;6(4):272-279. [CrossRef] [Medline]
    10. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J 2015;13:8-17 [FREE Full text] [CrossRef] [Medline]
    11. Kourou K, Exarchos TP, Exarchos KP, Karamouzis MV, Fotiadis DI. Machine learning applications in cancer prognosis and prediction. Comput Struct Biotechnol J 2015;13:8-17 [FREE Full text] [CrossRef] [Medline]
    12. Makar M, Ghassemi M, Cutler DM, Obermeyer Z. Short-term mortality prediction for elderly patients using medicare claims data. Int J Mach Learn Comput 2015 Jun;5(3):192-197 [FREE Full text] [CrossRef] [Medline]
    13. Avati A, Jung K, Harman S, Downing L, Andrew N, Ng A, et al. Improving Palliative Care With Deep Learning. In: Proceedings of the International Conference on Bioinformatics and Biomedicine. 2017 Presented at: BIBM'17; November 13-16, 2017; Kansas City, MO, USA. [CrossRef]
    14. Charlson ME, Pompei P, Ales KL, MacKenzie C. A new method of classifying prognostic comorbidity in longitudinal studies: development and validation. J Chronic Dis 1987;40(5):373-383. [CrossRef] [Medline]
    15. Yesavage JA, Brink T, Rose TL, Lum O, Huang V, Adey M, et al. Development and validation of a geriatric depression screening scale: a preliminary report. J Psychiatr Res 1982;17(1):37-49. [CrossRef] [Medline]
    16. Craig CL, Marshall AL, Sjöström M, Bauman AE, Booth ML, Ainsworth BE, et al. International physical activity questionnaire: 12-country reliability and validity. Med Sci Sports Exerc 2003 Aug;35(8):1381-1395. [CrossRef] [Medline]
    17. Karnofsky DA. CiNii Articles. 1949. The Clinical Evaluation of Chemotherapeutic Agents in Cancer   URL: https://ci.nii.ac.jp/naid/10005058071/en/ [accessed 2019-08-27]
    18. Katz S, Ford AB, Moskowitz RW, Jackson BA, Jaffe MW. Studies of illness in the aged. The index of ADL: a standardized measure of biological and psychosocial function. J Am Med Assoc 1963 Sep 21;185:914-919. [CrossRef] [Medline]
    19. Folstein MF, Folstein SE, McHugh PR. 'Mini-mental state'. A practical method for grading the cognitive state of patients for the clinician. J Psychiatr Res 1975 Nov;12(3):189-198. [CrossRef] [Medline]
    20. Rubenstein LZ, Harker JO, Salvà A, Guigoz Y, Vellas B. Screening for undernutrition in geriatric practice: developing the short-form mini-nutritional assessment (MNA-SF). J Gerontol A Biol Sci Med Sci 2001 Jun;56(6):M366-M372. [CrossRef] [Medline]
    21. Gnjidic D, Hilmer SN, Blyth FM, Naganathan V, Waite L, Seibel MJ, et al. Polypharmacy cutoff and outcomes: five or more medicines were used to identify community-dwelling older men at risk of different adverse outcomes. J Clin Epidemiol 2012 Sep;65(9):989-995. [CrossRef] [Medline]
    22. Podsiadlo D, Richardson S. The timed 'Up & Go': a test of basic functional mobility for frail elderly persons. J Am Geriatr Soc 1991 Feb;39(2):142-148. [CrossRef] [Medline]
    23. KumarSingh B, Verma K, Thoke AS. Investigations on impact of feature normalization techniques on classifier's performance in breast tumor classification. Int J Comput Appl 2015 Apr 22;116(19):11-15. [CrossRef]
    24. Manikandan G, Sairam N, Sharmili S, Venkatakrishnan S. Achieving privacy in data mining using normalization. Indian J Sci Technol 2013;6(4):4268-4272 [FREE Full text]
    25. Salzberg SL. C4.5: programs for machine learning by J Ross Quinlan Morgan Kaufmann Publishers, Inc, 1993. Mach Learn 1994 Sep;16(3):235-240 [FREE Full text] [CrossRef]
    26. Haykin S. Neural Networks: A Comprehensive Foundation. Second Edition. Upper Saddle River, NJ, USA: Prentice Hall PTR; 1998.
    27. John GH, Langley P. Estimating Continuous Distributions in Bayesian Classifiers. In: Proceedings of the Eleventh Conference on Uncertainty in Artificial Intelligence. 1995 Presented at: UAI'95; August 18-20, 1995; Montréal, Qué, Canada p. 338-345   URL: http://dl.acm.org/citation.cfm?id=2074158.2074196
    28. Stone M. Cross-validatory choice and assessment of statistical predictions. J R Stat Soc Ser B Methodol 2018 Dec 5;36(2):111-133. [CrossRef]
    29. Hastie T, Tibshirani R, Friedman J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction. Second Edition. USA: Springer; 2009.
    30. Bergmeir C, Hyndman RJ, Koo B. A note on the validity of cross-validation for evaluating autoregressive time series prediction. Comput Stat Data An 2018 Apr;120:70-83. [CrossRef]
    31. Lemaître G, Nogueira F, Aridas CK. Imbalanced-learn: a python toolbox to tackle the curse of imbalanced datasets in machine learning. J Mach Learn Res 2017;18(1):559-563 [FREE Full text]
    32. Huang J, Ling CX. Using AUC and accuracy in evaluating learning algorithms. IEEE Trans Knowl Data Eng 2005 Mar;17(3):299-310. [CrossRef]
    33. Scott MJ, Niranjan M, Melvin DG, Prager RW. CiteSeerX. 1998. Maximum Realisable Performance: A Principled Method for Enhancing Performance By Using Multiple Classifiers in Variable Cost Problem Domains   URL: http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.28.6957 [accessed 2019-08-27]
    34. Ferreira LS, Nascimento LF, Marucci MF. Use of the mini nutritional assessment tool in elderly people from long-term institutions of southeast of Brazil. J Nutr Health Aging 2008 Mar;12(3):213-217. [CrossRef] [Medline]
    35. Kanesvaran R, Li H, Koo K, Poon D. Analysis of prognostic factors of comprehensive geriatric assessment and development of a clinical scoring system in elderly Asian patients with cancer. J Clin Oncol 2011 Sep 20;29(27):3620-3627. [CrossRef] [Medline]
    36. Soubeyran P, Fonck M, Blanc-Bisson C, Blanc J, Ceccaldi J, Mertens C, et al. Predictors of early death risk in older patients treated with first-line chemotherapy for cancer. J Clin Oncol 2012 May 20;30(15):1829-1834. [CrossRef] [Medline]
    37. Buccheri G, Ferrigno D, Tamburini M. Karnofsky and ECOG performance status scoring in lung cancer: a prospective, longitudinal study of 536 patients from a single institution. Eur J Cancer 1996 Jun;32A(7):1135-1141. [CrossRef] [Medline]
    38. Maréchal R, Demols A, Gay F, de Maertelaere V, Arvanitaki M, Hendlisz A, et al. Prognostic factors and prognostic index for chemonaïve and gemcitabine-refractory patients with advanced pancreatic cancer. Oncology 2007;73(1-2):41-51. [CrossRef] [Medline]
    39. Carson KA, Grossman SA, Fisher JD, Shaw EG. Prognostic factors for survival in adult patients with recurrent glioma enrolled onto the new approaches to brain tumor therapy CNS consortium phase I and II clinical trials. J Clin Oncol 2007 Jun 20;25(18):2601-2606 [FREE Full text] [CrossRef] [Medline]
    40. Sperduto PW, Kased N, Roberge D, Xu Z, Shanley R, Luo X, et al. Summary report on the graded prognostic assessment: an accurate and facile diagnosis-specific tool to estimate survival for patients with brain metastases. J Clin Oncol 2012 Feb 1;30(4):419-425 [FREE Full text] [CrossRef] [Medline]
    41. Sorror M, Storer B, Sandmaier BM, Maloney DG, Chauncey TR, Langston A, et al. Hematopoietic cell transplantation-comorbidity index and Karnofsky performance status are independent predictors of morbidity and mortality after allogeneic nonmyeloablative hematopoietic cell transplantation. Cancer 2008 May 1;112(9):1992-2001 [FREE Full text] [CrossRef] [Medline]
    42. Crooks V, Waller S, Smith T, Hahn TJ. The use of the Karnofsky performance scale in determining outcomes and risk in geriatric outpatients. J Gerontol 1991 Jul;46(4):M139-M144. [CrossRef] [Medline]


    Abbreviations

    AUC: area under curve
    CGA: Comprehensive Geriatric Assessment
    CV: cross-validation
    J48: Decision Tree
    KPS: Karnofsky performance scale
    ML: machine learning
    MLP: multilayer perceptron
    MNA-SF: Mini Nutritional Assessment-Short Form
    NB: Naive Bayes


    Edited by H Wu; submitted 10.09.18; peer-reviewed by W Tian, J Li, C Gao, H Liu; comments to author 02.11.18; revised version received 14.02.19; accepted 31.07.19; published 26.09.19

    ©Gabrielle Ribeiro Sena, Tiago Pessoa Ferreira Lima, Maria Julia Gonçalves Mello, Luiz Claudio Santos Thuler, Jurema Telles Oliveira Lima. Originally published in JMIR Cancer (http://cancer.jmir.org), 26.09.2019

    This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Cancer, is properly cited. The complete bibliographic information, a link to the original publication on http://cancer.jmir.org/, as well as this copyright and license information must be included.