TY - JOUR
T1 - Computer-Generated R.E.N.A.L. Nephrometry Scores Yield Comparable Predictive Results to Those of Human-Expert Scores in Predicting Oncologic and Perioperative Outcomes
AU - Heller, N.
AU - Tejpaul, R.
AU - Isensee, F.
AU - Benidir, T.
AU - Hofmann, M.
AU - Blake, P.
AU - Rengal, Z.
AU - Moore, K.
AU - Sathianathen, N.
AU - Kalapara, A.
AU - Rosenberg, J.
AU - Peterson, S.
AU - Walczak, E.
AU - Kutikov, A.
AU - Uzzo, R. G.
AU - Palacios, D. A.
AU - Remer, E. M.
AU - Campbell, S. C.
AU - Papanikolopoulos, N.
AU - Weight, Christopher J.
N1 - Publisher Copyright:
© 2022 by AMERICAN UROLOGICAL ASSOCIATION EDUCATION AND RESEARCH, INC.
PY - 2022/5/1
Y1 - 2022/5/1
N2 - Purpose: We sought to automate R.E.N.A.L. (for radius, exophytic/endophytic, nearness of tumor to collecting system, anterior/posterior, location relative to polar line) nephrometry scoring of preoperative computerized tomography scans and create an artificial intelligence-generated score (AI-score). Subsequently, we aimed to evaluate its ability to predict meaningful oncologic and perioperative outcomes as compared to expert human-generated nephrometry scores (H-scores). Materials and Methods: A total of 300 patients with preoperative computerized tomography were identified from a cohort of 544 consecutive patients undergoing surgical extirpation for suspected renal cancer at a single institution. A deep neural network approach was used to automatically segment kidneys and tumors, and geometric algorithms were developed to estimate components of R.E.N.A.L. nephrometry score. Tumors were independently scored by medical personnel blinded to AI-scores. AI- and H-score agreement was assessed using Lin’s concordance correlation and their predictive abilities for both oncologic and perioperative outcomes were assessed using areas under the curve. Results: Median age was 60 years (IQE 51e68), and 40% were female. Median tumor size was 4.2 cm and 91.3% had malignant tumors, including 27%, 37% and 24% with high stage, grade and necrosis, respectively. There was significant agreement between H-scores and AI-scores (Lin’s ρ=0.59). Both AI- and H-scores similarly predicted meaningful oncologic outcomes (p <0.001) including presence of malignancy, necrosis, and high-grade and -stage disease (p <0.003). They also predicted surgical approach (p <0.004) and specific perioperative outcomes (p <0.05). Conclusions: Fully automated AI-generated R.E.N.A.L. scores are comparable to human-generated R.E.N.A.L. scores and predict a wide variety of meaningful patient-centered outcomes. This unambiguous artificial intelligence-based scoring is intended to facilitate wider adoption of the R.E.N.A.L. score.
AB - Purpose: We sought to automate R.E.N.A.L. (for radius, exophytic/endophytic, nearness of tumor to collecting system, anterior/posterior, location relative to polar line) nephrometry scoring of preoperative computerized tomography scans and create an artificial intelligence-generated score (AI-score). Subsequently, we aimed to evaluate its ability to predict meaningful oncologic and perioperative outcomes as compared to expert human-generated nephrometry scores (H-scores). Materials and Methods: A total of 300 patients with preoperative computerized tomography were identified from a cohort of 544 consecutive patients undergoing surgical extirpation for suspected renal cancer at a single institution. A deep neural network approach was used to automatically segment kidneys and tumors, and geometric algorithms were developed to estimate components of R.E.N.A.L. nephrometry score. Tumors were independently scored by medical personnel blinded to AI-scores. AI- and H-score agreement was assessed using Lin’s concordance correlation and their predictive abilities for both oncologic and perioperative outcomes were assessed using areas under the curve. Results: Median age was 60 years (IQE 51e68), and 40% were female. Median tumor size was 4.2 cm and 91.3% had malignant tumors, including 27%, 37% and 24% with high stage, grade and necrosis, respectively. There was significant agreement between H-scores and AI-scores (Lin’s ρ=0.59). Both AI- and H-scores similarly predicted meaningful oncologic outcomes (p <0.001) including presence of malignancy, necrosis, and high-grade and -stage disease (p <0.003). They also predicted surgical approach (p <0.004) and specific perioperative outcomes (p <0.05). Conclusions: Fully automated AI-generated R.E.N.A.L. scores are comparable to human-generated R.E.N.A.L. scores and predict a wide variety of meaningful patient-centered outcomes. This unambiguous artificial intelligence-based scoring is intended to facilitate wider adoption of the R.E.N.A.L. score.
KW - artificial intelligence
KW - machine learning
UR - http://www.scopus.com/inward/record.url?scp=85128248844&partnerID=8YFLogxK
UR - https://www.webofscience.com/api/gateway?GWVersion=2&SrcApp=purepublist2023&SrcAuth=WosAPI&KeyUT=WOS:000779109100037&DestLinkType=FullRecord&DestApp=WOS
U2 - 10.1097/JU.0000000000002390
DO - 10.1097/JU.0000000000002390
M3 - Article
C2 - 34968146
SN - 0022-5347
VL - 207
SP - 1105
EP - 1114
JO - Journal of Urology
JF - Journal of Urology
IS - 5
ER -