#0356
Artificial intelligence to identify surgical anatomy for intraoperative guidance during laparoscopic donor nephrectomy
C. Ong1, L. Kyaw1, M. Leung2, Y. Lee2, B. Tsai2, J. Chueh3, H. Tiong1
1National
University Hospital, National University Health System, Department of Urology,
Singapore, Singapore
2Smart Surgery Tek, Taipei, Taiwan
3National Taiwan University Hospital, National Taiwan University,
Department of Urology, Taipei, Taiwan
Introduction:
Although the risk of intraoperative complications of laparoscopic donor nephrectomy (LDN) is now acceptably low, the work continues to minimise technical mishaps during this ‘high stakes’ surgery. This video demonstrates the pilot use of a patented proprietary deep learning (DL)-based computer vision (CV) to automatically recognise key anatomical structures and prevent intraoperative injuries, which is especially crucial during the learning curve.
Material and methods:
7027 images manually annotated by pixels were selected from 16 surgical videos (National University Hospital, NUH) for training as ground truth, and 2266 annotated images from 4 separate surgical videos were used for validation. This ensured a balanced validation ratio of nearly 20% for each label (spleen, left kidney, renal artery, renal vein, and ureter). The YOLO (You Only Look Once) v11x DL network (https://docs.ultralytics.com/models/yolo11/), known for its speed and accuracy in real-time detection, was adapted to train our model. For further optimisation, it uses a sophisticated loss function which incorporates the accuracy of each pixel in segmentation tasks (binary cross-entropy loss), compares the predicted bounding box coordinates against ground truth (bounding box loss), and emphasises the importance of difficult-to-detect labels (distribution focal loss). Metrics were calculated based on true positives (TP) and false negatives (FN) as below: • Precision = TP/(TP+FP) • Recall = TP/(TP+FN) • F1 score = 2(Precision*Recall)/(Precision+Recall) High precision minimises false positives which could disrupt surgical workflows, while high recall ensures comprehensive detection, minimising false negatives that could affect patient safety. F1 serves as the harmonic mean of recall and precision.
Results:
Quantitative evaluation of the validation dataset using the hold-out validation method yielded performance metrics as in the figure below. Prospective evaluation was performed on a video from another surgeon (JC) and institution (National Taiwan University) and also in real-time in NUH.