Stars
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded VQA in Robotic Surgery
Semi-Supervised Video Transformer for Surgical Phase Recognition (submission to MICCAI2025)
[MedIA] MeMGB-Diff: Memory-Efficient Multivariate Gaussian Bias Diffusion Model for 3D Bias Field Correction
A repository for surgical action triplet dataset. Data are videos of laparoscopic cholecystectomy that have been annotated with <instrument, verb, target> labels for every surgical fine-grained act…
Meshed-Memory Transformer for Image Captioning. CVPR 2020
(TMI-2024) Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery