Biography
I am currently an Algorithm Expert (Senior Research Scientist) at the Alibaba DAMO Academy. I received my Ph.D. degree in Electronic Engineering at The Chinese University of Hong Kong (CUHK) in 2025, advised by Prof. Hongliang Ren and Prof. Jiewen Lai, with thesis committee members Prof. Tan Lee, Prof. Qi Dou, and Prof. S. Kevin Zhou. During my Ph.D. study, I was a visiting student at Technical University of Munich (TUM), advised by Prof. Nassir Navab; and at The University of Sydney (USYD), advised by Prof. Luping Zhou. Previously, I received the B. Sc. degree in Opto-Electronics Information Science and Engineering from Beijing Institute of Technology (BIT) in 2021, advised by Prof. Kun Gao. I am fortunate to have been working with Prof. Mobarak I. Hoque (UoM), Prof. Zhongliang Jiang (HKU), and Yanheng Li (CityU HK).
My research interests include artificial intelligence and its applications in medical image computing, human-robot interaction, and surgical data science. I recently work on vision-language understanding and generation.
News
-
[08/2025] One paper EndoChat is accepted by Medical Image Analysis (IF: 11.8).
-
[08/2025] I have officially joined Alibaba DAMO Academy as an Algorithm Expert through the Alibaba Star Program.
-
[08/2025] Great honor to join Biomimetic Intelligence and Robotics (IF: 5.4) as a Young Editorial Board Member.
-
[08/2025] One paper EndoVLA is accepted by CoRL 2025.
-
[08/2025] One paper CoPESD is accepted by ACM MM 2025.
-
[07/2025] I will attend MICS 2025 during July 19-20 in Cixi, China.
-
[07/2025] One paper SurgCSS is accepted by Medical Image Analysis (IF: 11.8).
-
[07/2025] Three papers are accepted by IEEE ICIA 2025.
-
[07/2025] Serve as an Invited Faculty at Surgical Data Science (SDS) Summer School 2025 in Strasbourg, France.
-
[06/2025] One paper PedSemiSeg is accepted by Computerized Medical Imaging and Graphics (IF: 4.9).
-
[06/2025] Four papers SurgSora, Endo-4DGX, SurgTPGS, and SPA are accepted by MICCAI 2025.
-
[06/2025] One paper CapsDT is accepted by IROS 2025 as Oral Presentation.
-
[06/2025] One paper EndoARSS is accepted by Advanced Intelligent Systems (IF: 6.1).
-
[06/2025] Successfully passed my Ph.D. Thesis Defense and became a Dr. now!
-
[05/2025] A co-author paper ETSM won the Best Application Award at the poster presentation of MRC Symposium 2025!
-
[05/2025] I will attend MRC Symposium 2025 during May 29-31 in Hong Kong SAR, China.
-
[05/2025] The 3rd C4SR+ Workshop is accepted by IROS 2025, and welcome to join us in Hangzhou, China!
-
[05/2025] One paper GRAD is accepted by Information Fusion (IF: 15.5).
-
[03/2025] Check our recent technical report of DeepSeek in Robotic Surgery.
-
[02/2025] We will host The 1st Workshop on Efficient Medical AI in MICCAI 2025 and see you in Daejeon, Korea!
-
[01/2025] Three papers Endo-2DTAM, SurgPLAN++, and ETSM are accepted by ICRA 2025.
-
[01/2025] One paper PvNeXt is accepted by ICLR 2025.
-
[01/2025] The preprint of our recent work EndoChat on MLLM for endoscopic surgery is online!
-
[12/2024] One paper V2-SfMLearner is accepted by IEEE Transactions on Automation Science and Engineering (IF: 6.4).
-
[12/2024] Check our recent work SurgSora on controllable surgical video generation!
-
[10/2024] Moved to Munich, Germany, and joined CAMP, Technical University of Munich (TUM)!
-
[10/2024] LighTDiff won the MICCAI 2024 Best Paper Runner Up (3/2771)!
-
[10/2024] Two papers are accepted by IEEE ROBIO 2024.
-
[10/2024] Our team won the runner-up of MICCAI 2024 BraTS Challenge on Sub-Sahara-Africa Adult Glioma.
-
[09/2024] Our paper LighTDiff is selected in MICCAI 2024 Best Paper and Young Scientist Award Shortlist.
-
[07/2024] One paper Surgical-VQLA++ is accepted by Information Fusion (IF: 15.5).
-
[07/2024] I am awarded the CUHK PhD IMPAC Award.
-
[06/2024] One paper ASI-Seg is accepted by IROS 2024 as Oral Presentation.
-
[06/2024] Our Surgical-DINO is selected as Long Oral and receives the IPCAI 2024 Best Paper Award Shortlist!
-
[06/2024] Four papers are accepted by MICCAI 2024 (with 2 early accepted & 1 oral)!
-
[05/2024] I am awarded the CUHK Overseas Research Attachment Scholarship 2024-2025.
-
[05/2024] Our poster on surgical instrument tracking receives the Best Poster Award in ICRA 2024 C4SR+ Workshop!
-
[02/2024] One paper CAT-SD is accepted by IEEE TMI (IF: 9.8).
-
[02/2024] One paper EndoOOD is accepted by IEEE ISBI 2024.
-
[01/2024] One paper OSSAR is accepted by ICRA 2024 with the IEEE RAS Travel Grant Award.
-
[01/2024] One paper on AI-assisted needle insertion simulator is accepted by IEEE Sensors Journal.
-
[12/2023] One paper Surgical-DINO is early accepted by IPCAI 2024.
-
[12/2023] One paper on style transfer for VR is accepted by HCII 2024.
-
[10/2023] One paper on multimodal image fusion is accepted by IEEE Sensors Journal.
-
[09/2023] One paper on pain communication is accepted by Frontiers in Physiology.
-
[08/2023] One paper EndoCSS is accepted by Computers in Biology and Medicine.
-
[07/2023] One paper AdaptPoint is accepted by ICCV 2023.
-
[07/2023] I passed my Ph.D. proposal defense and became a Ph.D. candidate!
-
[07/2023] Our paper on intubation landmark detection receives the ICBIR 2023 Best Student Paper Award!
-
[07/2023] Two papers are accepted by ICBIR 2023.
-
[06/2023] One paper on CT airway extraction is accepted by Artificial Intelligence in Medicine.
-
[06/2023] One paper on pulmonary vessel segmentation is accepted by MBEC Journal.
-
[06/2023] One paper on oropharyngeal organ sim-to-real segmentation is accepted by MBEC Journal.
-
[06/2023] Three papers LLCaps, CAT-ViL and CS-VQLA are accepted by MICCAI 2023 (1 oral & 2 posters).
-
[06/2023] One paper on deep learning-assisted micro force sensing is accepted by IEEE TIM.
-
[06/2023] Our poster on sim-to-real receives the Best Poster Award in ICRA 2023 Workshop on Surgical Robotics!
-
[01/2023] One paper Surgical-VQLA is accepted by ICRA 2023 with the IEEE RAS Travel Grant Award.
-
[01/2023] One poster is accepted by IEEE VR 2023.
-
[10/2022] One paper on reinforcement learning and stomach coverage of WCE is accepted by ROBIO 2022.
Selected Awards
2025, IROS IES SYP Travel Award |
2025, MRC Symposium Best Application Award |
2024, MICCAI Best Paper Runner Up (3/2771) |
2024, Runner Up of MICCAI BraTS Challenge on Sub-Sahara-Africa Adult Glioma |
2024, PhD International Mobility for Partnerships and Collaborations (IMPAC) Award |
2024, IPCAI Best Paper Award Shortlist |
2024, CUHK Overseas Research Attachment Scholarship |
2024, Best Poster Award of ICRA 2024 C4SR+ Workshop |
2024, IEEE ICRA RAS Travel Grants Award |
2023, ICBIR Best Student Paper Award |
2023, Best Poster Award of IEEE ICRA Workshop on Surgical Robotics |
2023, IEEE ICRA RAS Travel Grants Award |
2021, Merit Award of EMedIC Global |
2021, CUHK Vice-Chancellor's Ph.D. Scholarship Scheme |
* indicates equal contribution; † indicates project lead/corresponding author.
Preprints
-
Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras
Beilei Cui*, Long Bai*, Mobarakol Islam*, An Wang, Zhiqi Ma, Yiming Huang, Feng Li, Zhen Chen, Zhongliang Jiang, Nassir Navab, Hongliang Ren†
Preprint, 2025.
Book Chapters
-
Ultrasound Guidance and Robotic Procedures: Actual and Future Intelligence
Long Bai*, Lei Zhao*, Hongliang Ren
Handbook of Robotic Surgery, 2024.
-
3D Reconstruction of Deformable Tissues in Robotic Surgery
Mengya Xu, Tiebing Tang, Ziqi Guo, An Wang, Beilei Cui, Long Bai, Hongliang Ren
Handbook of Robotic Surgery, 2024.
Journal Papers
-
EndoChat: Grounded Multimodal Large Language Model for Endoscopic Surgery
Guankun Wang*, Long Bai*, Junyi Wang*, Kun Yuan*, Zhen Li, Tianxu Jiang, Xiting He, Jinlin Wu, Zhen Chen, Zhen Lei, Hongbin Liu, Jiazheng Wang, Fan Zhang, Nicolas Padoy, Nassir Navab, Hongliang Ren†
Medical Image Analysis (MedIA), 2025. (IF: 11.8)
[Code]
-
Rethinking Data Imbalance in Class Incremental Surgical Instrument Segmentation
Shifang Zhao, Long Bai†, Kun Yuan, Feng Li, Jieming Yu, Wenzhen Dong, Guankun Wang, Mobarakol Islam, Nicolas Padoy, Nassir Navab, Hongliang Ren†
Medical Image Analysis (MedIA), 2025. (IF: 11.8)
[Code]
-
PedSemiSeg: Pedagogy-inspired Semi-supervised Polyp Segmentation
An Wang, Haoyu Ma, Long Bai, Yanan Wu, Mengya Xu, Yang Zhang, Mobarakol Islam, Hongliang Ren†
Computerized Medical Imaging and Graphics, 2025. (IF: 4.9)
-
EndoARSS: Adapting Spatially-Aware Foundation Model for Efficient Activity Recognition and Semantic Segmentation in Endoscopic Surgery
Guankun Wang, Rui Tang, Mengya Xu, Long Bai, Huxin Gao, Hongliang Ren†
Advanced Intelligent Systems, 2025. (IF: 6.1)
-
Multimodal Graph Representation Learning for Robust Surgical Workflow Recognition with Adversarial Feature Disentanglement
Long Bai*, Boyi Ma*, Ruohan Wang, Guankun Wang, Beilei Cui, Zhongliang Jiang, Mobarakol Islam, Zhe Min, Jiewen Lai, Nassir Navab, Hongliang Ren†
Information Fusion, 2025. (IF: 15.5)
-
V2-SfMLearner: Learning Monocular Depth and Ego-motion for Multimodal Wireless Capsule Endoscopy
Long Bai*, Beilei Cui*, Liangyu Wang*, Yanheng Li, Shilong Yao, Sishen Yuan, Yanan Wu, Yang Zhang, Max Q.-H. Meng, Zhen Li, Weiping Ding, Hongliang Ren†
IEEE Transactions on Automation Science and Engineering (TASE), 2025. (IF: 6.4)
-
Surgical-VQLA++: Adversarial Contrastive Learning for Calibrated Robust Visual Question-Localized Answering in Robotic Surgery
Long Bai*, Guankun Wang*, Mobarakol Islam*, Lalithkumar Seenivasan, An Wang, Hongliang Ren†
Information Fusion, 2025. (IF: 15.5)
[Code & Data]
-
Privacy-Preserving Synthetic Continual Semantic Segmentation for Robotic Surgery
Mengya Xu*, Mobarakol Islam*, Long Bai, Hongliang Ren†
IEEE Transactions on Medical Imaging (TMI), 2024. (IF: 9.8)
[Code]
-
Surgical-DINO: Adapter Learning of Foundation Models for Depth Estimation in Endoscopic Surgery
Beilei Cui*, Mobarakol Islam*, Long Bai, Hongliang Ren†
International Journal of Computer Assisted Radiology and Surgery (IJCARS, Presented at IPCAI), 2024. (IF: 2.3)
(IPCAI Best Paper Award Shortlist, Long Oral)
[Code]
-
Data-driven 3D Tactile Cues with Intermediate Soft Interfaces towards Training Needle Insertions
Ruijie Tang*, Shilong Yao*, Long Bai, Hong Yan, Max Q.-H. Meng†, Hongliang Ren†
IEEE Sensors Journal, 2024. (IF: 4.5)
-
Rethinking Exemplars for Continual Semantic Segmentation in Endoscopy Scenes: Entropy-based Mini-Batch Pseudo-Replay
Guankun Wang*, Long Bai*, Yanan Wu, Tong Chen, Hongliang Ren†
Computers in Biology and Medicine (CBM), 2023. (IF: 7.0)
-
Joint Sparse Representations and Coupled Dictionary Learning in Multi-Source Heterogeneous Image Pseudo-color Fusion
Long Bai, Shilong Yao, Kun Gao†, Yanjun Huang, Ruijie Tang, Hong Yan, Max Q.-H. Meng, Hongliang Ren†
IEEE Sensors Journal, 2023. (IF: 4.3)
-
Two-stage Contextual Transformer-based Convolutional Neural Network for Airway Extraction from CT Images
Yanan Wu, Shuiqing Zhao, Shouliang Qi†, Jie Feng, Haowen Pang, Runsheng Chang, Long Bai, Mengqi Li, Shuyue Xia, Wei Qian, Hongliang Ren†
Artificial Intelligence in Medicine (AIM), 2023. (IF: 6.1)
[Code]
-
An RNN-LSTM Enhanced Compact and Affordable Micro Force Sensing System for Interventional Continuum Robots with Interchangeable End-Effector Instruments
Shilong Yao*, Ruijie Tang*, Long Bai, Hong Yan, Hongliang Ren†, Li Liu†
IEEE Transactions on Instrumentation and Measurement, 2023. (IF: 5.6)
-
Domain Adaptive Sim-to-Real Segmentation of Oropharyngeal Organs
Guankun Wang, Tian-Ao Ren, Jiewen Lai, Long Bai, Hongliang Ren†
Medical & Biological Engineering & Computing (MBEC), 2023.
[Dataset]
-
Transformer-based 3D U-Net for Pulmonary Vessel Segmentation and Artery-vein Separation from CT Images
Yanan Wu, Shouliang Qi†, Meihuan Wang, Shuiqing Zhao, Haowen Pang, Jiaxuan Xu, Long Bai, Hongliang Ren†
Medical & Biological Engineering & Computing (MBEC), 2023.
[Code]
-
Rethinking Pain Communication of Patients with Alzheimer's Disease through E-textile Interaction Design
Yanheng Li*, Long Bai*, Yaxuan Mao, Hongliang Ren, Yu Qiao, Xin Tong†, Ray LC†
Frontiers in Physiology, 2023.
Conference Papers
-
EndoVLA: Dual-Phase Vision-Language-Action Model for Autonomous Tracking in Endoscopy
Chi Kit Ng*, Long Bai*, Guankun Wang*, Yupeng Wang, Huxin Gao, Kun Yuan, Chenhan Jin, Tieyong Zeng, Hongliang Ren†
9th Annual Conference on Robot Learning (CoRL), 2025.
-
CoPESD: A Multi-Level Surgical Motion Dataset for Training Large Vision-Language Models to Co-Pilot Endoscopic Submucosal Dissection
Guankun Wang, Han Xiao, Renrui Zhang, Huxin Gao, Long Bai, Xiaoxiao Yang, Zhen Li, Hongsheng Li†, Hongliang Ren†
ACM International Conference on Multimedia (MM), 2025.
[Page]
-
SurgSora: Object-Aware Diffusion Model for Controllable Surgical Video Generation
Tong Chen, Shuya Yang, Junyi Wang, Long Bai†, Hongliang Ren, Luping Zhou†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
[Page]
-
Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian Splatting
Yiming Huang*, Long Bai*, Beilei Cui*, Yanheng Li, Tong Chen, Jie Wang, Jinlin Wu, Zhen Lei, Hongbin Liu, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
[Page]
-
SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian Splatting
Yiming Huang*, Long Bai*, Beilei Cui*, Kun Yuan, Guankun Wang, Mobarak I. Hoque, Nicolas Padoy, Nassir Navab, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
[Page]
-
Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement
Kun Yuan, Tingxuan Chen, Shi Li, Joel Lavanchy, Christian Heiliger, Ege Özsoy, Yiming Huang, Long Bai, Nassir Navab, Vinkle Srivastav, Hongliang Ren, Nicolas Padoy
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2025.
[Code]
-
Geo-RepNet: Geometry-Aware Representation Learning for Surgical Phase Recognition in Endoscopic Submucosal Dissection
Rui Tang, Haochen Yin, Guankun Wang, Long Bai, An Wang, Huxin Gao, Jiazheng Wang, Hongliang Ren†
IEEE International Conference on Information and Automation (ICIA), 2025.
-
NeuroABench: A Multimodal Evaluation Benchmark for Neurosurgical Anatomy Identification
Ziyang Song, Xiaofan Ye, Zelin Zang, Boqiang Xu, Long Bai, Jinlin Wu†, Hongliang Ren, Jiebo Luo, Hongbin Liu, Zhen Lei
IEEE International Conference on Information and Automation (ICIA), 2025.
-
A Comparative Study of Generative and Diffusion Models for Specular Reflection Removal in Endoscopic Videos
Yunqi Cai, An Wang, Rulin Zhou, Long Bai, Jiewen Lai, Hongliang Ren†
IEEE International Conference on Information and Automation (ICIA), 2025.
-
CapsDT: Diffusion-Transformer for Capsule Robot Manipulation
Xiting He, Mingwu Su, Xinqi Jiang, Long Bai, Hongliang Ren†
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2025. (Oral)
-
Advancing Dense Endoscopic Reconstruction with Gaussian Splatting-driven Surface Normal-aware Tracking and Mapping
Yiming Huang*, Beilei Cui*, Long Bai*, Zhen Chen, Jinlin Wu, Zhen Li, Hongbin Liu, Hongliang Ren†
IEEE International Conference on Robotics and Automation (ICRA), 2025.
[Code]
-
ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection
Mengya Xu, Wenjin Mo, Guankun Wang, Huxin Gao, An Wang, Long Bai, Chaoyang Lyu, Xiaoxiao Yang, Zhen Li, Hongliang Ren†
IEEE International Conference on Robotics and Automation (ICRA), 2025.
(MRC Symposium 2025 Best Application Award)
-
SurgPLAN++: Universal Surgical Phase Localization Network for Online and Offline Inference
Zhen Chen, Xingjian Luo, Jinlin Wu†, Long Bai, Zhen Lei, Hongliang Ren, Sebastien Ourselin, Hongbin Liu†
IEEE International Conference on Robotics and Automation (ICRA), 2025.
[Code]
-
TAU-106K: A New Dataset for Comprehensive Understanding of Traffic Accident
Yixuan Zhou*, Long Bai*, Sijia Cai†, Bing Deng, Xing Xu, Heng Tao Shen
International Conference on Learning Representations (ICLR), 2025.
[Code & Data]
-
PvNeXt: Rethinking Network Design and Temporal Motion for Point Cloud Video Recognition
Jie Wang, Tingfa Xu, Lihe Ding, Xinjie Zhang, Long Bai, Jianan Li†
International Conference on Learning Representations (ICLR), 2025.
-
EndoUIC: Promptable Diffusion Transformer for Unified Illumination Correction in Capsule Endoscopy
Long Bai*, Tong Chen*, Qiaozhi Tan*, Wan Jun Nah, Yanheng Li, Zhicheng He, Sishen Yuan, Jinlin Wu, Zhen Chen, Mobarakol Islam, Zhen Li, Hongbin Liu, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2024.
[Code & Data]
-
Endo-4DGS: Endoscopic Monocular Scene Reconstruction with 4D Gaussian Splatting
Yiming Huang*, Beilei Cui*, Long Bai*, Ziqi Guo, Mengya Xu, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2024.
[Code]
-
LighTDiff: Surgical Endoscopic Image Low-Light Enhancement with T-Diffusion
Tong Chen*, Qingcheng Lyu*, Long Bai*, Erjian Guo, Huxin Gao, Xiaoxiao Yang, Hongliang Ren, Luping Zhou†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2024. (Early Accepted)
(MICCAI Best Paper Runner Up, Top 3 out of 2771 Submissions)
[Code]
-
EndoDAC: Efficient Adapting Foundation Model for Self-Supervised Depth Estimation from Any Endoscopic Camera
Beilei Cui*, Mobarakol Islam*, Long Bai*, An Wang, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2024. (Early Accepted)
[Code]
-
ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding
Zhen Chen, Zongmin Zhang, Wenwu Guo, Xingjian Luo, Long Bai, Jinlin Wu†, Hongliang Ren, Hongbin Liu
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2024. (Oral)
[Code]
-
OSSAR: Towards Open-Set Surgical Activity Recognition in Robot-assisted Surgery
Long Bai*, Guankun Wang*, Jie Wang, Xiaoxiao Yang, Huxin Gao, Xin Liang, An Wang, Mobarakol Islam, Hongliang Ren†
IEEE International Conference on Robotics and Automation (ICRA), 2024.
(IEEE RAS Travel Grant Award)
[Code]
-
Registering Neural 4D Gaussians for Endoscopic Surgery
Yiming Huang, Beilei Cui, Ikemura Kei, Jiekai Zhang, Long Bai, Hongliang Ren†
IEEE International Conference on Robotics and Biomimetics (ROBIO), 2024.
-
Markerless Platform-independent Web-Based Augmented Reality with Auto-Scaling and Real-Time Head Tracking towards Neurointerventional Preoperative Planning and Training of Head-mounted Robotic Needle Insertion
Hon Lung Ho, Yupeng Wang, An Wang, Long Bai, Hongliang Ren†
IEEE International Conference on Robotics and Biomimetics (ROBIO), 2024.
-
EndoOOD: Uncertainty-aware Out-of-distribution Detection in Capsule Endoscopy Diagnosis
Qiaozhi Tan*, Long Bai*, Guankun Wang*, Mobarakol Islam, Hongliang Ren†
IEEE International Symposium on Biomedical Imaging (ISBI), 2024.
-
Learning to Adapt Foundation Model DINOv2 for Capsule Endoscopy Diagnosis
Bowen Zhang*, Ying Chen*, Long Bai, Yan Zhao, Yuxiang Sun, Yixuan Yuan, Jianhua Zhang, Hongliang Ren†
International Conference on Biomimetic Intelligence and Robotics & Medical Robotics Forum (ICBIR), 2024.
-
Affecting Audience Valence and Arousal in 360 Immersive Environments: How Powerful Neural Style Transfer Is?
Yanheng Li, Long Bai, Yaxuan Mao, Xuening Peng, Zehao Zhang, Antoni B. Chan, Jixing Li, Xin Tong†, Ray LC†
International Conference on Human-Computer Interaction (HCII), 2024.
-
Sample-adaptive Augmentation for Point Cloud Recognition Against Real-world Corruptions
Jie Wang, Lihe Ding, Tingfa Xu, Shaocong Dong, Xinli Xu, Long Bai, Jianan Li†
International Conference on Computer Vision (ICCV), 2023.
[Code]
-
LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion
Long Bai*, Tong Chen*, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2023. (Oral, Top 3%)
[Code & Data]
-
CAT-ViL: Co-Attention Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Long Bai*, Mobarakol Islam*, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2023.
[Code]
-
Revisiting Distillation for Continual Learning on Visual Question Localized-Answering in Robotic Surgery
Long Bai*, Mobarakol Islam*, Hongliang Ren†
Medical Image Computing and Computer Assisted Intervention (MICCAI), 2023.
[Code]
-
Landmark Detection using Transformer Toward Robot-assisted Nasal Airway Intubation
Tianhang Liu, Hechen Li, Long Bai†, Yanan Wu, An Wang, Mobarakol Islam, Hongliang Ren†
International Conference on Biomimetic Intelligence and Robotics & Medical Robotics Forum (ICBIR), 2023.
(Best Student Paper Award)
[Code]
-
Semi-supervised Learning for Segmentation of Bleeding Regions in Video Capsule Endoscopy
Hechen Li, Yanan Wu, Long Bai†, An Wang, Tong Chen, Hongliang Ren†
International Conference on Biomimetic Intelligence and Robotics & Medical Robotics Forum (ICBIR), 2023.
-
Surgical-VQLA: Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Long Bai*, Mobarakol Islam*, Lalithkumar Seenivasan, Hongliang Ren†
IEEE International Conference on Robotics and Automation (ICRA), 2023.
(IEEE RAS Travel Grant Award)
[Code & Data]
-
Deep Reinforcement Learning-Based Control for Stomach Coverage Scanning of Wireless Capsule Endoscopy
Yameng Zhang*, Long Bai*, Li Liu†, Hongliang Ren†, Max Q–H Meng
IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022.
-
The Influence of Age and Gender Information on the Diagnosis of Diabetic Retinopathy: Based on Neural Networks
Long Bai*, Sihang Chen*, Mingyang Gao*, Leila Abdelrahman, Manal Al Ghamdi, Mohamed Abdel-Mottaleb
Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), 2021.
Workshop Papers
-
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery
Boyi Ma, Yanguang Zhao, Jie Wang, Guankun Wang, Kun Yuan, Tong Chen, Long Bai†, Hongliang Ren†
MICCAI CREATE Workshop, 2025.
-
SAM 2 in Robotic Surgery: An Empirical Evaluation for Robustness and Generalization in Surgical Video Segmentation
Jieming Yu, An Wang, Wenzhen Dong, Mengya Xu, Mobarakol Islam, Jie Wang, Long Bai†, Hongliang Ren†
MICCAI Efficient Medical AI (EMA) Workshop, 2025
-
Surgical-LVLM: Learning to Adapt Large Vision-Language Model for Grounded Visual Question Answering in Robotic Surgery
Guankun Wang*, Long Bai*, Wan Jun Nah, Jie Wang, Zhaoxi Zhang, Zhen Chen, Jinlin Wu, Mobarakol Islam, Hongbin Liu, Hongliang Ren†
ICLR FM-Wild Workshop, 2025.
-
Transferring Knowledge from High-Quality to Low-Quality MRI for Adult Glioma Diagnosis
Yanguang Zhao, Long Bai†, Zhaoxi Zhang, Yanan Wu, Mobarakol Islam, Hongliang Ren†
MICCAI BraTS-SSA Challenge, 2024.
(Runner Up of Top-performing Team)
-
A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery
Mengya Xu, Ziqi Guo, An Wang, Long Bai, Hongliang Ren†
MICCAI EARTH Workshop, 2024.
[Code]
-
Adapting SAM for Surgical Instrument Tracking and Segmentation in Endoscopic Submucosal Dissection Videos
Jieming Yu, Long Bai†, Guankun Wang, An Wang, Xiaoxiao Yang, Huxin Gao, Hongliang Ren†
IEEE ICRA C4SR+ Workshop, 2024.
(Best Poster Award)
-
Domain Adaptive Sim-to-Real Segmentation of Oropharyngeal Organs Towards Robot-assisted Intubation
Guankun Wang, Tian-Ao Ren, Jiewen Lai, Long Bai†, Hongliang Ren†
IEEE ICRA Workshop on New Evolutions in Surgical Robotics, 2023.
(Best Poster Award)
[Dataset]
-
The Exploration and Evaluation of Using Neural Style Transfer in Generating Affective 360° Panoramic VR Environments
Yanheng Li, Long Bai, Yaxuan Mao, Xuening Peng, Zehao Zhang, Xin Tong†, Ray LC†
IEEE Conference on Virtual Reality and 3D User Interfaces Abstracts and Workshops, 2023.
Professional Services
Editorial & Conference Services:
Young Editorial Board Member, Biomimetic Intelligence and Robotics (IF: 5.4)
Session Chair, IEEE International Conference on Information and Automation (ICIA) 2025
Session Chair, International Conference on Biomimetic Intelligence and Robotics (ICBIR) 2025
Associate Editor, IEEE International Conference on Robotics and Biomimetics (ROBIO) 2024
Workshop Organizer:
IROS 2025 C4SR+ Workshop
MICCAI 2025 EMA Workshop
MICCAI 2025 CREATE Workshop
MICCAI 2024 EARTH Workshop
Regular Reviewer:
NeurIPS, ICLR, AAAI, MICCAI, CoRL, ICRA, IROS, IPCAI, ISBI, CHI, VR, CSCW, IUI, ROBIO
T-PAMI, T-MI, T-NNLS, T-CSVT, T-ASE, T-IM, RA-L, OJ-IM, IEEE Sensors Journal, MedIA, Information Fusion, Information Sciences, CMIG, AIIM, CBM, MBEC, JSCE, JMRR
Teaching
Supervision & Mentees:
Junyi Wang | Intern 2024-2025 | |
Zhicheng He | Intern 2024-2025 | -> Research Master, NUS |
Xinyu Ma | Intern 2024-2025 | -> Ph.D. Student, MPU |
Yanguang Zhao | Intern 2024-2025 | -> M.Sc. Student, NUS |
Zhaoxi Zhang | Intern 2024-2025 | -> M.Phil. Student, PKU |
Wenzhen Dong | Intern 2023-2025 |
Jieming Yu | Intern 2023-2025 | -> Ph.D. Student, HKUST |
Boyi Ma | Intern 2023-2024 | -> Ph.D. Student, UofT |
Ruohan Wang | Intern 2023-2024 | -> Ph.D. Student, Brown |
Qiaozhi Tan | Intern 2023-2024 | -> Ph.D. Student, CityU HK |
Yihan Ma | Intern 2023-2024 | -> M.Sc. Student, UCL |
Tianhang Liu | CUHK M.Sc. 2022-2023 | -> R&D, ASTRI HK |
Guankun Wang | Intern 2022-2023 | -> Ph.D. Student, CUHK |
Yuanhao Zhao | CUHK M.Sc. 2021-2022 | -> R&D, Samsung China |
Liangyu Wang | CUHK M.Sc. 2021-2022 | -> Ph.D. Student, KAUST |
Tong Chen | Intern 2021-2022 | -> Ph.D. Student, USYD |
Guest Faculty:
Invited Talks:
Intelligent Perception Enhancement & Multimodal Interaction in CAI, BIROB Editorial Forum, Aug 2025. |
Surgical Visual Question Localized-Answering, Pre-ICRA Online, May 2023. |
Lifelong Learning: Background, Methodology, and Applications, HKUST UG, Oct 2022. |
Teaching Assistant:
2024-2025 | Spring | ELEG5757 | Intelligent Wearable Electronics |
2024-2025 | Spring | ELEG3103 | Robotic Perception and Intelligence |
2023-2024 | Spring | ELEG5600 | Advanced Perception for Intelligent Robotics |
2023-2024 | Fall | ENGG2760 | Probability for Engineers |
2022-2023 | Spring | ELEG5757 | Intelligent Wearable Electronics |
2022-2023 | Fall | ENGG2760 | Probability for Engineers |
2021-2022 | Spring | ELEG3201 | Microelectronic Devices and Circuits |
2021-2022 | Fall | ENGG2760 | Probability for Engineers |
Some Links
Lab of Robotics, Embodied AI, and Navigation in Vivo, CUHK.
Chair for Computer Aided Medical Procedures & Augmented Reality (CAMP), TUM.
CAMP Personal Page, TUM.
© Long Bai | Last updated: June 2025