Bingjie YAN
Logo Hong Kong Polytechnic University
Logo School of Optometry, Hong Kong Polytechnic University

I am a first-year Ph.D. student at School of Optometry, Hong Kong Polytechnic University, supervised by Prof. Danli SHI and co-supervised by Prof. Mingguang HE. Before that, I received my M.Eng. degree in Electronic Information from Institute of Computing Technology, Chinese Academy of Sciences, under the supervision of Prof. Yiqiang CHEN and Prof. Xinlong JIANG, and my B.Eng. degree in Software Engineering from Hainan University.

Research Interests:
Medical Agent & Agentic AI: AI agents for healthcare, autonomous medical systems, and intelligent clinical decision support.

Please feel free to contact me via email if you are interested in my research!


Education
  • Hong Kong Polytechnic University
    Hong Kong Polytechnic University
    Ph.D. Student in School of Optometry
    Sep. 2025 - present
  • Institute of Computing Technology, Chinese Academy of Sciences
    Institute of Computing Technology, Chinese Academy of Sciences
    M.Eng. in Electronic Information
    Sep. 2022 - Jun. 2025
  • Hainan University
    Hainan University
    B.Eng. in Software Engineering
    Sep. 2018 - Jun. 2022
Honors & Awards
  • Institute of Computing Technology Ziqiang International Exchange Scholarship
    2025
  • National Scholarship (3%)
    2024
  • First-Class Scholarship of ICT, CAS
    2024
  • Outstanding Graduate of Hainan University
    2022
  • Many Innovation and Entrepreneurship Awards and Competitions
    2019-2022
  • First-Class Scholarship of Hainan University
    2019
Services
  • AAAI Conference on Artificial Intelligence
    Program Committee Member & Reviewer
    2025 - present
  • IEEE Hainan University Branch
    President
    Mar. 2021 - Jun. 2022
  • Association of Robotics and Artificial Intelligence
    Co-Founder & Vice President
    Jul. 2020 - Jun. 2022
  • Cyberspace Security Association @Hainan University
    Vice President
    Sep. 2020 - Jun. 2021
Experience
  • Institute of AI Industry Research, Tsinghua University
    Research Intern
    Jul. 2024 - Oct. 2024
  • Hong Kong Baptist University
    Research Assistant
    Jun. 2024 - Jul. 2024
  • FedML Inc.
    Remote Research Intern
    Jun. 2022 - Sep. 2022
News (view all )
2025
My Google Scholar citations surpassed 500! 🎉 See my Google Scholar profile.
Sep 07
I started my Ph.D. journey at School of Optometry, Hong Kong Polytechnic University! Excited to begin my Ph.D. studies under the supervision of Prof. Danli SHI and co-supervision of Prof. Mingguang HE!
Aug 31
I successfully graduated from Institute of Computing Technology, Chinese Academy of Sciences with M.Eng. degree! Thanks to my supervisors Prof. Yiqiang CHEN and Prof. Xinlong JIANG for their guidance during my master's study!
Jun 29
Our survey paper "Towards Harnessing the Collaborative Power of Large and Small Models for Domain Tasks" is available on arXiv!
Apr 23
2024
Our paper "Doubly-Bounded Queue for Constrained Online Learning: Keeping Pace with Dynamics of Both Loss and Constraint" is accepted by AAAI 2025 (CCF-A)!
Dec 09
Selected Publications (view all )
Buffalo: Biomedical Vision-Language Understanding with Cross-Modal Prototype and Federated Foundation Model Collaboration
Buffalo: Biomedical Vision-Language Understanding with Cross-Modal Prototype and Federated Foundation Model Collaboration

Bingjie YAN, Qian CHEN, Yiqiang CHEN, Xinlong JIANG, Wuliang HUANG, Bingyu WANG, Zhirui WANG, Chenlong GAO, Teng ZHANG ( corresponding author )

ACM CIKM'24, CCF-B, CORE-A (Acceptance Rate: 22.7%) (2024) Oral

Federated learning (FL) enables collaborative learning across multiple biomedical data silos with multimodal foundation models while preserving privacy. Due to the heterogeneity in data processing and collection methodologies across diverse medical institutions and the varying medical inspections patients undergo, modal heterogeneity exists in practical scenarios, where severe modal heterogeneity may even prevent model training. With privacy considerations, data transfer cannot be permitted, restricting knowledge exchange among different clients. To trickle these issues, we propose a cross-modal prototype imputation method for visual-language understanding (Buffalo) with only a slight increase in communication cost, which can improve the performance of fine-tuning general foundation models for downstream biomedical tasks. We conducted extensive experiments on medical report generation and biomedical visual question-answering tasks. The results demonstrate that Buffalo can fully utilize data from all clients to improve model generalization compared to other modal imputation methods in three modal heterogeneity scenarios, approaching or even surpassing the performance in the ideal scenario without missing modality.

Buffalo: Biomedical Vision-Language Understanding with Cross-Modal Prototype and Federated Foundation Model Collaboration

Bingjie YAN, Qian CHEN, Yiqiang CHEN, Xinlong JIANG, Wuliang HUANG, Bingyu WANG, Zhirui WANG, Chenlong GAO, Teng ZHANG ( corresponding author )

ACM CIKM'24, CCF-B, CORE-A (Acceptance Rate: 22.7%) (2024) Oral

Federated learning (FL) enables collaborative learning across multiple biomedical data silos with multimodal foundation models while preserving privacy. Due to the heterogeneity in data processing and collection methodologies across diverse medical institutions and the varying medical inspections patients undergo, modal heterogeneity exists in practical scenarios, where severe modal heterogeneity may even prevent model training. With privacy considerations, data transfer cannot be permitted, restricting knowledge exchange among different clients. To trickle these issues, we propose a cross-modal prototype imputation method for visual-language understanding (Buffalo) with only a slight increase in communication cost, which can improve the performance of fine-tuning general foundation models for downstream biomedical tasks. We conducted extensive experiments on medical report generation and biomedical visual question-answering tasks. The results demonstrate that Buffalo can fully utilize data from all clients to improve model generalization compared to other modal imputation methods in three modal heterogeneity scenarios, approaching or even surpassing the performance in the ideal scenario without missing modality.

Model Trip: Enhancing Privacy and Fairness in Model Fusion across Multi-Federations for Trustworthy Global Healthcare
Model Trip: Enhancing Privacy and Fairness in Model Fusion across Multi-Federations for Trustworthy Global Healthcare

Qian CHEN, Yiqiang CHEN, Bingjie YAN, Xinlong JIANG, Xiaojin ZHANG, Yan KANG, Teng ZHANG, Wuliang HUANG, Chenlong GAO, Lixin FAN, Qiang YANG ( corresponding author )

ICDE'24, CCF-A (2024) Oral

Federated Learning has emerged as a revolutionary innovation in the evolving landscape of global healthcare, fostering collaboration among institutions and facilitating collaborative data analysis. As practical applications continue to proliferate, numerous federations have formed in different regions. The optimization and sustainable development of federation-pretrained models have emerged as new challenges. These challenges primarily encompass privacy, population shift and data dependency, which may lead to severe consequences such as the leakage of sensitive information within models and training samples, unfair model performance and resource burdens. To tackle these issues, we propose FairFusion, a cross-federation model fusion approach that enhances privacy and fairness. FairFusion operates across federations within a Model Trip paradigm, integrating knowledge from diverse federations to continually enhance model performance. Through federated model fusion, multi-objective quantification and optimization, FairFusion obtains trustworthy solutions that excel in utility, privacy and fairness. We conduct comprehensive experiments on three public real-world healthcare datasets. The results demonstrate that FairFusion achieves outstanding model fusion performance in terms of utility and fairness across various model structures and subgroups with sensitive attributes while guaranteeing model privacy.

Model Trip: Enhancing Privacy and Fairness in Model Fusion across Multi-Federations for Trustworthy Global Healthcare

Qian CHEN, Yiqiang CHEN, Bingjie YAN, Xinlong JIANG, Xiaojin ZHANG, Yan KANG, Teng ZHANG, Wuliang HUANG, Chenlong GAO, Lixin FAN, Qiang YANG ( corresponding author )

ICDE'24, CCF-A (2024) Oral

Federated Learning has emerged as a revolutionary innovation in the evolving landscape of global healthcare, fostering collaboration among institutions and facilitating collaborative data analysis. As practical applications continue to proliferate, numerous federations have formed in different regions. The optimization and sustainable development of federation-pretrained models have emerged as new challenges. These challenges primarily encompass privacy, population shift and data dependency, which may lead to severe consequences such as the leakage of sensitive information within models and training samples, unfair model performance and resource burdens. To tackle these issues, we propose FairFusion, a cross-federation model fusion approach that enhances privacy and fairness. FairFusion operates across federations within a Model Trip paradigm, integrating knowledge from diverse federations to continually enhance model performance. Through federated model fusion, multi-objective quantification and optimization, FairFusion obtains trustworthy solutions that excel in utility, privacy and fairness. We conduct comprehensive experiments on three public real-world healthcare datasets. The results demonstrate that FairFusion achieves outstanding model fusion performance in terms of utility and fairness across various model structures and subgroups with sensitive attributes while guaranteeing model privacy.

FedEYE: A Scalable and Flexible End-to-end Federated Learning Platform for Ophthalmology
FedEYE: A Scalable and Flexible End-to-end Federated Learning Platform for Ophthalmology

Bingjie YAN*, Danmin CAO*, Xinlong JIANG, Yiqiang CHEN, Weiwei DAI, Fan DONG, Wuliang HUANG, Teng ZHANG, Chenlong GAO, Qian CHEN, Zhen YAN, Zhirui WANG (* equal contribution, corresponding author )

Patterns, Cell Press, JCR-Q1, IF=6.7 (2024)

Federated learning (FL) enables training machine learning models on decentralized medical data while preserving privacy. Despite growing research on FL algorithms and systems, building real-world FL applications requires extensive expertise, posing barriers for medical researchers. FedEYE, an end-to-end FL platform tailored for ophthalmologists without programming skills, is developed here to easily create federated projects on tasks like image classification. The platform provides rich capabilities, scalability, flexible deployment, and separation of concerns. With user-friendly interfaces and comprehension of underlying mechanisms, FedEYE strives to democratize FL for ophthalmology.

FedEYE: A Scalable and Flexible End-to-end Federated Learning Platform for Ophthalmology

Bingjie YAN*, Danmin CAO*, Xinlong JIANG, Yiqiang CHEN, Weiwei DAI, Fan DONG, Wuliang HUANG, Teng ZHANG, Chenlong GAO, Qian CHEN, Zhen YAN, Zhirui WANG (* equal contribution, corresponding author )

Patterns, Cell Press, JCR-Q1, IF=6.7 (2024)

Federated learning (FL) enables training machine learning models on decentralized medical data while preserving privacy. Despite growing research on FL algorithms and systems, building real-world FL applications requires extensive expertise, posing barriers for medical researchers. FedEYE, an end-to-end FL platform tailored for ophthalmologists without programming skills, is developed here to easily create federated projects on tasks like image classification. The platform provides rich capabilities, scalability, flexible deployment, and separation of concerns. With user-friendly interfaces and comprehension of underlying mechanisms, FedEYE strives to democratize FL for ophthalmology.

All publications