Timezone: »

FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark 
Mingjie Li · Wenjia Cai · Rui Liu · Yuetian Weng · Xiaoyun Zhao · Cong Wang · Xin Chen · Zhong Liu · Caineng Pan · Mengke Li · yingfeng zheng · Yizhi Liu · Flora Salim · Karin Verspoor · Xiaodan Liang · Xiaojun Chang

The automatic generation of long and coherent medical reports given medical images (e.g. Chest X-ray and Fundus Fluorescein Angiography (FFA)) has great potential to support clinical practice. Researchers have explored advanced methods from computer vision and natural language processing to incorporate medical domain knowledge for the generation of readable medical reports. However, existing medical report generation (MRG) benchmarks lack both explainable annotations and reliable evaluation tools, hindering the current research advances from two aspects: firstly, existing methods can only predict reports without accurate explanation, undermining the trustworthiness of the diagnostic methods; secondly, the comparison among the predicted reports from different MRG methods is unreliable using the evaluation metrics of natural-language generation (NLG). To address these issues, in this paper, we propose an explainable and reliable MRG benchmark based on FFA Images and Reports (FFA-IR). Specifically, FFA-IR is large, with 10,790 reports along with 1,048,584 FFA images from clinical practice; it includes explainable annotations, based on a schema of 46 categories of lesions; and it is bilingual, providing both English and Chinese reports for each case. Besides using the widely used NLG metrics, we propose a set of nine human evaluation criteria to evaluate the generated reports. We envision FFA-IR as a testbed for explainable and reliable medical report generation. We also hope that it can broadly accelerate medical imaging research and facilitate interaction between the fields of medical imaging, computer vision, and natural language processing.

Author Information

Mingjie Li (Monash University)
Wenjia Cai (Zhongshan Ophthalmic Center, Sun Yat-sen University)
Rui Liu (Monash University)
Yuetian Weng (Monash University)
Xiaoyun Zhao (Monash University)
Xin Chen (Sun Yat-sen University)
Zhong Liu (Sun Yat-sen University)
Caineng Pan (University of Massachusetts, Amherst)
Mengke Li (Sun yat-Sen university)
yingfeng zheng (SUN YAT-SEN UNIVERSITY)
Flora Salim (RMIT University)
Karin Verspoor (Royal Melbourne Institute of Technology)
Xiaodan Liang (Sun Yat-sen University)
Xiaojun Chang (Monash University)

More from the Same Authors