Medical report generation, which aims to automatically generate a long and coherent report for a given medical image, has attracted growing research interest. Existing approaches are mainly supervised and rely heavily on coupled image-report pairs. However, in the medical domain, building a large-scale paired image-report dataset is both time-consuming and expensive. To relax the dependency on paired data, we propose an unsupervised model, Knowledge Graph Auto-Encoder (KGAE), which accepts independent sets of images and reports in training. KGAE consists of a pre-constructed knowledge graph, a knowledge-driven encoder and a knowledge-driven decoder. The knowledge graph works as the shared latent space to bridge the visual and textual domains; the knowledge-driven encoder projects medical images and reports to the corresponding coordinates in this latent space, and the knowledge-driven decoder generates a medical report given a coordinate in this space. Since the knowledge-driven encoder and decoder can be trained with independent sets of images and reports, KGAE is unsupervised. The experiments show that the unsupervised KGAE generates desirable medical reports without using any image-report training pairs. Moreover, KGAE can also work in both semi-supervised and supervised settings, accepting paired images and reports in training. By further fine-tuning with image-report pairs, KGAE consistently outperforms the current state-of-the-art models on two datasets.
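The shared-latent-space idea in the abstract can be illustrated with a toy sketch. This is not the KGAE implementation: the node names, the 2-D embeddings, the feature vectors, and the `encode`/`decode` helpers are all hypothetical, and the "decoder" only returns node names rather than generating a report. It assumes that a "coordinate" can be modeled as a softmax attention distribution over knowledge-graph nodes, so that image features and report features land in the same space.

```python
import math

# Hypothetical toy knowledge graph: node names with made-up 2-D embeddings.
NODES = ["cardiomegaly", "effusion", "normal"]
NODE_EMB = [[1.0, 0.0], [0.0, 1.0], [-1.0, -1.0]]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def encode(features):
    """Project a feature vector (from an image or a report) onto the graph.

    The returned 'coordinate' is an attention distribution over graph nodes
    (an assumption made for this sketch).
    """
    scores = [sum(f * e for f, e in zip(features, emb)) for emb in NODE_EMB]
    return softmax(scores)


def decode(coordinate, top_k=1):
    """Stand-in decoder: return the names of the top_k most attended nodes."""
    ranked = sorted(range(len(NODES)), key=lambda i: coordinate[i], reverse=True)
    return [NODES[i] for i in ranked[:top_k]]


# Pretend outputs of a visual backbone and a text encoder (made-up values).
image_features = [2.0, 0.1]
report_features = [1.8, 0.0]

z_img = encode(image_features)   # image -> coordinate in the shared space
z_txt = encode(report_features)  # report -> coordinate in the same space
print(decode(z_img), decode(z_txt))
```

Because both modalities are mapped into the same node-attention space, a decoder trained only on report-derived coordinates could in principle be applied to image-derived coordinates, which is the intuition behind training without image-report pairs.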
Author Information
Fenglin Liu (Peking University)
Chenyu You (Yale University)
Chenyu You is a Ph.D. student in the Department of Electrical Engineering at Yale University, working with Professor James Duncan. He obtained his master's degree in Electrical Engineering from Stanford University, specializing in Artificial Intelligence (AI). Prior to that, he received his bachelor's degree (with highest honors) in Electrical Engineering and Mathematics from Rensselaer Polytechnic Institute (RPI). He is broadly interested in machine learning theory and algorithms at the intersection of computer and medical vision, natural language processing, signal processing, and distributed systems.
Xian Wu (Tencent)
Shen Ge (Tencent Medical AI Lab)
Sheng Wang (University of Washington)
Xu Sun (Peking University)
More from the Same Authors
-
2022 Poster: Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations »
Peng Jin · Jinfa Huang · Fenglin Liu · Xian Wu · Shen Ge · Guoli Song · David Clifton · Jie Chen -
2022 : Gradient Knowledge Distillation for Pre-trained Language Models »
Lean Wang · Lei Li · Xu Sun -
2023 Poster: Theoretically Modeling Client Data Divergence for Federated Natural Language Backdoor Defense »
Zhiyuan Zhang · Deli Chen · Hao Zhou · Fandong Meng · Jie Zhou · Xu Sun -
2023 Poster: Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition »
Shuhuai Ren · Aston Zhang · Yi Zhu · Shuai Zhang · Shuai Zheng · Mu Li · Alexander Smola · Xu Sun -
2023 Poster: Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective »
Chenyu You · Weicheng Dai · Yifei Min · Fenglin Liu · David Clifton · S. Kevin Zhou · Lawrence Staib · James Duncan -
2023 Poster: FETV: A Benchmark for Fine-Grained Evaluation of Open-Domain Text-to-Video Generation »
Yuanxin Liu · Lei Li · Shuhuai Ren · Rundong Gao · Shicheng Li · Sishuo Chen · Xu Sun · Lu Hou -
2023 Poster: Benchmarking Large Language Models on CMExam - A comprehensive Chinese Medical Exam Dataset »
Junling Liu · Peilin Zhou · Yining Hua · Dading Chong · Zhongyu Tian · Andrew Liu · Helin Wang · Chenyu You · Zhenhua Guo · Zhu Lei · Michael Li -
2022 Spotlight: Lightning Talks 6B-3 »
Lingfeng Yang · Yao Lai · Zizheng Pan · Zhenyu Wang · Weicong Liang · Chuanyang Zheng · Jian-Wei Zhang · Peng Jin · Jing Liu · Xiuying Wei · Yao Mu · Xiang Li · YUHUI YUAN · Zizheng Pan · Yifan Sun · Yunchen Zhang · Jianfei Cai · Hao Luo · zheyang li · Jinfa Huang · Haoyu He · Yi Yang · Ping Luo · Fenglin Liu · Henghui Ding · Borui Zhao · Xiangguo Zhang · Kai Zhang · Pichao WANG · Bohan Zhuang · Wei Chen · Ruihao Gong · Zhi Yang · Xian Wu · Feng Ding · Jianfei Cai · Xiao Luo · Renjie Song · Weihong Lin · Jian Yang · Wenming Tan · Bohan Zhuang · Shanghang Zhang · Shen Ge · Fan Wang · Qi Zhang · Guoli Song · Jun Xiao · Hao Li · Ding Jia · David Clifton · Ye Ren · Fengwei Yu · Zheng Zhang · Jie Chen · Shiliang Pu · Xianglong Liu · Chao Zhang · Han Hu -
2022 Spotlight: Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations »
Peng Jin · Jinfa Huang · Fenglin Liu · Xian Wu · Shen Ge · Guoli Song · David Clifton · Jie Chen -
2022 Poster: Retrieve, Reason, and Refine: Generating Accurate and Faithful Patient Instructions »
Fenglin Liu · Bang Yang · Chenyu You · Xian Wu · Shen Ge · Zhangdaihong Liu · Xu Sun · Yang Yang · David Clifton -
2022 Poster: Class-Aware Adversarial Transformers for Medical Image Segmentation »
Chenyu You · Ruihan Zhao · Fenglin Liu · Siyuan Dong · Sandeep Chinchali · Ufuk Topcu · Lawrence Staib · James Duncan -
2021 : Continual Learning in Large-Scale Pre-Training »
Xu Sun -
2021 Poster: Topology-Imbalance Learning for Semi-Supervised Node Classification »
Deli Chen · Yankai Lin · Guangxiang Zhao · Xuancheng Ren · Peng Li · Jie Zhou · Xu Sun -
2020 Poster: Prophet Attention: Predicting Attention with Future Attention »
Fenglin Liu · Xuancheng Ren · Xian Wu · Shen Ge · Wei Fan · Yuexian Zou · Xu Sun -
2019 Poster: Understanding and Improving Layer Normalization »
Jingjing Xu · Xu Sun · Zhiyuan Zhang · Guangxiang Zhao · Junyang Lin -
2019 Poster: Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations »
Fenglin Liu · Yuanxin Liu · Xuancheng Ren · Xiaodong He · Xu Sun -
2014 Poster: Structure Regularization for Structured Prediction »
Xu Sun