Poster
in
Workshop: Generative AI and Biology (GenBio@NeurIPS2023)

Identifying Neglected Hypotheses in Neurodegenerative Disease with Large Language Models

Spencer Hey ⋅ Darren Angle ⋅ Christopher Chatham

Keywords: structured information extraction neglected hypotheses scientific hypothesis formulation Large language models

Project Page [ OpenReview]

Abstract

Neurodegenerative diseases remain a medical challenge, with existing treatments for many such diseases yielding limited benefits. Yet, research into diseases like Alzheimer's often focuses on a narrow set of hypotheses, potentially overlooking promising research avenues. We devised a workflow to curate scientific publications, extract central hypotheses using gpt3.5-turbo, convert these hypotheses into high-dimensional vectors, and cluster them hierarchically. Employing a secondary agglomerative clustering on the "noise" subset, followed by GPT-4 analysis, we identified signals of neglected hypotheses. This methodology unveiled several notable neglected hypotheses including treatment with coenzyme Q10, CPAP treatment to slow cognitive decline, and lithium treatment in Alzheimer's. We believe this methodology offers a novel and scalable approach to identifying overlooked hypotheses and broadening the neurodegenerative disease research landscape.

Chat is not available.