Keeping track of scientific challenges, advances and emerging directions is a fundamental part of research. However, researchers face a flood of papers that hinders discovery of important knowledge. In biomedicine, this directly impacts human lives. To address this problem, we present a novel task of extraction and search of scientific challenges and directions, to facilitate rapid knowledge discovery. We construct and release an expert-annotated corpus of texts sampled from full-length papers, labeled with novel semantic categories that generalize across many types of challenges and directions. We focus on a large corpus of interdisciplinary work relating to the COVID-19 pandemic, ranging from biomedicine to areas such as AI and economics. We apply a model trained on our data to identify challenges and directions across the corpus and build a dedicated search engine. In experiments with 19 researchers and clinicians using our system, we outperform a popular scientific search engine in assisting knowledge discovery. Finally, we show that models trained on our resource generalize to the wider biomedical domain and to AI papers, highlighting its broad utility. We make our data, model and search engine publicly available.
Author Information
Dan Lahav (Tel Aviv University)
Jon Saad-Falcon (Georgia Institute of Technology)
Duen Horng Chau (Georgia Tech)
Diyi Yang (Georgia Tech)
Eric Horvitz (Microsoft Research)
Daniel Weld (University of Washington & AI2)
Tom Hope (Allen Institute for Artificial Intelligence)
More from the Same Authors
-
2021 : A Large-Scale Database for Graph Representation Learning »
Scott Freitas · Yuxiao Dong · Joshua Neil · Duen Horng Chau -
2021 : GAM Changer: Editing Generalized Additive Models with Interactive Visualization »
Zijie Jay Wang · Harsha Nori · Duen Horng Chau · Jennifer Wortman Vaughan · Rich Caruana -
2021 : Scientific Language Models for Biomedical Knowledge Base Completion: An Empirical Study »
Rahul Nadkarni · David Wadden · Iz Beltagy · Noah Smith · Hanna Hajishirzi · Tom Hope -
2021 : Bursting Scientific Filter Bubbles: Boosting Innovation via Novel Author Discovery »
Jason Portenoy · Jevin West · Eric Horvitz · Daniel Weld · Tom Hope -
2022 : A Universal Abstraction for Hierarchical Hopfield Networks »
Benjamin Hoover · Duen Horng Chau · Hendrik Strobelt · Dmitry Krotov -
2022 : Dan Weld: From Advice Taking to Active Learning »
Daniel Weld -
2020 : Closing Remarks: Eric Horvitz (Microsoft) »
Eric Horvitz -
2020 Workshop: Cooperative AI »
Thore Graepel · Dario Amodei · Vincent Conitzer · Allan Dafoe · Gillian Hadfield · Eric Horvitz · Sarit Kraus · Kate Larson · Yoram Bachrach -
2019 Poster: Efficient Forward Architecture Search »
Hanzhang Hu · John Langford · Rich Caruana · Saurajit Mukherjee · Eric Horvitz · Debadeepta Dey -
2019 Poster: Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting »
Aditya Grover · Jiaming Song · Ashish Kapoor · Kenneth Tran · Alekh Agarwal · Eric Horvitz · Stefano Ermon -
2019 Poster: Staying up to Date with Online Content Changes Using Reinforcement Learning for Scheduling »
Andrey Kolobov · Yuval Peres · Cheng Lu · Eric Horvitz -
2017 Poster: Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach »
Emmanouil Platanios · Hoifung Poon · Tom M Mitchell · Eric Horvitz -
2012 Poster: Patient Risk Stratification for Hospital-Associated C. Diff as a Time-Series Classification Task »
Jenna Wiens · John Guttag · Eric Horvitz -
2012 Spotlight: Patient Risk Stratification for Hospital-Associated C. Diff as a Time-Series Classification Task »
Jenna Wiens · John Guttag · Eric Horvitz -
2009 Poster: Breaking Boundaries Between Induction Time and Diagnosis Time Active Information Acquisition »
Ashish Kapoor · Eric Horvitz