Skip to yearly menu bar Skip to main content


Search All 2024 Events
 

391 Results

<<   <   Page 2 of 33   >   >>
Wed 19:30 Anime & AI
Nathan Yoo · Cory Li
Workshop
Demo: Harnessing Generative AI for Comprehensive Evaluation of Medical Imaging AI
Yisak Kim · Seunghyun Jang · Soyeon Kim · Kyungmin Jeon · Chang Min Park
Poster
Wed 11:00 Semi-Truths: A Large-Scale Dataset of AI-Augmented Images for Evaluating Robustness of AI-Generated Image detectors
Anisha Pal · Julia Kruk · Mansi Phute · Manognya Bhattaram · Diyi Yang · Duen Horng Chau · Judy Hoffman
Workshop
Provocation: Who benefits from “inclusion” in Generative AI?
Samantha Dalal · Siobhan Mackenzie Hall · Nari Johnson
Workshop
SAGE-RT: Synthetic Alignment data Generation for Safety Evaluation and Red Teaming
Anurakt Kumar · Divyanshu Kumar · Jatan Loya · Nitin Aravind Birur · Tanay Baswa · Sahil Agarwal · Prashanth Harshangi
Poster
Thu 16:30 Evaluating the World Model Implicit in a Generative Model
Keyon Vafa · Justin Chen · Ashesh Rambachan · Jon Kleinberg · Sendhil Mullainathan
Poster
Thu 16:30 ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Shenghai Yuan · Jinfa Huang · Yongqi Xu · YaoYang Liu · Shaofeng Zhang · Yujun Shi · Rui-Jie Zhu · Xinhua Cheng · Jiebo Luo · Li Yuan
Poster
Fri 11:00 GREAT Score: Global Robustness Evaluation of Adversarial Perturbation using Generative Models
ZAITANG LI · Pin-Yu Chen · Tsung-Yi Ho
Workshop
Evaluating synergies among generative design models for multi-objective optimization of drug-like proteins
June Shin · Nathan Rollins · Jordan Anderson · Grace Carey · Allison Colthart · Thomas Hopf · Ivan Mascanfroni · Jyothsna Visweswaraiah · Yi Xing · Kevin Otipoby · Nathan Higginson-Scott · Ryan Peckner
Poster
Wed 16:30 MR-Ben: A Meta-Reasoning Benchmark for Evaluating System-2 Thinking in LLMs
Zhongshen Zeng · Yinhong Liu · Yingjia Wan · Jingyao Li · Pengguang Chen · Jianbo Dai · Yuxuan Yao · Rongwu Xu · Zehan Qi · Wanru Zhao · Linling Shen · Jianqiao Lu · Haochen Tan · Yukang Chen · Hao Zhang · Zhan Shi · Bailin Wang · Zhijiang Guo · Jiaya Jia
Affinity Event
GPTCodeval: An Empirical Evaluation Benchmark for Code Generation Using Language Models
Shreya Rajpal · Anbarasi Masilamani · Siva Shanmugam Gopal
Poster
Thu 16:30 DARG: Dynamic Evaluation of Large Language Models via Adaptive Reasoning Graph
Zhehao Zhang · Jiaao Chen · Diyi Yang