Expo Talk Panel
|
Tue 16:00
|
AgentInstruct: Agentic flows are effective synthetic-data generators
Elaina Robinson · Arindam Mitra · Yash Lara
|
|
Poster
|
Thu 16:30
|
GREATS: Online Selection of High-Quality Data for LLM Training in Every Iteration
Jiachen (Tianhao) Wang · Tong Wu · Dawn Song · Prateek Mittal · Ruoxi Jia
|
|
Poster
|
Wed 16:30
|
CLUES: Collaborative Private-domain High-quality Data Selection for LLMs via Training Dynamics
Wanru Zhao · Hongxiang Fan · Shell Xu Hu · Wangchunshu Zhou · Nicholas Lane
|
|
Poster
|
Fri 16:30
|
Persistent Homology for High-dimensional Data Based on Spectral Methods
Sebastian Damrich · Philipp Berens · Dmitry Kobak
|
|
Workshop
|
|
Instruct-SkillMix: A Powerful Pipeline for LLM Instruction Tuning
Simran Kaur · Simon Park · Anirudh Goyal · Sanjeev Arora
|
|
Poster
|
Fri 11:00
|
Learning from Highly Sparse Spatio-temporal Data
Leyan Deng · Chenwang Wu · Defu Lian · Enhong Chen
|
|
Workshop
|
Sun 15:45
|
Data-Driven High-Dimensional Inverse Problems: A Journey Through Strong Gravitational Lensing Data Analysis
Laurence Perreault-Levasseur
|
|
Poster
|
|
MaskFactory: Towards High-quality Synthetic Data Generation for Dichotomous Image Segmentation
Haotian Qian · Yinda Chen · Shengtao Lou · Fahad Shahbaz Khan · Xiaogang Jin · Deng-Ping Fan
|
|
Workshop
|
|
Self-Data Distillation for Recovering Quality in Pruned Large Language Models
Vithursan Thangarasa · Ganesh Venkatesh · Nish Sinnadurai · Sean Lie
|
|
Workshop
|
Sat 14:30
|
Alan Yuille (Johns Hopkins University): Supervision of 3D-aware Models by Synthetic Data
Alan Yuille
|
|
Poster
|
Wed 11:00
|
SHDocs: A dataset, benchmark, and method to efficiently generate high-quality, real-world specular highlight data with near-perfect alignment
Jovin Leong · Koa Di · Benjamin Cham · Shaun Heng
|
|
Workshop
|
Sat 14:20
|
Invited Talk 3 - Scaling LLMs with Synthetic Data Loops
Suchin Gururangan
|
|