During the past 15 years or so, I have worked on a series of seemingly distinct but ultimately related problems, including machine learning algorithms, generative modeling with neural networks, machine translation, language modeling, medical imaging, a bit of healthcare, protein modeling and a bit of drug discovery. I chose some of these problems intentionally; others I came to by pure serendipity. It was only in hindsight that these seemingly different problems turned out to be closely related from technical, social and personal perspectives. In this talk, I plan to offer a retrospective on my own choices, be they intentional or not, and to share my thoughts on what our discipline, which is sometimes called computer science, data science, machine learning or artificial intelligence, is.
The brain processes information through many layers of neurons. This deep architecture is representationally powerful, but it complicates learning by making it hard to identify the responsible neurons when a mistake is made. In machine learning, the backpropagation algorithm assigns blame to a neuron by computing exactly how it contributed to an error. To do this, it multiplies error signals by matrices consisting of all the synaptic weights on the neuron's axon and farther downstream. This operation requires a precisely choreographed transport of synaptic weight information, which is thought to be impossible in the brain. Here we present a surprisingly simple algorithm for deep learning, which assigns blame by multiplying error signals by random synaptic weights. We show that a network can learn to extract useful information from signals sent through these random feedback connections. In essence, the network learns to learn. We demonstrate that this new mechanism performs as quickly and accurately as backpropagation on a variety of problems and describe the principles which underlie its function. Our demonstration provides a plausible basis for how a neuron can be adapted using error signals generated at distal locations in the brain, and thus dispels long-held assumptions about the algorithmic constraints on learning in neural circuits.
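Since this abstract hinges on the contrast between backpropagation's weight transport and learning through fixed random feedback connections, a small numerical sketch may help make the mechanism concrete. This is not the authors' code: the two-layer architecture, tanh nonlinearity, layer sizes, learning rate, and toy regression data below are illustrative assumptions; the only essential point is that the error is sent back through a fixed random matrix B rather than through the transpose of the forward weights.

```python
# Minimal sketch of learning with random feedback weights ("feedback alignment"),
# contrasted with backpropagation, on a toy regression problem. All sizes and
# hyperparameters are illustrative assumptions, not taken from the talk.
import numpy as np

rng = np.random.default_rng(0)
n_in, n_hid, n_out = 30, 20, 10

# A random linear "teacher" generates the targets the student network must fit.
T = rng.standard_normal((n_out, n_in)) / np.sqrt(n_in)
X = rng.standard_normal((n_in, 512))
Y = T @ X

# Student weights, plus the fixed random feedback matrix B used in place of W2.T.
W1 = rng.standard_normal((n_hid, n_in)) * 0.1
W2 = rng.standard_normal((n_out, n_hid)) * 0.1
B = rng.standard_normal((n_hid, n_out)) * 0.1   # random, never updated
lr = 0.02

for step in range(2000):
    A1 = W1 @ X                   # hidden pre-activations
    H = np.tanh(A1)               # hidden activations
    Y_hat = W2 @ H                # linear output layer
    E = Y_hat - Y                 # error signal at the output

    # Backpropagation would compute W2.T @ E here; feedback alignment instead
    # routes the error back through the fixed random matrix B.
    delta_H = (B @ E) * (1.0 - H**2)

    W2 -= lr * (E @ H.T) / X.shape[1]
    W1 -= lr * (delta_H @ X.T) / X.shape[1]

print("final mean squared error:", np.mean(E**2))
```

In this sketch, replacing `B @ E` with `W2.T @ E` recovers ordinary backpropagation; running both variants and comparing the error curves is one simple way to see the "network learns to learn" behavior the abstract describes, with the forward weights drifting into rough alignment with the random feedback pathway.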
Deep neural networks have revolutionized artificial intelligence, yet their inner workings remain poorly understood. This talk presents mathematical analyses of the nonlinear dynamics of learning in several solvable deep network models, offering theoretical insights into the role of depth. These models reveal how learning algorithms, data structure, initialization schemes, and architectural choices interact to produce hidden representations that afford complex generalization behaviors. A recurring theme across these analyses is a neural race: competing pathways within a deep network vie to explain the data, with an implicit bias toward shared representations. These shared representations in turn shape the network’s capacity for systematic generalization, multitasking, and transfer learning. I will show how such principles manifest across diverse architectures—including feedforward, recurrent, and linear attention networks. Together, these results provide analytic foundations for understanding how environmental statistics, network architecture, and learning dynamics jointly structure the emergence of neural representations and behavior.
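To make the "neural race" picture concrete, here is a hedged numerical sketch, not code from the talk, of the kind of solvable model it refers to: a two-layer deep linear network trained by gradient descent on a target map with a few well-separated singular modes. The layer sizes, learning rate, small random initialization, and whitened-input loss are all illustrative assumptions; the qualitative point is that stronger modes are learned first, in distinct stages.

```python
# Sketch of learning dynamics in a two-layer deep *linear* network: under a
# small random initialization, the singular modes of the target map are learned
# sequentially, with the strongest mode winning the "race". All settings here
# are illustrative assumptions.
import numpy as np

rng = np.random.default_rng(1)
n_in, n_hid, n_out = 8, 8, 8

# Target linear map with three well-separated singular values.
U, _ = np.linalg.qr(rng.standard_normal((n_out, n_out)))
V, _ = np.linalg.qr(rng.standard_normal((n_in, n_in)))
s_true = np.array([4.0, 2.0, 1.0] + [0.0] * (n_in - 3))
Sigma = U @ np.diag(s_true) @ V.T            # input-output correlation matrix

W1 = rng.standard_normal((n_hid, n_in)) * 1e-3   # small init => stage-like learning
W2 = rng.standard_normal((n_out, n_hid)) * 1e-3
lr = 0.01

for step in range(2001):
    # Gradient descent on the whitened-input loss 0.5 * ||Sigma - W2 W1||_F^2.
    E = Sigma - W2 @ W1
    W1 += lr * (W2.T @ E)
    W2 += lr * (E @ W1.T)
    if step % 250 == 0:
        sv = np.linalg.svd(W2 @ W1, compute_uv=False)[:3]
        print(f"step {step:4d}  top singular values of W2 W1: {np.round(sv, 3)}")
```

Printed singular values of the product W2 W1 rise toward the target values 4, 2, 1 one after another rather than all at once, which is the one-dimensional shadow of the competition among pathways and the resulting bias toward shared, low-rank representations described in the abstract.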