Workshop
|
Sat 15:45
|
Auto-Evaluation with Few Labels through Post-hoc Regression
Benjamin Eyre · David Madras
|
|
Workshop
|
|
Investigating Implicit Bias in Large Language Models: A Large-Scale Study of Over 50 LLMs
Divyanshu Kumar · Umang Jain · Sahil Agarwal · Prashanth Harshangi
|
|
Workshop
|
|
Can Generic LLMs Help Analyze Child-Adult Interactions Involving Children with Autism in Clinical Observation?
Tiantian Feng · Anfeng Xu · Rimita Lahiri · Sudarsana Kadiri · Helen Tager-Flusberg · So Kim · Somer Bishop · Catherine Lord · Shrikanth Narayanan
|
|
Workshop
|
|
CausalBench: A Comprehensive Benchmark for Evaluating Causal Reasoning Capabilities of Large Language Models
Zeyu Wang
|
|
Workshop
|
|
Evaluating Chemistry Prompts for Large-Language Model Fine-Tuning
Carmelo Gonzales · Michael Pieler · Kevin Maik Jablonka · Santiago Miret
|
|
Workshop
|
|
GSR-Bench: A Benchmark for Grounded Spatial Reasoning Evaluation via Multimodal LLMs
Navid Rajabi · Jana Kosecka
|
|
Workshop
|
|
DynoClass: A Dynamic Table-Class Detection System Without the Need for Predefined Ontologies
Haonan Wang · Eugene Wu · Kechen Liu · Jiaxiang Liu
|
|
Workshop
|
|
Optimizing Fine-Tuning Efficiency: Gradient Subspace Tracking on Grassmann Manifolds for Large Language Models
Sahar Rajabi · Sirisha Rambhatla
|
|
Workshop
|
Sat 10:55
|
Expertise-Centric Prompting Framework for Financial Tabular Data Generation using Pre-trained Large Language Models
Subin Kim · Jungmin Son · Minyoung Jung · Youngjun Kwak
|
|
Workshop
|
|
Different Bias Under Different Criteria: Assessing Bias in LLMs with a Fact-Based Approach
Changgeon Ko · Jisu Shin · Hoyun Song · Jeongyeon Seo · Jong Park
|
|
Workshop
|
|
Can large language models reason about causal relationships in multimodal time series data?
Elizabeth Healey · Isaac S Kohane
|
|