Workshop
|
|
Q-Morality: Quantum-Enhanced ActAdd-Guided Bias Reduction in LLMs
Shardul Kulkarni
|
|
Workshop
|
|
LLMs Infer Protected Attributes Beyond Proxy Features
Dimitri Staufer
|
|
Workshop
|
Sat 17:27
|
Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset
Khaoula Chehbouni · Jonathan Colaço Carr · Yash More · Jackie CK Cheung · Golnoosh Farnadi
|
|
Workshop
|
Sat 17:27
|
Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models
Kefan Song · Jin Yao · Shangtong Zhang
|
|
Workshop
|
Sat 12:00
|
A Step Towards Mixture of Grader: Statistical Analysis of Existing Automatic Evaluation Metrics
Yun Joon Soh · Jishen Zhao
|
|
Workshop
|
Sat 17:27
|
Understanding The Effect Of Temperature On Alignment With Human Opinions
Maja Pavlovic · Massimo Poesio
|
|
Workshop
|
Sat 17:27
|
Counterpart Fairness – Addressing Systematic Between-Group Differences in Fairness Evaluation
Yifei Wang · Zhengyang Zhou · Liqin Wang · John Laurentiev · Peter Hou · Li Zhou · Pengyu Hong
|
|
Workshop
|
|
Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset
Shankar Kumar Jeyakumar · Alaa Ahmad · Adrian Gabriel
|
|