NeurIPS 2024

59 Results

Workshop		Q-Morality: Quantum-Enhanced ActAdd-Guided Bias Reduction in LLMs Shardul Kulkarni
Workshop		LLMs Infer Protected Attributes Beyond Proxy Features Dimitri Staufer
Workshop		Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset Khaoula Chehbouni · Jonathan Colaço Carr · Yash More · Jackie CK Cheung · Golnoosh Farnadi
Workshop	Sat 17:27	Beyond the Safety Bundle: Auditing the Helpful and Harmless Dataset Khaoula Chehbouni · Jonathan Colaço Carr · Yash More · Jackie CK Cheung · Golnoosh Farnadi
Workshop	Sat 17:27	Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models Kefan Song · Jin Yao · Shangtong Zhang
Workshop		Toward Large Language Models that Benefit for All: Benchmarking Group Fairness in Reward Models Kefan Song · Jin Yao · Shangtong Zhang
Workshop	Sat 12:00	A Step Towards Mixture of Grader: Statistical Analysis of Existing Automatic Evaluation Metrics Yun Joon Soh · Jishen Zhao
Workshop		Understanding The Effect Of Temperature On Alignment With Human Opinions Maja Pavlovic · Massimo Poesio
Workshop	Sat 17:27	Understanding The Effect Of Temperature On Alignment With Human Opinions Maja Pavlovic · Massimo Poesio
Workshop	Sat 17:27	Counterpart Fairness – Addressing Systematic Between-Group Differences in Fairness Evaluation Yifei Wang · Zhengyang Zhou · Liqin Wang · John Laurentiev · Peter Hou · Li Zhou · Pengyu Hong
Workshop		Counterpart Fairness – Addressing Systematic Between-Group Differences in Fairness Evaluation Yifei Wang · Zhengyang Zhou · Liqin Wang · John Laurentiev · Peter Hou · Li Zhou · Pengyu Hong
Workshop		Advancing Agentic Systems: Dynamic Task Decomposition, Tool Integration and Evaluation using Novel Metrics and Dataset Shankar Kumar Jeyakumar · Alaa Ahmad · Adrian Gabriel