Skip to yearly menu bar Skip to main content


Prompt Genotyping: Quantifying the Evaluation Gap Between Synthetic Benchmarks and Real LLM Performance

Sohum Mehta · Saaketh Bhojanam

Abstract

Chat is not available.