AInstein: Can AI Rediscover Scientific Concepts from First Principles?
Shambhavi Mishra · Gaurav Sahu · Marco Pedersoli · Laurent Charlin · Jose Dolz · Chris Pal
Abstract
Large language models have demonstrated remarkable capabilities across diverse tasks, yet a fundamental question remains: can these models genuinely rediscover complex scientific insights, or do they merely recite memorized information? We present AInstein, a novel framework for evaluating whether language models can derive established scientific concepts from first principles when stripped of domain-specific terminology. Rather than testing the recall of scientific facts, we reformulate landmark discoveries as conceptual puzzles, challenging models to reconstruct the underlying technical solutions independently.
Chat is not available.
Successful Page Load