Session 4: Evaluating core reasoning for reliable planning tasks
Harsha Kokel
Abstract
- Core reasoning tasks for planning
- Datasets for evaluations
- Planning benchmark desiderata