Skip to yearly menu bar Skip to main content


Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Yimeng Zhang ⋅ Ziyi Wang ⋅ Yuxuan Lu ⋅ Simon Zhan ⋅ Dakuo Wang

Abstract

Chat is not available.