Dynamic Planning and Learning under Recovering Rewards