a. Statistical Learning in Operations
A Simple and Optimal Policy Design with Safety against Heavy-tailed Risk for Multi-armed Bandits. NeurIPS 2022.
Offline Planning and Online Learning under Recovering Rewards. Management Science, 2023.
Stochastic Multi-armed Bandits: Optimal Trade-off among Optimality, Consistency, and Tail Risk. NeurIPS 2023 Spotlight (top 3%).