Publications
Horizon-Free and Instance-Dependent Regret Bounds for Reinforcement Learning with General Function Approximation
Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang
AISTATS 2024 [Arxiv] [Code]Tackling Heavy-Tailed Rewards in Reinforcement Learning with Function Approximation: Minimax Optimal and Instance-Dependent Regret Bounds
Jiayi Huang, Han Zhong, Liwei Wang, Lin F. Yang
NeurIPS 2023 [Arxiv] [Code]Breaking the Moments Condition Barrier: No-Regret Algorithm for Bandits with Super Heavy-Tailed Payoffs
Han Zhong, Jiayi Huang, Lin F. Yang, Liwei Wang
NeurIPS 2021 [Arxiv] [Code]