Department of Mathematics - Seminar on Statistics - Online Estimation and Inference for Robust Policy Evaluation in Reinforcement Learning | School of Science - The Hong Kong University of Science and Technology

18 Jul 2023

4:00pm - 5:00pm

Seminar, Lecture, Talk

In this paper, we propose a robust policy evaluation algorithm for reinforcement learning that accommodates outlier contamination and heavy-tailed reward distributions. We further develop a fully online method for statistical inference on the model parameters. Our method converges to the minimum asymptotic variance faster than classical temporal difference (TD) learning and avoids step-size selection. Numerical experiments on real-world reinforcement learning tasks demonstrate the efficiency and robustness of our approach compared with the existing online bootstrap method. This is joint work with Jiyuan Tu (SUFE), Xi Chen (NYU), and Weidong Liu (SJTU).

Where

Room 2303 (Lifts 17/18)

Speakers/Performers

Prof. Yichen ZHANG
Purdue University

Organizer(s)

Department of Mathematics

Audience

Alumni, Faculty and staff, PG students, UG students

Language(s)

English
