About me

I am a Ph.D. student at the RLAI lab at the University of Alberta, supervised by Dr. Martha White and Dr. Adam White. Currently, I am working on improving an online learning agent using offline data.

My research interests:
  • - Reinforcement Learning
  • - Offline to Online
  • - Representation Learning


  1. Doctor of Philosophy in Computing Science

    Sept 2020 - Present
    Expected graduation: Summer 2024

    Supervised by Dr. Martha White and Dr. Adam White
    RLAI lab, University of Alberta
    Reinforcement learning, Offline to online

  2. Master of Science in Computing Science

    Sept 2017 - Sept 2020

    Supervised by Dr. Martha White and Dr. Adam White
    RLAI lab, University of Alberta
    Reinforcement learning, Representation learning

  3. Bachelor of Science with Honors in Computing Science

    Sept 2013 - June 2017

    University of Alberta
    Graduated with first class honors


  1. The In-Sample Softmax for Offline Reinforcement Learning

    ICLR, 2022

    Chenjun Xiao*, Han Wang*, Yangchen Pan, Adam White, Martha White

  2. No More Pesky Hyperparameters: Offline Hyperparameter Tuning for RL

    TMLR, 2022

    Han Wang*, Archit Sakhadeo*, Adam White, James Bell, Vincent Liu, Xutong Zhao, Puer Liu, Tadashi Kozuno, Alona Fyshe, Martha White

  3. Investigating the Properties of Neural Network Representations in Reinforcement Learning

    Under Review, 2022

    Han Wang, Erfan Miahi, Martha White, Marlos C. Machado, Zaheer Abbas, Raksha Kumaraswamy, Vincent Liu, Adam White

  4. Measuring and Mitigating Interference in Reinforcement Learning

    CoLLAs, 2023

    Vincent Liu, Han Wang, Ruo Yu Tao, Khurram Javed, Adam White, Martha White

  5. Avoiding Model Estimation in Robust Markov Decision Processes with a Generative Model

    Under Review, 2023

    Wenhao Yang, Han Wang, Tadashi Kozuno, Scott M. Jordan, Zhihua Zhang

  6. Replay Memory as An Empirical MDP: Combining Conservative Estimation with Experience Replay

    ICLR, 2022

    Hongming Zhang, Chenjun Xiao, Han Wang, Jun Jin, Bo Xu, Martin Müller

  7. In-sample Sparsemax for Offline Reinforcement Learning by Tsallis Regularization

    Under Review, 2023

    Lingwei Zhu, Matthew Kyle Schlegel, Han Wang, Martha White

  8. Improving Deep Reinforcement Learning with Empirical Bellman Consistency

    Under Review, 2023

    Hongming Zhang, Chenjun Xiao, Chao Gao, Han Wang, Bo Xu, Martin Müller


  1. Emergent Representations in Reinforcement Learning and Their Properties

    M.Sc., 2020

Work History

  1. Research Intern

    May 2022 - Dec 2022

    Offline reinforcement learning
    Noah's Ark Lab, Huawei Technologies Canada Co., Ltd.

  2. Teaching Assistant

    Winter 2022

    CMPUT 267 - Basics of Machine Learning
    University of Alberta

  3. Teaching Assistant

    Fall 2018, Fall 2017, Fall 2016

    CMPUT 366 - Intelligent Systems, University of Alberta
    University of Alberta

  4. Programmer

    July 2016

    Henan Yufa Property Limited Company