- Ph.D. Student in Machine Learning and Control at Hybrid Robotics Lab, BAIR, UC Berkeley
- Working on Reinforcement Learning
Scalable RL, Skill Discovery and Search, Robust and Safe RL, Data-Efficient RL
Can agent performance almost surely monotone increase using any data stream?
Offline RL, Off2On RL, Off-Policy Q-Learning, Self/Unsupervised RL, Dynamical Systems