I study reinforcement learning for social good.
I am currently a resident at OpenAI, interested in applying RL for social good. I am also a co-founder of the nonprofit Humanity Unleashed, and am on leave from a CS PhD at Harvard. Other areas of interest include alignment and the principles of intelligence.
I am currently mentored by Tong Mu and Alec Helyar at OpenAI, and work on Lillian Weng's Safety and Alignment team. I am also very fortunate to be advised by the wonderful Professor Milind Tambe, and grateful to have worked closely with Chuang Gan at MIT-IBM Watson, Amy Zhang at Meta, and William Wang at UCSB.
Led three disparate research projects: CFPI,
LCD, and an unreleased hierarchical RL project
Teaching Assistant for CS 165B (Machine Learning)
GPA: 3.96
High Honors
Regents Scholar (top 2.5% of school)
Relevant coursework: Convex Optimization, Game Theory, Advanced Linear Algebra, Differential Geometry,
Statistical Machine Learning, Special Topics in Deep Learning
AlphaGo Zero Reimplementation
Graph Theory w/ UCSB
BERT Lecture Summarization
Predicting Winners in League
3D graphics with React
It's like LinkedIn but Tinder
Green Uber
Connecting HS Students w/ College Students
I really like learning, and thinking about learning. I like spending time with people even more.
I love playing tennis (and losing miserably at it to my superior roommate), riding the BART, hating on Apple (sometimes while riding the BART), watching anime, and hunting dinosaurs. Haha just kidding on that last one
The credit assignment problem is an extremely interesting problem that appears in Reinforcement Learning and AI in general. Let's say that I play a game of chess, and make n moves in succession. At the end of the game, I get just one discrete feedback signal: the outcome of the game. How does one attribute the importance of each move to the outcome of the game? This is the credit assignment problem. For a more in-depth introduction to the topic I would recommend this paper from Minsky, starting from part 3 on page 10.
The reason I mention this here is because very little of my career credit should be attributed to me. I am eternally grateful to the following people for their kindness, support and guidance. Without them, I would have nothing. In order of recency (not importance): Jiachen Li, Chad Spensky, Shou Chaofan, Derren Slinde.