Shi Dong (董仕)
AbstractI received my PhD from Stanford University, and my doctoral research, advised by Prof. Benjamin Van Roy, develops a general and versatile framework to study reinforcement learning. Motivated by a passion for natural languages, I am dedicated to understanding and unleashing the power of reinforcement learning in large language models (LLMs). Prior to joining RadixArk, I have spent time in xAI, Google DeepMind, and the Knowledge and Language Team, Microsoft. Beyond research, I perform in theater and am a devout fan of British history and culture. Professional Experience
Education
Invited Talks
Honors
Activities
Languages
|