Hi! I am a PhD candidate in the CILVR lab at NYU Courant advised by Mengye Ren and supported by the NDSEG fellowship. I am also interning at Meta FAIR working on unified multimodal models and visual tokenization.
I am broadly interested in advancing the visual perception, decision-making, and self-improvement capabilities of AI systems to enable them to continuously operate in the complex real world, for the purpose of amplifying human productivity and well-being. Towards this end, I am focused on the following research directions:
- Learning world models from raw video and uncurated interaction data and using them as simulators, planners, and co-training objectives
- Enabling self-improvement via exploration, self-supervised learning, and verification
Previously, I interned at Meta FAIR on the computer use agents team. Before starting my PhD, I worked on the systematic equities research team at The Voleon Group. I completed my joint BS/MS in computer science at the University of Michigan. There, I was fortunate to work with Honglak Lee on deep reinforcement learning and representation learning, and Michael P. Wellman on multi-agent systems.
|
|
|
April 2026 |
I will join Meta's AI Mentorship Program as a visiting researcher this fall to work on world models and video learning. |
|
|
I started my internship at Meta FAIR working on unified pretraining and visual tokenization mentored by Jakob Verbeek. |
|
January 2026 |
Midway Network is accepted to ICLR 2026. |
|
May 2025 |
I started my internship Meta FAIR working on computer use agents mentored by Joseph Tighe and Pierluca D'Oro. |
|
February 2025 |
PooDLe is accepted to ICLR 2025. |
|
|
![]() |
Chris Hoang, Mengye Ren ICLR 2026 project page / arXiv Midway Network is a new self-supervised learning architecture that is the first to learn visual representations for both object recognition and motion prediction solely from natural videos by leveraging latent dynamics. |
![]() |
Azwar Abdulsalam, Chris Hoang, Mengye Ren ICLR 2026 2nd Workshop on World Models project page LaMo is a structured latent dynamics model that enables long-horizon prediction by autoregressively predicting latent motion tokens used to advance a visual scene's latent state over time. |
![]() |
Junyeob Baek, Hosung Lee, Chris Hoang, Mengye Ren, Sungjin Ahn ICML Tokenization Workshop 2025 paper We introduce Discrete-JEPA, which learns discrete semantic tokens for improved symbolic reasoning and long-horizon planning. |
|
Alex N. Wang*, Chris Hoang*, Yuwen Xiong, Yann LeCun, Mengye Ren ICLR 2025 project page / arXiv We propose a self-supervised learning framework that combines pooled and dense objectives to learn representations with spatial and semantic understanding from naturalistic videos. |
|
Chris Hoang, Sungryull Sohn, Jongwook Choi, Wilka Carvalho, Honglak Lee NeurIPS 2021 NeurIPS Workshop on Deep Reinforcement Learning 2020 project page / arXiv / video We leverage successor features to formulate a graph-based planning framework and goal-conditioned policy, enabling long-horizon goal-reaching in visual environments. |
![]() |
Xintong Wang, Chris Hoang, Yevgeniy Vorobeychik, Michael P. Wellman, Games 2021 paper We use an agent-based model and empirical game-theoretic analysis to study price manipulation in financial markets and propose mitigation mechanisms. |
![]() |
Xintong Wang, Chris Hoang, Michael P. Wemman, ICAIF 2020 ICML Workshop on AI in Finance 2019 paper We design learning-based trading strategies that have improved robustness to market manipulation and evaluate them with agent-based simulation. |
