Oscar Michel's Webpage

About Me

I am a second-year PhD student at NYU Courant advised by Prof. Saining Xie. My research is in world models.

Recently, I interned at NVIDIA working on reward models for COSMOS in the Deep Imagination Research group. Before that, I was a predoctoral researcher at the Allen Institute for AI working primarily with Dr. Tanmay Gupta and Dr. Ani Kembhavi.

I was an undergraduate at the University of Chicago, where I received a degree in mathematics. There I was fortunate to work with Prof. Rana Hanocka and Prof. Michael Maire.

Selected Publications

Solaris: Building a Multiplayer Video World Model in Minecraft
Oscar Michel*, Georgy Savva*, Daohan Lu*, Suppakit Waiwitlikhit, Timothy Meehan, Dhairya Mishra, Srivats Poddar, Jack Lu, Saining Xie
arXiv 2025

A multiplayer video world model in Minecraft and an engine for generating multiplayer gameplay data.

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Matt Deitke, Christopher Clark, et al.
CVPR 2025 (Best Paper Honorable Mention)

A family of open multimodal models that are as good as the best commercial ones. My contribution was creating a novel evaluation benchmark for testing fine-grained world knowledge in VQA.

OBJect 3DIT: Language-guided 3D-aware Image Editing
Oscar Michel, Anand Bhattad, Eli VanderBilt, Ranjay Krishna, Ani Kembhavi, Tanmay Gupta
NeurIPS 2023

Using synthetic data to give diffusion models intuitive controls that enable geometrically accurate 3D manipulations of objects in images.

Objaverse: A Universe of Annotated 3D Objects
Matt Deitke, Dustin Schwenk, Jordi Salvador, Luca Weihs, Oscar Michel, Eli VanderBilt, Ludwig Schmidt, Kiana Ehsani, Aniruddha Kembhavi, Ali Farhadi
CVPR 2023

Objaverse is a large dataset of 800k+ 3D models.

Text2Mesh: Text-Driven Neural Stylization for Meshes
Oscar Michel*, Roi Bar On*, Richard Liu*, Sagie Benaim, Rana Hanocka
CVPR 2022 (Oral)

Text2Mesh is an algorithm for language-guided stylization of a 3D object.

Awards and Honors