Huong Ngo 👩‍💻

I'm a Member of Technical Staff at Vercept working on data curation, model training and building evaluations for computer-use agents. I was a Predoctoral Young Investigator at Allen Institute for AI's PRIOR Team working on automatic speech recognition and data-centric machine learning (Check out OLMoASR). I graduated from the University of Washington with a B.S in Data Science and Statistics and was fortunate to be advised by Matt Deitke and Ludwig Schmidt.

My research interests centers around large-scale machine learning with a focus on data-centric and multimodal approaches. I'm particularly excited about building agents with the ability to perceive and understand their surroundings, reason about them to plan and execute actions to accomplish tasks. I'm also broadly interested in how to build agents that can handle long-term planning and decision-making and learn from experiences to improve over time.

Email  /  CV (as of early 2026)  /  Scholar  /  Twitter  /  Github

profile photo

News

  • OLMoASR is on arXiv - Check out the paper and the code!
  • Excited to be joining Vercept as a Member of Technical Staff!
  • Starting as a Predoctoral Young Investigator at the Allen Institute for AI's PRIOR Team!
  • Starting my internship with Matt Deitke at AI2 on the PRIOR Team!

Publications, projects, preprints

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
Huong Ngo, Matt Deitke, Martijn Bartelds, Sarah Pratt, Josh Gardner, Matt Jordan, Ludwig Schmidt
arXiv, 2025
arxiv / code / website

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, YenSung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross Girshick, Ali Farhadi, Aniruddha Kembhavi
arXiv, 2024
arxiv / code / website

Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke, Ruoshi Liu, Matthew Wallingford, Huong Ngo*, Oscar Michel, Aditya Kusupati, Alan Fan, Christian Laforte, Vikram Voleti, Samir Yitzhak Gadre, Eli VanderBilt, Aniruddha Kembhavi, Carl Vondrick, Georgia Gkioxari, Kiana Ehsani, Ludwig Schmidt,, Ali Farhadi,
NeurIPS Datasets and Benchmarks Track, 2023
arxiv / code / website

Text2Midi: Generating Symbolic Music Representation from Text
Huong Ngo*, Alan Fan, Daksh Sinha, Nicholas Boren
code

Gehirn: Automated Generation of Symbolic Music Representation Datasets
Huong Ngo*, Alan Fan, Daksh Sinha, Nicholas Boren
code

More

  • I was born in Hanoi, Vietnam but moved to the US in 2019 after finishing secondary school in Singapore!

Feel free to steal this website's source code.