Huong Ngo 👩‍💻

I'm looking for my next challenge! I was a Member of Technical Staff at Vercept (acquired by Anthropic) working on data curation, model training and building evaluations for computer-use agents, and a Predoctoral Young Investigator at Allen Institute for AI's PRIOR Team working on automatic speech recognition and data-centric machine learning (Check out OLMoASR). I graduated from the University of Washington with a B.S in Data Science and Statistics and was fortunate to be advised by Matt Deitke and Ludwig Schmidt.

My research interests center on data-centric machine learning and multimodality — specifically, how the quality, scale, and structure of training data shapes what models can learn. I've worked on this across domains, from large-scale speech data curation for open ASR systems to data collection and filtering pipelines for computer-use agents. More recently, I've become increasingly interested in agent systems themselves: building agents that can perceive their surroundings, reason about them, and execute actions to accomplish tasks — focusing on understanding what data it takes to get there. More importantly, I want to build agents that can handle long-term planning and decision-making and learn from experiences to improve over time.

Email  /  CV (as of early 2026)  /  Scholar  /  Twitter  /  Github

profile photo

News

  • OLMoASR is now on arXiv - Check out the paper and the code!
  • Excited to be joining Vercept as a Member of Technical Staff!
  • Starting as a Predoctoral Young Investigator at the Allen Institute for AI's PRIOR Team!
  • Starting my internship with Matt Deitke at AI2 on the PRIOR Team!

Publications, projects, preprints

OLMoASR: Open Models and Data for Training Robust Speech Recognition Models
Huong Ngo, Matt Deitke, Martijn Bartelds, Sarah Pratt, Josh Gardner, Matt Jordan, Ludwig Schmidt
arXiv, 2025
arxiv / code / website

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models
Matt Deitke, Christopher Clark, Sangho Lee, Rohun Tripathi, Yue Yang, Jae Sung Park, Mohammadreza Salehi, Niklas Muennighoff, Kyle Lo, Luca Soldaini, Jiasen Lu, Taira Anderson, Erin Bransom, Kiana Ehsani, Huong Ngo, YenSung Chen, Ajay Patel, Mark Yatskar, Chris Callison-Burch, Andrew Head, Rose Hendrix, Favyen Bastani, Eli VanderBilt, Nathan Lambert, Yvonne Chou, Arnavi Chheda, Jenna Sparks, Sam Skjonsberg, Michael Schmitz, Aaron Sarnat, Byron Bischoff, Pete Walsh, Chris Newell, Piper Wolters, Tanmay Gupta, Kuo-Hao Zeng, Jon Borchardt, Dirk Groeneveld, Jen Dumas, Crystal Nam, Sophie Lebrecht, Caitlin Wittlif, Carissa Schoenick, Oscar Michel, Ranjay Krishna, Luca Weihs, Noah A. Smith, Hannaneh Hajishirzi, Ross Girshick, Ali Farhadi, Aniruddha Kembhavi
arXiv, 2024
arxiv / code / website

Objaverse-XL: A Universe of 10M+ 3D Objects
Matt Deitke, Ruoshi Liu, Matthew Wallingford, Huong Ngo*, Oscar Michel, Aditya Kusupati, Alan Fan, Christian Laforte, Vikram Voleti, Samir Yitzhak Gadre, Eli VanderBilt, Aniruddha Kembhavi, Carl Vondrick, Georgia Gkioxari, Kiana Ehsani, Ludwig Schmidt,, Ali Farhadi,
NeurIPS Datasets and Benchmarks Track, 2023
arxiv / code / website

Text2Midi: Generating Symbolic Music Representation from Text
Huong Ngo*, Alan Fan, Daksh Sinha, Nicholas Boren
code

Gehirn: Automated Generation of Symbolic Music Representation Datasets
Huong Ngo*, Alan Fan, Daksh Sinha, Nicholas Boren
code

More

  • I was born in Hanoi, Vietnam but moved to the US in 2019 after finishing secondary school in Singapore!

Feel free to steal this website's source code.