Otilia Stretcu
Mountain View, California, USA

I am a Senior Research Scientist at Google Research working on core machine learning methodology—primarily around vision-language models, data mining, active learning, and safety—with applications in various areas including computer vision and natural language processing.
My current research focuses on two key problems:
- Enabling domain experts to effectively build models without requiring AI expertise. Training AI models for specialized domains typically requires curating large datasets, which is expensive, time-consuming and usually demands AI expertise. My research removes that barrier by automating the process of mining the right kinds of data and iteratively improving the model, and spans multiple areas including modeling, active learning and distillation.
- Improving AI Trust & Safety models to make the internet safer. As powerful LLMs become more accessible, their potential for malicious use (like generating online abuse/threats) grows. My research flips this around, leveraging LLM capabilities to detect and counteract malicious activities. This involves fundamental research aimed at expanding the reasoning capabilities of LLMs over multimodal data, to better identify sophisticated harmful content and malicious behaviors.
If either or both of these problems excite you, feel free to reach out!
Previously, I obtained my PhD from the Machine Learning Department at Carnegie Mellon University, co-advised by Tom Mitchell and Barnabàs Pòczos. My PhD research focused on developing algorithms for machine learning, mainly focused on semi-supervised learning, curriculum learning, multitask learning, and graph-based problems. I am also passionate about applying machine learning methods in neuroscience, in order to study how the brain understands language and controls speech. Previously, I did some research in Computer Vision, with the goal of detecting and tracking objects in videos.
Before I joined CMU, I graduated with an M.Phil. in Advanced Computer Science from the University of Cambridge, UK. In my Master's thesis I used Machine Learning methods to detect and align chromosomes in microscope images, advised by Pietro Lió.
news
I will be teaching a lecture on Vision-Language Models at the Eastern European Machine Learning Summer School in Sarajevo, Bosnia and Herzegovina. Join us!
I will be giving a talk at the ICLR 2025 workshop “I Can’t Believe It’s Not Better: Challenges in Applied Deep Learning”.
Our work on Visual Program Distillation: Distilling Tools and Programmatic Reasoning into Vision-Language Models was selected for oral presentation at the CVPR 2024.
Our work on Agile Modeling: From Concept to Classifier in Minutes was nominated for best paper award at the NeurIPS 2023 Workshop on Adaptive Experimental Design and Active Learning in the Real World.
I joined Google as full-time research scientist.
Our work on meaning representation in the brain was accepted at NeurIPS 2020.
Our work on graph-agreement models for semi-supervised learning was accepted at NeurIPS 2019.
Organizing a workshop on Adaptive & Multitask Learning at ICML 2019.
Our work on curriculum learning for machine translation was accepted for oral presentation at NAACL 2019.
Thank you CMLH for the fellowship in digital health in support of our work on Parkinson’s disease.