Jesse Dodge

Welcome!

I am a research scientist in Meta Superintelligence Labs. I was at the Allen Institute for AI from 2019 to 2025. I was the lead of the evaluation team for OLMo, and I was the research lead of Playground, where you can interact with Ai2's recent models and use OLMoTrace to trace the output of our models back to their training data in real time. I work on building large language models like OLMo, evaluating large language models throughout training, creating and documenting the contents of web-scale pretraining datasets like Dolma, the environmental impact of AI, and improving transparency and reproducibility in the research community.

You can find a bio on my About Me page.

News, Recognition, Awards

Spotlight Presentation (top 3.2%, Main track) at NeurIPS 2025 for Signal and Noise
Spotlight Presentation (Datasets & Benchmarks track) at NeurIPS 2025 for SciArena
Spotlight Presentation (12th highest review score paper) at COLM 2025 for Fluid Language Model Benchmarking
Best Demo Paper at ACL 2025 for OLMoTrace
Spotlight Presentation (top 5%) at ICLR 2025 for Evaluating the Environmental Impact of LLMs
Best Resource Paper at ACL 2024 for Dolma, our LLM pretraining corpus
Best Theme Paper at ACL 2024 for OLMo, our LLM
Spotlight Presentation (top 5%) at ICLR 2024 for What's In My Big Data, a toolkit for understanding pretraining datasets
10-year Test-of-Time Paper at ACL 2022 for our vision+language system Midge
Best Student Paper at NAACL 2015 for Retrofitting

Welcome!

News, Recognition, Awards

Featured Talks

Talk Title

Talk Title

Talk Title

Talk Title

Featured Press