Harvard Machine Learning Foundations Group

We are a research group focused on some of the foundational questions in modern machine learning. We are interested in both experimental and theoretical approaches that advance our understanding. Our group contains ML practitioners, theoretical computer scientists, statisticians, and neuroscientists, all sharing the goal of placing machine and natural learning on firmer foundations, and elucidating their fundamental capabilities and limitations.

Our group organizes the Kempner Seminar Series - a research seminar on the foundations of both natural and artificial learning. See mailing list, Google calendar , and list of talks.

Opportunities: We are looking for graduate students and postdocs. See opportunities section below. Announcements on positions will also be posted on social media.

People

Researchers

Gustaf Ahdritz

PhD Student

Alex Atanasov

PhD Student

Demba Ba

Faculty

Boaz Barak

Faculty

Blake Bordelon

PhD Student

Chi-Ning Chou

PhD Student

Ben Edelman

PhD Student

Yang Hu

PhD Student

Lucas Janson

Faculty

Samy Jelassi

Postdoctoral Fellow

Sham Kakade

Faculty

Seth Neel

Faculty

Gal Kaplun

PhD Student

Anat Kleiman

PhD Student

Depen Morwani

PhD Student

Costin-Andrei Oncescu

PhD Student

Cengiz Pehlevan

Faculty

Aayush Karan

PhD Student

David Brandfonbrener

Postdoctoral Fellow

Yonadav Shavit

PhD Student

Sunny Qin

PhD Student

Nikhil Vyas

Postdoctoral Fellow

Sara Kangaslahti

PhD Student

Roy Rinberg

PhD Student

Natalie Abreu

PhD Student

Clara Mohri

PhD Student

Hanlin Zhang

PhD Student

Mary Letey

PhD Student

Jonathan Geuter

PhD Student

David Alvarez Melis

Faculty

Sitan Chen

Faculty

Sheng Yang

Masters Student

Jacob Zavatone-Veth

PhD Student

Rosie Zhao

PhD Student

Runyu (Cathy) Zhang

PhD Student

Affiliated

Finale Doshi-Velez

Faculty

Hima Lakkaraju

Faculty

Yue Lu

Faculty

Na Li

Faculty

Michael Mitzenmacher

Faculty

Morgane Austern

Faculty

Emeritus

Preetum Nakkiran

PhD Student

Dimitris Kalimeris

PhD Student

Yamini Bansal

PhD Student

Sharon Qian

PhD Student

Tristan Yang

Undergraduate

Fred Zhang

PhD Student

Recent Publications

By our group and its members.

(This list is not comprehensive. Also, we’re sometimes slow in updates—see individual homepages and the arXiv for the latest publications.)

Hidden Progress in Deep Learning: SGD Learns Parities Near the Computational Limit

Boaz Barak, Benjamin L. Edelman, Surbhi Goel, Sham Kakade, Eran Malach, Cyril Zhang

NeurIPS 2022

Contrasting random and learned features in deep Bayesian linear regression

Jacob A. Zavatone-Veth, William L. Tong, Cengiz Pehlevan

Manuscript 2022

Deconstructing Distributions: A Pointwise Framework of Learning

Gal Kaplun, Nikhil Ghosh, Saurabh Garg, Boaz Barak, Preetum Nakkiran

Manuscript 2022

Depth induces scale-averaging in overparameterized linear Bayesian neural networks

Jacob A. Zavatone-Veth, Cengiz Pehlevan

55th Asilomar Conference 2021

Neural Networks as Kernel Learners: The Silent Alignment Effect

Alexander Atanasov*, Blake Bordelon*, Cengiz Pehlevan

ICLR 2022

Inductive Biases and Variable Creation in Self-Attention Mechanisms

Benjamin Edelman, Surbhi Goel, Sham Kakade, Cyril Zhang

ICML 2022.

Capacity of Group-invariant Linear Readouts from Equivariant Representations: How Many Objects can be Linearly Classified Under All Possible Views?

Matthew Farrell, Blake Bordelon, Shubhendu Trivedi, Cengiz Pehlevan

ICLR 2022

Revisiting Model Stitching to Compare Neural Representations

Yamini Bansal, Preetum Nakkiran, Boaz Barak

NeurIPS 2021

Learning Curves for SGD on Structured Features

Blake Bordelon, Cengiz Pehlevan

ICLR 2022

Out-of-Distribution Generalization in Kernel Regression

Abdulkadir Canatar, Blake Bordelon, Cengiz Pehlevan

NeurIPS 2021

Asymptotics of Representation Learning in Finite Bayesian Neural Networks

Jacob A. Zavatone-Veth, Abdulkadir Canatar, Benjamin S. Ruben, Cengiz Pehlevan

NeurIPS 2021

For Self-supervised Learning, Rationality Implies Generalization, Provably

Yamini Bansal*, Gal Kaplun*, Boaz Barak

ICLR 2020.

The Deep Bootstrap: Good Online Learners are Good Offline Generalizers

Preetum Nakkiran, Behnam Neyshabur, Hanie Sedghi

ICLR 2020.

Distributional Generalization: A New Kind of Generalization

Preetum Nakkiran*, Yamini Bansal*

Manuscript 2020.

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Yonadav Shavit, Benjamin Edelman, Brian Axelrod.

ICML 2020.

Deep Double Descent: Where Bigger Models and More Data Hurt

Preetum Nakkiran, Gal Kaplun, Yamini Bansal, Tristan Yang, Boaz Barak, Ilya Sutskever.

ICLR 2020.

SGD on Neural Networks Learns Functions of Increasing Complexity

Preetum Nakkiran, Gal Kaplun, Dimitris Kalimeris, Tristan Yang, Benjamin L. Edelman, Fred Zhang, Boaz Barak.

NeurIPS 2019 spotlight talk (top 15% of accepted papers).

More Data Can Hurt for Linear Regression: Sample-wise Double Descent

Preetum Nakkiran.

Manuscript. 2019.

Computational Limitations in Robust Classification and Win-Win Results

Akshay Degwekar, Preetum Nakkiran, Vinod Vaikuntanathan.

COLT 2019.

Minnorm training: an algorithm for training over-parameterized deep neural networks

Yamini Bansal, Madhu Advani, David D Cox, Andrew M Saxe

Manuscript. 2019.

Adversarial Robustness May Be at Odds With Simplicity

Preetum Nakkiran.

Manuscript. 2019.

On the Information Bottleneck Theory of Deep Learning

Andrew Michael Saxe, Yamini Bansal, Joel Dapello, Madhu Advani, Artemy Kolchinsky, Brendan Daniel Tracey, David Daniel Cox

ICLR 2018.

Recent & Upcoming Talks

The ML Foundations Talks are now the Kempner Seminar Series organized by the ML Foundations Group. For more information about the series, see the line-up of speakers or visit the Kempner Institute events page.

Brian DePasquale

May 17, 2024 2:00 PM — 4:00 PM SEC LL2.224

Kim Stachenfeld - Predictive models for representation learning and simulation

In the context of deep learning, predictive models serve multiple purposes. One use is to drive representation learning, as the …

Apr 12, 2024 2:00 PM — 4:00 PM SEC LL2.224

Stefano Ermon - Score Entropy Discrete Diffusion Models

Diffusion models are at the core of many state-of-the-art generative AI systems for content such as images, videos, and audio. These …

Apr 5, 2024 2:00 PM — 4:00 PM SEC LL2.224

Andrea Montanari - Solving overparametrized systems of nonlinear equations

I will discuss the problem of solving a system of equations F(x)=0, for x a d-dimensional unit vectors and D a non-linear map from R^d …

Mar 22, 2024 2:00 PM — 4:00 PM SEC LL2.224

Tom Griffiths - Using the Tools of Cognitive Science to Understand the Behavior of Large Language Models

Large language models have been found to have surprising capabilities, even what have been called “sparks of artificial general …

Mar 15, 2024 2:00 PM — 4:00 PM SEC LL2.224

Larry Abbott - Modeling the Navigational Circuitry of the Fly

Navigation requires orienting oneself relative to landmarks in the environment, evaluating relevant sensory data, remembering goals, …

Feb 23, 2024 2:00 PM — 4:00 PM SEC LL2.224

Rajesh Rao - Active Predictive Coding: A Sensory-Motor Theory of the Neocortex and a Unifying Framework for AI

Recent neurophysiological experiments indicate that almost all cortical areas, even those traditionally labelled as primary sensory …

Feb 16, 2024 2:00 PM — 4:00 PM SEC LL2.224

Noam Brown - CICERO: Human-Level Performance in the Game of Diplomacy by Combining Language Models with Strategic Reasoning

In this talk I will describe CICERO, the first AI agent to achieve human-level performance in Diplomacy, a strategy game involving both …

Jan 26, 2024 2:00 PM — 4:00 PM SEC LL2.224

Carsen Stringer - Unsupervised pretraining in biological neural networks

Representation learning in neural networks may be implemented with supervised or unsupervised algorithms, distinguished by the presence …

Nov 3, 2023 2:00 PM — 4:00 PM SEC LL2.224

Emmanuel Abbe - Logic Reasoning and Generalization on the Unseen

Transformers have become the dominant neural network architecture in deep learning. While they are state of the art in language and …

Oct 13, 2023 2:00 PM — 4:00 PM SEC LL2.224

Denny Zhou - Teach language models to reason

Over the past decades, the machine learning community has developed tons of data-driven techniques aimed at enhancing learning …

Sep 15, 2023 2:15 PM — 4:00 PM SEC LL2.224

Tom Goldstein - Dataset security issues in generative AI

Machine learning systems are built using large troves of training data that may contain private or copyrighted content. In this talk, …

Sep 8, 2023 2:15 PM — 4:00 PM SEC LL2.224

Video

Yann LeCun - Towards Machines that can Learn, Reason, and Plan.

How could machines learn as efficiently as humans and animals? How could machines learn how the world works and acquire common sense? …

May 23, 2023 2:00 PM — 3:45 PM SEC 1.413

Video

Timothy Lillicrap - Model-based reinforcement learning and the future of language models

Large language models are capable of an incredible array of tasks. Language models are pre-trained on large amounts of text data from …

May 19, 2023 2:00 PM — 3:45 PM SEC 1.413

Yejin Choi - Common Sense: the Dark Matter of Language and Intelligence

Scale appears to be the winning recipe in today’s leaderboards. And yet, extreme-scale neural models are (un)surprisingly brittle …

May 12, 2023 2:15 PM — 4:00 PM SEC 1.413

Video

See all talks

Seminar Calendar

Below is the calendar of events in the Kempner ML Foundations seminar. Join the mailing list for talk announcements.

Opportunities

We are looking for undergraduate researchers, graduate students and postdocs in the ML foundations group.

For undergraduate students, we are only able to work with students at Harvard or MIT (with preference to the former). If you are a Harvard or MIT student interested in collaborating, informally or formally, with us, please fill out the following google form. Students might also be interested in taking Boaz’s Spring 2023 seminar on the foundations of deep learning.

For graduate students we have openings in Computer Science, Electrical Engineering,applied mathematics or statistics degrees. New: Kempner Institute Graduate Fellowship: See more details here

If you are applying for graduate studies in CS and are interested in machine learning foundations, please mark both “Machine Learning” and “Theory of Computation” as areas of interest. Please also list the names of faculty you want to work with on your application. ML foundations group faculty include Demba Ba (Electrical Engineering and Bioengineering), David Alvarez-Melis, Boaz Barak, Sitan Chen, Jonathan Frankle, Sham Kakade (Computer Science), Cengiz Pehlevan (Applied Mathematics), and Lucas Janson (Statistics). There are also ML foundations affiliated faculty in all of the above departments and more. All of us are also open to the possibilities of co-advising students, including across different departments and schools.

Postdoc opportunities for 2024-2025 Academic year:

There are a number of opportunities at Harvard for postdoc positions. Applying to multiple positions is not just allowed but encouraged, and we urge you to apply to any of those that are of interest to you.

Kempner Institute Fellows - a three-year prestigious position for postdocs in AI/ML/related areas interested in “fundamentally advancing our understanding of natural and artificial intelligence.” Apply by October 9 2023
Computer science postdocs Postdocs in the ML foundations, Rabin Fellowship, Privacy Tools, Theory of Society. Coming soon.
Postdoctoral positions at Harvard Data Science initiative
The George F. Carrier Postdoctoral Fellowship in Applied Mathematics.
Postdoctoral fellow in theoretical and computational neuroscience at the Swartz Program
Center for Research on Computation and Society (CRCS) postdoc position
Postdoc positions at the Materials Intelligence Group