Noam Razin

Postdoctoral Fellow

Princeton Language and Intelligence, Princeton University

    Email: noamrazin (at) princeton.edu

I am a Postdoctoral Fellow at Princeton Language and Intelligence (PLI). My research focuses on the fundamentals of deep learning. By combining mathematical and empirical analyses, I aim to develop theories that shed light on how deep learning works, identify potential failures, and yield principled methods for improving efficiency, reliability, and performance. Most recently, I have been working on language model post-training, including reinforcement learning and preference optimization approaches.

  • In [1], we identify a connection between reward variance and the flatness of the reinforcement learning objective landscape. Building on this, [2] provides an optimization perspective on what makes a good reward model for RLHF.
  • In [3], we investigate why language models are often poor implicit reward models, and show that they tend to rely on superficial token-level cues.
  • In [4], we characterize the causes of likelihood displacement, the counter-intuitive phenomenon where preference optimization decreases the probability of preferred responses (instead of increasing it as intended). We demonstrate that likelihood displacement can cause surprising alignment failures and provide guidelines for preventing it.

My work is supported in part by a Zuckerman Postdoctoral Scholarship. Previously, I obtained my PhD in Computer Science at Tel Aviv University, where I was fortunate to be advised by Nadav Cohen. During my PhD, I interned at Apple Machine Learning Research and the Microsoft Recommendations Team, and received the Apple Scholars in AI/ML fellowship and a Tel Aviv University Center for AI & Data Science fellowship.

🗞 News

  • Jul 25: New paper on why language models are often poor implicit reward models — they tend to rely on superficial token-level cues.

  • Mar 25: New paper provides an optimization perspective on what makes a good reward model for RLHF. In particular, we establish that more accurate reward models are not necessarily better teachers!

  • Jan 25: Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization accepted to ICLR 2025.

  • Oct 24: New paper proving that the implicit bias of state space models (SSMs) can be poisoned with clean labels.

  • Oct 24: Honored to receive the Zuckerman and Israeli Council for Higher Education Postdoctoral Scholarships.

  • Sep 24: Joined Princeton Language and Intelligence as a Postdoctoral Fellow.

  • Aug 24: New lecture notes on the theory (and surprising practical applications) of linear neural networks.

📄 Publications

* denotes equal contribution

Why is Your Language Model a Poor Implicit Reward Model?
What Makes a Reward Model a Good Teacher? An Optimization Perspective
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
Understanding Deep Learning via Notions of Rank
Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
Vanishing Gradients in Reinforcement Finetuning of Language Models
What Algorithms Can Transformers Learn? A Study in Length Generalization
Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning
What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
On the Ability of Graph Neural Networks to Model Interactions Between Vertices
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
Implicit Regularization in Tensor Factorization
Implicit Regularization in Deep Learning May Not Be Explainable by Norms
RecoBERT: A Catalog Language Model for Text-Based Recommendations
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding

💬 Selected Talks

  • Understanding and Overcoming Pitfalls in Language Model Alignment
    MPI MiS + UCLA Math Machine Learning Seminar, July 2025
    Slides

  • Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
    Deep Learning: Classics and Trends Seminar, January 2025
    Slides

  • Analyses of Policy Gradient for Language Model Finetuning and Optimal Control
    MPI MiS + UCLA Math Machine Learning Seminar, March 2024
    Video · Slides

  • Two Analyses of Modern Deep Learning: Graph Neural Networks and Language Model Finetuning
    Princeton Alg-ML Seminar, December 2023
    Slides

  • On the Ability of Graph Neural Networks to Model Interactions Between Vertices
    Learning on Graphs and Geometry Reading Group, January 2023
    Video · Slides

  • Generalization in Deep Learning Through the Lens of Implicit Rank Lowering
    ICTP Youth in High-Dimensions: Recent Progress in Machine Learning, High-Dimensional Statistics and Inference, June 2022
    Video · Slides

  • Generalization in Deep Learning Through the Lens of Implicit Rank Lowering
    MPI MiS + UCLA Math Machine Learning Seminar, May 2022
    Slides

  • Implicit Regularization in Tensor Factorization
    The Hebrew University Machine Learning Club, Jerusalem, Israel, June 2021
    Video · Slides

  • Implicit Regularization in Deep Learning May Not Be Explainable by Norms
    Tel Aviv University Machine Learning Seminar, Tel Aviv, Israel, May 2020
    Slides

👨‍🏫 Teaching

  • Teaching Assistant for Foundations of Deep Learning (course #0368-3080), Tel Aviv University, 2021 to 2023