Noam Razin
Noam Razin
News
Publications
Talks
Blog Posts
Teaching
Publications
* denotes equal contribution.
Type
Conference paper
Preprint
Report
Thesis
Date
2025
2024
2023
2022
2021
2020
What Makes a Reward Model a Good Teacher? An Optimization Perspective
Noam Razin
,
Zixuan Wang
,
Hubert Strauss
,
Stanley Wei
,
Jason D. Lee
,
Sanjeev Arora
arXiv:2503.15477, 2025
PDF
Cite
Code
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
Noam Razin
,
Sadhika Malladi
,
Adithya Bhaskar
,
Danqi Chen
,
Sanjeev Arora
,
Boris Hanin
International Conference on Learning Representations (ICLR), 2025
PDF
Cite
Code
Poster
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
Yonatan Slutzky*
,
Yotam Alexander*
,
Noam Razin
,
Nadav Cohen
arXiv:2410.10473, 2024
PDF
Cite
Code
Understanding Deep Learning via Notions of Rank
Noam Razin
arXiv:2408.02111 (PhD thesis), 2024
PDF
Cite
Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
Noam Razin*
,
Yotam Alexander*
,
Edo Cohen-Karlik
,
Raja Giryes
,
Amir Globerson
,
Nadav Cohen
International Conference on Machine Learning (ICML), 2024
PDF
Cite
Code
Poster
Vanishing Gradients in Reinforcement Finetuning of Language Models
Noam Razin
,
Hattie Zhou
,
Omid Saremi
,
Vimal Thilak
,
Arwen Bradley
,
Preetum Nakkiran
,
Joshua Susskind
,
Etai Littwin
International Conference on Learning Representations (ICLR), 2024
PDF
Cite
Code
Poster
What Algorithms Can Transformers Learn? A Study in Length Generalization
Hattie Zhou
,
Arwen Bradley
,
Etai Littwin
,
Noam Razin
,
Omid Saremi
,
Joshua Susskind
,
Samy Bengio
,
Preetum Nakkiran
International Conference on Learning Representations (ICLR), 2024
PDF
Cite
Code
Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning
Nadav Cohen
,
Noam Razin
arXiv:2408.13767, 2024
PDF
Cite
What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
Yotam Alexander*
,
Nimrod De La Vega*
,
Noam Razin
,
Nadav Cohen
Advances in Neural Information Processing Systems (NeurIPS), 2023
PDF
Cite
Code
On the Ability of Graph Neural Networks to Model Interactions Between Vertices
Noam Razin
,
Tom Verbin
,
Nadav Cohen
Advances in Neural Information Processing Systems (NeurIPS), 2023
PDF
Cite
Code
Poster
Video
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
Noam Razin
,
Asaf Maman
,
Nadav Cohen
International Conference on Machine Learning (ICML), 2022
PDF
Cite
Code
Poster
Video
Blog
Implicit Regularization in Tensor Factorization
Noam Razin*
,
Asaf Maman*
,
Nadav Cohen
International Conference on Machine Learning (ICML), 2021
PDF
Cite
Code
Poster
Video
Blog
Implicit Regularization in Deep Learning May Not Be Explainable by Norms
Noam Razin
,
Nadav Cohen
Advances in Neural Information Processing Systems (NeurIPS), 2020
PDF
Cite
Code
Poster
Blog
RecoBERT: A Catalog Language Model for Text-Based Recommendations
Itzik Malkiel
,
Oren Barkan
,
Avi Caciularu
,
Noam Razin
,
Ori Katz
,
Noam Koenigstein
Findings of the Association for Computational Linguistics: EMNLP, 2020
PDF
Cite
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding
Oren Barkan*
,
Noam Razin*
,
Itzik Malkiel
,
Ori Katz
,
Avi Caciularu
,
Noam Koenigstein
AAAI Conference on Artificial Intelligence (AAAI), 2020
PDF
Cite
Code
Poster
Cite
×