Publications

* denotes equal contribution.

What Makes a Reward Model a Good Teacher? An Optimization Perspective
Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
Understanding Deep Learning via Notions of Rank
Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
Vanishing Gradients in Reinforcement Finetuning of Language Models
What Algorithms Can Transformers Learn? A Study in Length Generalization
Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning
What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
On the Ability of Graph Neural Networks to Model Interactions Between Vertices
Implicit Regularization in Hierarchical Tensor Factorization and Deep Convolutional Neural Networks
Implicit Regularization in Tensor Factorization
Implicit Regularization in Deep Learning May Not Be Explainable by Norms
RecoBERT: A Catalog Language Model for Text-Based Recommendations
Scalable Attentive Sentence-Pair Modeling via Distilled Sentence Embedding