Citation copied to clipboard
Incremental Learning of Sparse Attention Patterns in Transformers
In ICML International Conference on Machine Learning , 2026
Training Dynamics
Induction Heads Interpolate N-Grams
Francesco D'Angelo*Equal contribution, Oğuz Kaan Yüksel*Equal contribution, Swathi Shree Narashiman, Nicolas Flammarion
In ICML International Conference on Machine Learning , 2026
Mechanistic Interpretability
Generalization Bounds for Autoregressive Processes and In-Context Learning
Oğuz Kaan Yüksel, Nicolas Flammarion
In PriGM@EurIPS Workshop on Principles of Generative Modeling , 2025
Generalization
Incremental Learning of Sparse Attention Patterns in Transformers
In PriGM@EurIPS Workshop on Principles of Generative Modeling , 2025
Training Dynamics
On the Sample Complexity of Next-Token Prediction
Oğuz Kaan Yüksel, Nicolas Flammarion
In AISTATS Annual Conference on Artificial Intelligence and Statistics , 2025
Generalization
Long-Context Linear System Identification
Oğuz Kaan Yüksel, Mathieu Even, Nicolas Flammarion
In ICLR International Conference on Learning Representations , 2025
Generalization
First-order ANIL provably learns representations despite overparametrization
Oğuz Kaan Yüksel, Etienne Boursier, Nicolas Flammarion
In ICLR International Conference on Learning Representations , 2024
Meta Learning Training Dynamics
LatentCLR: A Contrastive Learning Approach for Unsupervised Discovery of Interpretable Directions
Oğuz Kaan Yüksel*Equal contribution, Enis Simsar*Equal contribution, Ezgi Gülperi Er, Pınar Yanardağ
In ICCV Proceedings of the IEEE International Conference on Computer Vision , 2021
Computer Vision
Semantic Perturbations with Normalizing Flows for Improved Generalization
In ICCV Proceedings of the IEEE International Conference on Computer Vision , 2021
Computer Vision
Semantic Perturbations with Normalizing Flows for Improved Generalization
In INNF+ Workshop, ICML ICML Workshop on Invertible Neural Networks, Normalizing Flows, and Explicit Likelihood Models , 2021
Computer Vision
[Re] Can gradient clipping mitigate label noise?
David Mizrahi, Oğuz Kaan Yüksel, Aiday Marlen Kyzy
In ReScience C ReScience C , 2021
Regularization
Adversarial Training with Orthogonal Regularization
Oğuz Kaan Yüksel, İnci Meliha Baytaş
In SIU Signal Processing and Communications Applications Conference , 2020
Adversarial Training Regularization
We propose orthogonal regularization during adversarial training to improve robustness. By encouraging weight matrices to stay orthogonal, the model learns more stable and transferable representations under attack.
No matching items