Martin Marek

I am a PhD student in Machine Learning at NYU, working with Andrew Gordon Wilson and Pavel Izmailov. My research focuses on empirical contributions to the science of deep learning.
Previously I studied Statistics and Machine Learning at UCL and interned as a Quant Researcher at QRT and ML Engineer at Snapchat.
Email me: martin.m@nyu.edu
Links
Personal projects
Selected papers
-
Small Batch Size Training for Language Models:
When Vanilla SGD Works, and Why Gradient Accumulation Is Wasteful
NeurIPS, 2025
[PDF, GitHub, Twitter] -
Can a Confident Prior Replace a Cold Posterior?
AABI workshop, 2024
[PDF, GitHub, Twitter]
Website design inspired by Pavel Izmailov.