Variance of Policy Gradient
Our preprint on Analyzing the variance of policy gradient estimators for LQR was accepted at the OptRL NeurIPS workshop.
Implicit Gradient Transport
Our paper on Reducing the variance in online optimization by transporting past gradients was accepted at NeurIPS as a spotlight contribution.
[ArXiv, pdf, website, code]
Our submission to the PyTorch Summer Hackathon won best in show! Check out the website to learn how to easily implement meta-learning algorithms with learn2learn.
East European Summer School
I will be attending the East-European Summer School this summer. Get in touch if you will too!