Reza Asad

I'm a PhD candidate at Simon Fraser University, co-advised by Sharan Vaswani and Manolis Savva. Previously, I worked as a data scientist in the San Francisco Bay Area for three years. Before that, I was a research and teaching assistant at the University of British Columbia in Applied Mathematics. I hold a Bachelor's degree in Pure Mathematics from the University of Toronto.

Email  /  Scholar  /  X  /  Github

profile photo

Research

I'm interested in optimization in ML and RL. My recent work focuses on designing and analyzing RL objectives based on mirror descent principles in the on-policy setting, as well as studying their off-policy variants with entropy regularization. More recently, I have been exploring applications of RL to training LLMs and am actively learning more about this area.

Papers

Revisiting Actor-Critic Methods in Discrete Action Off-Policy Reinforcement Learning
Reza Asad, Reza Babanezhad, Sharan Vaswani
NeurIPS Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET), 2025.

Optimistic Actor-Critic with Parametric Policies: Unifying Sample Efficiency and Practicality
Max Qiushi Lin, Reza Asad, Kevin Tan, Haque Ishfaq, Csaba Szepesvári, Sharan Vaswani
NeurIPS Workshop on Aligning Reinforcement Learning Experimentalists and Theorists (ARLET), 2025.

Fast Convergence of Softmax Policy Mirror Ascent
Reza Asad, Reza Babanezhad, Issam Laradji, Nicolas Le Roux, Sharan Vaswani
AISTATS, 2025

Surrogate Minimization: An Optimization Algorithm for Training Large Neural Networks with Model Parallelism
Reza Asad, Reza Babanezhad, Issam Laradji, Nicolas Le Roux, Sharan Vaswani
Neurips Workshop on Optimization for Machine Learning, 2023

3DSSR: 3D Subscene Retrieval
Reza Asad, Manolis Savva
CVPR 2023 Workshop on Structural and Compositional Learning on 3D Data (spotlight), 2023

Steiner symmetrization along a certain equidistributed sequence of directions
Reza Asad, Almut Burchard
Arxiv, 2020

Cloudmaskgan: A content-aware unpaired image-to-image translation algorithm for remote sensing imagery
Sorour Mohajerani, Reza Asad, Kumar Abhishek2, Neha Sharma2, Alysha van Duynhoven3, Parvaneh Saeedi
ICIP, 2019

Embedded eigenvalues and the nonlinear Schrödinger equation
Reza Asad, Gideon Simpson
Journal of Mathematical Physics, 2011