Publications

(2023). Understanding Incremental Learning of Gradient Descent -- A Fine-grained Analysis of Matrix Sensing. arXiv preprint arXiv:2301.11500.

(2022). Minimax Optimal Kernel Operator Learning via Multilevel Training. In ICLR 2023 (spotlight).

(2022). Why Robust Generalization in Deep Learning is Difficult: Perspective of Expressive Power. arXiv preprint arXiv:2205.13863, accepted by NeurIPS 2022.

(2021). Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis. In NeurIPS 2021.

(2020). Improved Analysis of Clipping Algorithms for Non-convex Optimization. In NeurIPS 2020.
