I am an Assistant Professor of Applied Mathematics and of Computer Science at Harvard. My research focuses on exploiting geometric structure in data to design efficient Machine Learning and Optimization methods.
In 2021–2022, I was a Hooke Research Fellow at the Mathematical Institute in Oxford and a Nicolas Kurti Junior Research Fellow at Brasenose College. In Fall 2021, I was a Research Fellow at the Simons Institute in Berkeley, where I participated in the program Geometric Methods for Optimization and Sampling. Previously, I received my PhD from Princeton University (2021) under the supervision of Charles Fefferman, held visiting positions at MIT and the Max Planck Institute for Mathematics in the Sciences, and interned in the research labs of Facebook, Google, and Microsoft. I am also interested in applications of Artificial Intelligence in the legal space and am the Chief Scientist of the startup Claudius Legal Intelligence.
PhD in Applied Mathematics, 2021
Princeton University
BSc/MSc in Mathematics and Physics, 2016
University of Leipzig
MSc in Applied Mathematics (during year abroad), 2015
University of Washington
We study projection-free methods for constrained Riemannian optimization. In particular, we propose a Riemannian Frank-Wolfe (RFW) method that handles constraints directly, in contrast to prior methods that rely on (potentially costly) projections. We analyze non-asymptotic convergence rates of RFW to an optimum for geodesically convex problems, and to a critical point for nonconvex objectives. We also present a practical setting under which RFW can attain a linear convergence rate. As a concrete example, we specialize RFW to the manifold of positive definite matrices and apply it to two tasks: (i) computing the matrix geometric mean (Riemannian centroid); and (ii) computing the Bures-Wasserstein barycenter. Both tasks involve geodesically convex interval constraints, for which we show that the Riemannian “linear” oracle required by RFW admits a closed-form solution; this result may be of independent interest. We complement our theoretical results with an empirical comparison of RFW against state-of-the-art Riemannian optimization methods, and observe that RFW performs competitively on the task of computing Riemannian centroids.
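To make the RFW iteration concrete, here is a minimal numerical sketch for the Riemannian centroid task on the manifold of positive definite matrices with the affine-invariant metric. The function names are illustrative choices, and the finite candidate set is a stand-in for the feasible region; the sketch does not reproduce the paper's closed-form oracle for interval constraints.

```python
import numpy as np

def _powm_spd(M, p):
    # Fractional power of a symmetric positive definite matrix.
    w, V = np.linalg.eigh(M)
    return (V * w**p) @ V.T

def spd_log(X, Y):
    # Riemannian log map at X (affine-invariant metric):
    # Log_X(Y) = X^{1/2} logm(X^{-1/2} Y X^{-1/2}) X^{1/2}.
    Xh, Xih = _powm_spd(X, 0.5), _powm_spd(X, -0.5)
    w, V = np.linalg.eigh(Xih @ Y @ Xih)
    return Xh @ ((V * np.log(w)) @ V.T) @ Xh

def spd_geodesic(X, Z, t):
    # Point at parameter t on the geodesic from X (t=0) to Z (t=1):
    # gamma(t) = X^{1/2} (X^{-1/2} Z X^{-1/2})^t X^{1/2}.
    Xh, Xih = _powm_spd(X, 0.5), _powm_spd(X, -0.5)
    return Xh @ _powm_spd(Xih @ Z @ Xih, t) @ Xh

def rfw_centroid(As, X0, candidates, n_iters=100):
    """Toy RFW loop for the Riemannian centroid of SPD matrices `As`.

    `candidates` is an illustrative stand-in for the feasible set: we pick
    the candidate minimizing the linearized objective, rather than using
    the closed-form interval-constraint oracle derived in the paper.
    """
    X = X0
    for k in range(n_iters):
        # Riemannian gradient of f(X) = sum_i d(X, A_i)^2 under the
        # affine-invariant metric: grad f(X) = -2 sum_i Log_X(A_i).
        grad = -2 * sum(spd_log(X, A) for A in As)
        Xi = np.linalg.inv(X)
        # "Linear" oracle: minimize <grad f(X), Log_X(Z)>_X over feasible Z,
        # where the metric inner product is <U, V>_X = tr(X^-1 U X^-1 V).
        scores = [np.trace(Xi @ grad @ Xi @ spd_log(X, Z)) for Z in candidates]
        Z_k = candidates[int(np.argmin(scores))]
        # Classical Frank-Wolfe step size, moving along the geodesic to Z_k.
        X = spd_geodesic(X, Z_k, 2.0 / (k + 2.0))
    return X
```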
We study geodesically convex (g-convex) problems that can be written as a difference of Euclidean convex functions. This structure arises in several optimization problems in statistics and machine learning, e.g., for matrix scaling, M-estimators for covariances, and Brascamp-Lieb inequalities. Our work offers efficient algorithms that, on the one hand, exploit g-convexity to ensure global optimality along with guarantees on iteration complexity. On the other hand, the split structure permits us to develop Euclidean Majorization-Minimization algorithms that help us bypass the need to compute expensive Riemannian operations such as exponential maps and parallel transport. We illustrate our results by specializing them to a few concrete optimization problems that have been previously studied in the machine learning literature. Ultimately, we hope our work helps motivate the broader search for mixed Euclidean-Riemannian optimization algorithms.
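As a minimal illustration of the Majorization-Minimization (CCCP-style) step that such a difference-of-convex split enables, consider a toy, purely Euclidean instance f = g − h with g and h convex. The function names and the specific choices of g and h below are illustrative, not taken from the paper.

```python
import numpy as np

def cccp(grad_h, argmin_surrogate, x0, n_iters=50):
    """Generic difference-of-convex iteration for f = g - h (g, h convex).

    Each step replaces -h by its linearization at x_k (which majorizes f,
    since h lies above its tangents) and minimizes the convex surrogate:
        x_{k+1} = argmin_x  g(x) - <grad h(x_k), x>.
    """
    x = x0
    for _ in range(n_iters):
        x = argmin_surrogate(grad_h(x))
    return x

# Toy instance (illustrative choice): g(x) = 0.5 ||x||^2 and
# h(x) = log-sum-exp(x), both convex. Minimizing the surrogate
# 0.5 ||x||^2 - <c, x> gives x = c, so each step is x_{k+1} = softmax(x_k).
def grad_h(x):
    e = np.exp(x - x.max())   # numerically stable softmax = grad of log-sum-exp
    return e / e.sum()

x_star = cccp(grad_h, lambda c: c, np.random.randn(5))
```

Note that the loop uses only Euclidean operations: no exponential maps or parallel transports appear, which is the computational point of the split structure.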
We exhibit optimal control strategies for a simple toy problem in which the underlying dynamics depend on a parameter that is initially unknown and must be learned. We consider a cost function posed over a finite time interval, in contrast to much previous work that considers asymptotics as the time horizon tends to infinity. We study several different versions of the problem, including Bayesian control, in which we assume a prior distribution on the unknown parameter; and “agnostic” control, in which we assume nothing about the unknown parameter. For the agnostic problems, we compare our performance with that of an opponent who knows the value of the parameter. This comparison gives rise to several notions of “regret,” and we obtain strategies that minimize the “worst-case regret” arising from the most unfavorable choice of the unknown parameter. In every case, the optimal strategy turns out to be a Bayesian strategy or a limit of Bayesian strategies.
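For concreteness, one standard way to formalize the worst-case regret described above (the notation here is illustrative, not the paper's): write J(σ, a) for the expected cost of a strategy σ over the finite horizon when the unknown parameter equals a, and J*(a) = inf_σ J(σ, a) for the best cost attainable with foreknowledge of a. A worst-case-regret-optimal strategy then solves

```latex
\[
  \mathrm{Regret}(\sigma, a) = J(\sigma, a) - J^{*}(a),
  \qquad
  \sigma^{\star} \in \arg\min_{\sigma} \, \sup_{a} \, \mathrm{Regret}(\sigma, a).
\]
```

This additive form is only one of the several regret notions alluded to above; the minimax structure is what pits the controller against the most unfavorable choice of the unknown parameter.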
We introduce Forman-Ricci curvature and its corresponding flow as edge-based characteristics for complex networks, aiming to extend the common node-based approach to network analysis. Following a theoretical introduction and mathematical motivation, we apply the proposed network-analytic methods to static and dynamic complex networks and compare the results with established node-based characteristics. Our work suggests a number of applications for data mining, including denoising and clustering of experimental data, as well as extrapolation of network evolution.
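For intuition: on an unweighted graph (unit edge and node weights, no higher-order cells), Forman's curvature of an edge (u, v) reduces to the degree formula F(u, v) = 4 − deg(u) − deg(v). A short sketch, with function names of my own choosing, computes it with networkx:

```python
import networkx as nx

def forman_curvature(G):
    """Forman-Ricci curvature of each edge of an unweighted graph.

    With unit weights, Forman's formula simplifies to
    F(u, v) = 4 - deg(u) - deg(v) for each edge (u, v).
    """
    return {(u, v): 4 - G.degree(u) - G.degree(v) for u, v in G.edges()}

G = nx.karate_club_graph()
curv = forman_curvature(G)
# Edges with very negative curvature tend to act as bridges between
# densely connected regions, which is what makes the curvature useful
# for denoising and clustering.
print(sorted(curv.items(), key=lambda kv: kv[1])[:5])
```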